1
|
Silva MKDP, Nicoleti VYU, Rodrigues BDPP, Araujo ASF, Ellwanger JH, de Almeida JM, Lemos LN. Exploring deep learning in phage discovery and characterization. Virology 2025; 609:110559. [PMID: 40359589 DOI: 10.1016/j.virol.2025.110559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2024] [Revised: 03/24/2025] [Accepted: 04/28/2025] [Indexed: 05/15/2025]
Abstract
Bacteriophages, or bacterial viruses, play diverse ecological roles by shaping bacterial populations and also hold significant biotechnological and medical potential, including the treatment of infections caused by multidrug-resistant bacteria. The discovery of novel bacteriophages using large-scale metagenomic data has been accelerated by the accessibility of deep learning (Artificial Intelligence), the increased computing power of graphical processing units (GPUs), and new bioinformatics tools. This review addresses the recent revolution in bacteriophage research, ranging from the adoption of neural network algorithms applied to metagenomic data to the use of pre-trained language models, such as BERT, which have improved the reconstruction of viral metagenome-assembled genomes (vMAGs). This article also discusses the main aspects of bacteriophage biology using deep learning, highlighting the advances and limitations of this approach. Finally, prospects of deep-learning-based metagenomic algorithms and recommendations for future investigations are described.
Collapse
Affiliation(s)
| | - Vitória Yumi Uetuki Nicoleti
- Ilum School of Science, Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, São Paulo, Brazil.
| | | | | | - Joel Henrique Ellwanger
- Laboratory of Immunobiology and Immunogenetics, Department of Genetics, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil.
| | - James Moraes de Almeida
- Ilum School of Science, Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, São Paulo, Brazil.
| | - Leandro Nascimento Lemos
- Ilum School of Science, Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, São Paulo, Brazil.
| |
Collapse
|
2
|
Xing J, Han R, Zhao J, Zhang Y, Zhang M, Zhang Y, Zhang H, Nang SC, Zhai Y, Yuan L, Wang S, Wu H. Revisiting therapeutic options against resistant klebsiella pneumoniae infection: Phage therapy is key. Microbiol Res 2025; 293:128083. [PMID: 39904002 DOI: 10.1016/j.micres.2025.128083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2024] [Revised: 01/10/2025] [Accepted: 01/23/2025] [Indexed: 02/06/2025]
Abstract
Multi-drug resistant and carbapenem-resistant hypervirulent Klebsiella pneumoniae strains are spreading globally at an alarming rate, emerging as one of the most serious threats to global public health. The formidable challenges posed by the current arsenal of antimicrobials highlight the urgent need for novel strategies to combat K. pneumoniae infections. This review begins with a comprehensive analysis of the global dissemination of virulence factors and critical resistance profiles in K. pneumoniae, followed by an evaluation of the accessibility of novel therapeutic approaches for treating K. pneumoniae in clinical settings. Among these, phage therapy stands out for its considerable potential in addressing life-threatening K. pneumoniae infections. We critically examine the existing preclinical and clinical evidence supporting phage therapy, identifying key limitations that impede its broader clinical adoption. Additionally, we rigorously explore the role of genetic engineering in expanding the host range of K. pneumoniae phages, and discuss the future trajectory of this technology. In light of the 'Bad Bugs, No Drugs' era, we advocate leveraging artificial intelligence and deep learning to optimize and expand the application of phage therapy, representing a crucial advancement in the fight against the escalating threat of K. pneumoniae infections.
Collapse
Affiliation(s)
- Jiabao Xing
- Department of Pharmacology and Toxicology, College of Veterinary Medicine, Henan Agricultural University, Zhengzhou, China
| | - Rongjia Han
- Department of Pharmacology and Toxicology, College of Veterinary Medicine, Henan Agricultural University, Zhengzhou, China
| | - Jinxin Zhao
- Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia
| | - Yuying Zhang
- Department of Pharmacology and Toxicology, College of Veterinary Medicine, Henan Agricultural University, Zhengzhou, China
| | - Meng Zhang
- Department of Pulmonary and Critical Care Medicine, Shanghai Fifth People's Hospital, Fudan University, Shanghai, China
| | - Yichao Zhang
- Department of Pharmacology and Toxicology, College of Veterinary Medicine, Henan Agricultural University, Zhengzhou, China
| | - Hang Zhang
- Department of Pharmacology and Toxicology, College of Veterinary Medicine, Henan Agricultural University, Zhengzhou, China
| | - Sue C Nang
- Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia
| | - Yajun Zhai
- Department of Pharmacology and Toxicology, College of Veterinary Medicine, Henan Agricultural University, Zhengzhou, China
| | - Li Yuan
- Department of Pharmacology and Toxicology, College of Veterinary Medicine, Henan Agricultural University, Zhengzhou, China
| | - Shanmei Wang
- Department of Microbiology Laboratory, Henan Provincial People's Hospital, Zhengzhou, China.
| | - Hua Wu
- Department of Pharmacology and Toxicology, College of Veterinary Medicine, Henan Agricultural University, Zhengzhou, China; Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia.
| |
Collapse
|
3
|
Xiao Z, Sun H, Wei A, Zhao W, Jiang X. A Novel Framework for Predicting Phage-Host Interactions via Host Specificity-Aware Graph Autoencoder. IEEE J Biomed Health Inform 2025; 29:3069-3078. [PMID: 40030240 DOI: 10.1109/jbhi.2024.3500137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2025]
Abstract
Due to the abuse of antibiotics, some pathogenic bacteria have developed resistance to most antibiotics, leading to the emergence of antibiotic-resistant superbugs. Therefore, researchers resort to phage therapy for bacterial infections. For phage therapy, the fundamental step is to accurately identify phage-host interactions. Although various methods have been proposed, the existing methods suffer from the following two shortcomings: 1) they fail to make full use of genetic information including both genome and protein sequence of phages; 2) host specificity of phages is not explicitly utilized when learning representations of phages and bacteria. In this paper, we present an efficient computational method called PHISGAE for predicting phage-host interactions, in which the host specificity is explicitly employed. Firstly, initial phage-phage connections are efficiently constructed via utilizing phage genome and protein sequence. Then, the refined heterogeneous network is derived by applying K-nearest neighbor strategy, keeping relatively more meaningful local semantics among phages and bacteria. Finally, a host specificity-aware graph autoencoder is proposed to learn high-quality representations of phages and bacteria for predicting phage-host interactions. Experimental results show that PHISGAE outperforms the state-of-the-art methods on predicting phage-host interactions at both species level and genus level (AUC values of 94.73% and 96.32%, respectively). Moreover, results of case study demonstrate that PHISGAE is able to identify candidate hosts with high probability for previously unseen phages identified from metagenomics, effectively predicting potential phage-host interactions in real-world applications.
Collapse
|
4
|
Peng H, Chen IA, Qimron U. Engineering Phages to Fight Multidrug-Resistant Bacteria. Chem Rev 2025; 125:933-971. [PMID: 39680919 PMCID: PMC11758799 DOI: 10.1021/acs.chemrev.4c00681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2024] [Revised: 12/04/2024] [Accepted: 12/09/2024] [Indexed: 12/18/2024]
Abstract
Facing the global "superbug" crisis due to the emergence and selection for antibiotic resistance, phages are among the most promising solutions. Fighting multidrug-resistant bacteria requires precise diagnosis of bacterial pathogens and specific cell-killing. Phages have several potential advantages over conventional antibacterial agents such as host specificity, self-amplification, easy production, low toxicity as well as biofilm degradation. However, the narrow host range, uncharacterized properties, as well as potential risks from exponential replication and evolution of natural phages, currently limit their applications. Engineering phages can not only enhance the host bacteria range and improve phage efficacy, but also confer new functions. This review first summarizes major phage engineering techniques including both chemical modification and genetic engineering. Subsequent sections discuss the applications of engineered phages for bacterial pathogen detection and ablation through interdisciplinary approaches of synthetic biology and nanotechnology. We discuss future directions and persistent challenges in the ongoing exploration of phage engineering for pathogen control.
Collapse
Affiliation(s)
- Huan Peng
- Cellular
Signaling Laboratory, International Research Center for Sensory Biology
and Technology of MOST, Key Laboratory of Molecular Biophysics of
MOE, College of Life Science and Technology, Huazhong University of Science and Technology, 430074 Wuhan, Hubei China
| | - Irene A. Chen
- Department
of Chemical and Biomolecular Engineering, Department of Chemistry
and Biochemistry, University of California
Los Angeles, Los Angeles, California 90095-1592, United States
| | - Udi Qimron
- Department
of Clinical Microbiology and Immunology, School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| |
Collapse
|
5
|
Feng Y, Lu X, Zhao J, Li H, Xu J, Li Z, Wang M, Peng Y, Tian T, Yuan G, Zhang Y, Liu J, Zhang M, Zhu La ALT, Qu G, Mu Y, Guo W, Wu Y, Zhang Y, Wang D, Hu Y, Kan B. Regional antimicrobial resistance gene flow among the One Health sectors in China. MICROBIOME 2025; 13:3. [PMID: 39763003 PMCID: PMC11705761 DOI: 10.1186/s40168-024-01983-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Accepted: 11/19/2024] [Indexed: 01/11/2025]
Abstract
BACKGROUND Antimicrobial resistance poses a significant threat to global health, with its spread intricately linked across human, animal, and environmental sectors. Revealing the antimicrobial resistance gene (ARG) flow among the One Health sectors is essential for better control of antimicrobial resistance. RESULTS In this study, we investigated regional ARG transmission among humans, food, and the environment in Dengfeng, Henan Province, China by combining large-scale metagenomic sequencing with culturing of resistant bacterial isolates in 592 samples. A total of 40 ARG types and 743 ARG subtypes were identified, with a predominance of multidrug resistance genes. Compared with microbes from human fecal samples, those from food and environmental samples showed a significantly higher load of ARGs. We revealed that dietary habits and occupational exposure significantly affect ARG abundance. Pseudomonadota, particularly Enterobacteriaceae, were identified as the main ARG carriers shaping the resistome. The resistome in food samples was found more affected by mobile genetic elements (MGEs), whereas in environmental samples, it was more associated with the microbial composition. We evidenced that horizontal gene transfer (HGT) mediated by plasmids and phages, together with strain transmission, particularly those associated with the Enterobacteriaceae members, drive regional ARG flow. Lifestyle, dietary habits, and occupational exposure are all correlated with ARG dissemination and flies and food are important potential sources of ARGs to humans. The widespread mobile carbapenemase gene, OXA-347, carried by non-Enterobacteriaceae bacteria in the human gut microbiota, requires particular attention. Finally, we showed that machine learning models based on microbiome profiles were effective in predicting the presence of carbapenem-resistant strains, suggesting a valuable approach for AMR surveillance. CONCLUSIONS Our study provides a full picture of regional ARG transmission among the One Health sectors in a county-level city in China, which facilitates a better understanding of the complex routes of ARG transmission and highlights new points of focus for AMR surveillance and control. Video Abstract.
Collapse
Affiliation(s)
- Yuqing Feng
- State Key Laboratory of Animal Nutrition and Feeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Xin Lu
- National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, 102206, China.
| | - Jiayong Zhao
- Institute of Infectious Disease Prevention and Control, Henan Center for Disease Control and Prevention, Zhengzhou, 450016, China
| | - Hongmin Li
- Dengfeng Center for Disease Control and Prevention, Dengfeng, Zhengzhou, 452470, China
| | - Jialiang Xu
- School of Light Industry Science and Engineering, Beijing Technology and Business University, Beijing, 100048, China
| | - Zhenpeng Li
- National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, 102206, China
| | - Mengyu Wang
- National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, 102206, China
- School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, Shandong, China
| | - Yao Peng
- National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, 102206, China
| | - Tian Tian
- Dengfeng Center for Disease Control and Prevention, Dengfeng, Zhengzhou, 452470, China
| | - Gailing Yuan
- Dengfeng Center for Disease Control and Prevention, Dengfeng, Zhengzhou, 452470, China
| | - Yuan Zhang
- National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, 102206, China
| | - Jiaqi Liu
- National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, 102206, China
- School of Light Industry Science and Engineering, Beijing Technology and Business University, Beijing, 100048, China
| | - Meihong Zhang
- State Key Laboratory of Animal Nutrition and Feeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - A La Teng Zhu La
- State Key Laboratory of Animal Nutrition and Feeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Geruo Qu
- National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, 102206, China
- School of Light Industry Science and Engineering, Beijing Technology and Business University, Beijing, 100048, China
| | - Yujiao Mu
- Institute of Infectious Disease Prevention and Control, Henan Center for Disease Control and Prevention, Zhengzhou, 450016, China
| | - Wanshen Guo
- Institute of Infectious Disease Prevention and Control, Henan Center for Disease Control and Prevention, Zhengzhou, 450016, China
| | - Yongning Wu
- NHC Key Laboratory of Food Safety Risk Assessment, China National Center for Food Safety Risk Assessment, Beijing, 100022, China
| | - Yuyu Zhang
- School of Light Industry Science and Engineering, Beijing Technology and Business University, Beijing, 100048, China.
| | - Dexiang Wang
- Dengfeng Center for Disease Control and Prevention, Dengfeng, Zhengzhou, 452470, China.
| | - Yongfei Hu
- State Key Laboratory of Animal Nutrition and Feeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| | - Biao Kan
- National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, 102206, China.
- School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, Shandong, China.
| |
Collapse
|
6
|
Gonzales MEM, Ureta JC, Shrestha AMS. PHIStruct: improving phage-host interaction prediction at low sequence similarity settings using structure-aware protein embeddings. Bioinformatics 2024; 41:btaf016. [PMID: 39804673 PMCID: PMC11783280 DOI: 10.1093/bioinformatics/btaf016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2024] [Revised: 12/04/2024] [Accepted: 01/10/2025] [Indexed: 02/01/2025] Open
Abstract
MOTIVATION Recent computational approaches for predicting phage-host interaction have explored the use of sequence-only protein language models to produce embeddings of phage proteins without manual feature engineering. However, these embeddings do not directly capture protein structure information and structure-informed signals related to host specificity. RESULTS We present PHIStruct, a multilayer perceptron that takes in structure-aware embeddings of receptor-binding proteins, generated via the structure-aware protein language model SaProt, and then predicts the host from among the ESKAPEE genera. Compared against recent tools, PHIStruct exhibits the best balance of precision and recall, with the highest and most stable F1 score across a wide range of confidence thresholds and sequence similarity settings. The margin in performance is most pronounced when the sequence similarity between the training and test sets drops below 40%, wherein, at a relatively high-confidence threshold of above 50%, PHIStruct presents a 7%-9% increase in class-averaged F1 over machine learning tools that do not directly incorporate structure information, as well as a 5%-6% increase over BLASTp. AVAILABILITY AND IMPLEMENTATION The data and source code for our experiments and analyses are available at https://github.com/bioinfodlsu/PHIStruct.
Collapse
Affiliation(s)
- Mark Edward M Gonzales
- Bioinformatics Lab, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila 1004, Philippines
- College of Computer Studies, De La Salle University, Manila 1004, Philippines
| | - Jennifer C Ureta
- Bioinformatics Lab, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila 1004, Philippines
- College of Computer Studies, De La Salle University, Manila 1004, Philippines
- Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC 3052, Australia
| | - Anish M S Shrestha
- Bioinformatics Lab, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila 1004, Philippines
- College of Computer Studies, De La Salle University, Manila 1004, Philippines
| |
Collapse
|
7
|
Zhang J, Shang J, Liu B, Zhu D, Li Q, Yin L, Ohore OE, Wen S, Ding C, Zhang Y, Yue Z, Zou Y. Hot spots of resistance: Transit centers as breeding grounds for airborne ARG-carrying bacteriophages. JOURNAL OF HAZARDOUS MATERIALS 2024; 480:136165. [PMID: 39418908 DOI: 10.1016/j.jhazmat.2024.136165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 09/15/2024] [Accepted: 10/12/2024] [Indexed: 10/19/2024]
Abstract
The presence of pathogenic bacteria and antibiotic resistance genes (ARGs) in urban air poses a significant threat to public health. While prevailing research predominantly focuses on the airborne transmission of ARGs by bacteria, the potential influence of other vectors, such as bacteriophages, is often overlooked. This study aims to investigate the characteristics of phages and ARGs in aerosols originating from hospitals, public transit centers, wastewater treatment plants, and landfill sites. The average abundance of ARGs carried by phages in the public transit centers was 8.81 ppm, which was 2 to 3 times higher than that at the other three sites. Additionally, the abundance of ARGs across different risk levels at this site was also significantly higher than at the other three sites. The assembled phage communities bearing ARGs in public transit centers are chiefly governed by homogeneous selection processes, likely influenced by human movement. Furthermore, observations at public transit sites revealed that the average abundance ratio of virulent phages to their hosts was 1.01, and the correlation coefficient between their auxiliary metabolic genes and hosts' metabolic genes was 0.59, which were 20 times and 3 times higher, respectively, than those of temperate phages. This suggests that virulent phages may enhance their survival by altering host metabolism, thereby aiding the dispersion of ARGs and bacterial resistance. These revelations furnish fresh insights into phage-mediated ARG transmission, offering scientific substantiation for strategies aimed at preventing and controlling resistance within aerosols.
Collapse
Affiliation(s)
- Jing Zhang
- NHC Key Laboratory of Tropical Disease Control, School of Tropical Medicine, Hainan Medical University, Haikou, Hainan 571199, China.
| | - Jiayu Shang
- Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong (SAR), China
| | - Beibei Liu
- Hainan Key Laboratory of Tropical Eco-Circular Agriculture, Environmental and Plant Protection Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; Key Laboratory of Low-carbon Green Agriculture in Tropical Region of China, Ministry of Agriculture and Rural Affairs, Environmental and Plant Protection Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; Hainan Danzhou Tropical Agro-ecosystem National Observation and Research Station, Chinese Academy of Tropical Agricultural Sciences, Danzhou 571737, China
| | - Dong Zhu
- Key Laboratory of Urban Environment and Health, Ningbo Observation and Research Station, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China
| | - Qinfen Li
- Hainan Key Laboratory of Tropical Eco-Circular Agriculture, Environmental and Plant Protection Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; Key Laboratory of Low-carbon Green Agriculture in Tropical Region of China, Ministry of Agriculture and Rural Affairs, Environmental and Plant Protection Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; Hainan Danzhou Tropical Agro-ecosystem National Observation and Research Station, Chinese Academy of Tropical Agricultural Sciences, Danzhou 571737, China
| | - Li Yin
- NHC Key Laboratory of Tropical Disease Control, School of Tropical Medicine, Hainan Medical University, Haikou, Hainan 571199, China
| | - Okugbe Ebiotubo Ohore
- NHC Key Laboratory of Tropical Disease Control, School of Tropical Medicine, Hainan Medical University, Haikou, Hainan 571199, China
| | - Shaobai Wen
- NHC Key Laboratory of Tropical Disease Control, School of Tropical Medicine, Hainan Medical University, Haikou, Hainan 571199, China
| | - Changfeng Ding
- CAS Key Laboratory of Soil Environment and Pollution Remediation, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008, China
| | - Yican Zhang
- College of Life Sciences, Jilin Agricultural University, Changchun 130118, China
| | - Zhengfu Yue
- Hainan Key Laboratory of Tropical Eco-Circular Agriculture, Environmental and Plant Protection Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; Key Laboratory of Low-carbon Green Agriculture in Tropical Region of China, Ministry of Agriculture and Rural Affairs, Environmental and Plant Protection Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; Hainan Danzhou Tropical Agro-ecosystem National Observation and Research Station, Chinese Academy of Tropical Agricultural Sciences, Danzhou 571737, China.
| | - Yukun Zou
- Hainan Key Laboratory of Tropical Eco-Circular Agriculture, Environmental and Plant Protection Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; Key Laboratory of Low-carbon Green Agriculture in Tropical Region of China, Ministry of Agriculture and Rural Affairs, Environmental and Plant Protection Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; Hainan Danzhou Tropical Agro-ecosystem National Observation and Research Station, Chinese Academy of Tropical Agricultural Sciences, Danzhou 571737, China
| |
Collapse
|
8
|
Wei A, Zhan H, Xiao Z, Zhao W, Jiang X. A novel framework for phage-host prediction via logical probability theory and network sparsification. Brief Bioinform 2024; 26:bbae708. [PMID: 39780485 PMCID: PMC11711101 DOI: 10.1093/bib/bbae708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 11/25/2024] [Accepted: 12/26/2024] [Indexed: 01/11/2025] Open
Abstract
Bacterial resistance has emerged as one of the greatest threats to human health, and phages have shown tremendous potential in addressing the issue of drug-resistant bacteria by lysing host. The identification of phage-host interactions (PHI) is crucial for addressing bacterial infections. Some existing computational methods for predicting PHI are suboptimal in terms of prediction efficiency due to the limited types of available information. Despite the emergence of some supporting information, the generalizability of models using this information is limited by the small scale of the databases. Additionally, most existing models overlook the sparsity of association data, which severely impacts their predictive performance as well. In this study, we propose a dual-view sparse network model (DSPHI) to predict PHI, which leverages logical probability theory and network sparsification. Specifically, we first constructed similarity networks using the sequences of phages and hosts respectively, and then sparsified these networks, enabling the model to focus more on key information during the learning process, thereby improving prediction efficiency. Next, we utilize logical probability theory to compute high-order logical information between phages (hosts), which is known as mutual information. Subsequently, we connect this information in node form to the sparse phage (host) similarity network, resulting in a phage (host) heterogeneous network that better integrates the two information views, thereby reducing the complexity of model computation and enhancing information aggregation capabilities. The hidden features of phages and hosts are explored through graph learning algorithms. Experimental results demonstrate that mutual information is effective information in predicting PHI, and the sparsification procedure of similarity networks significantly improves the model's predictive performance.
Collapse
Affiliation(s)
- Ankang Wei
- Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China
- School of Computer Science, Central China Normal University, Wuhan 430079, China
- School of Mathematics and Statistics, Central China Normal University, Wuhan 430079, China
| | - Huanghan Zhan
- Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China
- School of Computer Science, Central China Normal University, Wuhan 430079, China
| | - Zhen Xiao
- Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China
- School of Computer Science, Central China Normal University, Wuhan 430079, China
- School of Mathematics and Statistics, Central China Normal University, Wuhan 430079, China
| | - Weizhong Zhao
- Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China
- School of Computer Science, Central China Normal University, Wuhan 430079, China
- National Language Resources Monitoring & Research Center for Network Media, Central China Normal University, Wuhan 430079, China
| | - Xingpeng Jiang
- Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China
- School of Computer Science, Central China Normal University, Wuhan 430079, China
- National Language Resources Monitoring & Research Center for Network Media, Central China Normal University, Wuhan 430079, China
| |
Collapse
|
9
|
Wei A, Xiao Z, Fu L, Zhao W, Jiang X. Predicting phage-host interactions via feature augmentation and regional graph convolution. Brief Bioinform 2024; 26:bbae672. [PMID: 39756070 PMCID: PMC11671694 DOI: 10.1093/bib/bbae672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 11/05/2024] [Accepted: 12/14/2024] [Indexed: 12/28/2024] Open
Abstract
Identifying phage-host interactions (PHIs) is a crucial step in developing phage therapy, which is the promising solution to addressing the issue of antibiotic resistance in superbugs. However, the lifestyle of phages, which strongly depends on their host for life activities, limits their cultivability, making the study of predicting PHIs time-consuming and labor-intensive for traditional wet lab experiments. Although many deep learning (DL) approaches have been applied to PHIs prediction, most DL methods are predominantly based on sequence information, failing to comprehensively model the intricate relationships within PHIs. Moreover, most existing approaches are limited for sub-optimal performance, due to the potential risk of overfitting induced by the highly data sparsity in the task of PHIs prediction. In this study, we propose a novel approach called MI-RGC, which introduces mutual information for feature augmentation and employs regional graph convolution to learn meaningful representations. Specifically, MI-RGC treats the presence status of phages in environmental samples as random variables, and derives the mutual information between these random variables as the dependency relationships among phages. Consequently, a mutual information-based heterogeneous network is construted as feature augmentation for sequence information of phages, which is utilized for building a sequence information-based heterogeneous network. By considering the different contributions of neighboring nodes at varying distances, a regional graph convolutional model is designed, in which the neighboring nodes are segmented into different regions and a regional-level attention mechanism is employed to derive node embeddings. Finally, the embeddings learned from these two networks are aggregated through an attention mechanism, on which the prediction of PHIs is condcuted accordingly. Experimental results on three benchmark datasets demonstrate that MI-RGC derives superior performance over other methods on the task of PHIs prediction.
Collapse
Affiliation(s)
- Ankang Wei
- School of Mathematics and Statistics, Central China Normal University, Wuhan 430079, China
- Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China
- School of Computer Science, Central China Normal University, Wuhan 430079, China
| | - Zhen Xiao
- School of Mathematics and Statistics, Central China Normal University, Wuhan 430079, China
- Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China
- School of Computer Science, Central China Normal University, Wuhan 430079, China
| | - Lingling Fu
- School of Mathematics and Statistics, Central China Normal University, Wuhan 430079, China
- Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China
- School of Computer Science, Central China Normal University, Wuhan 430079, China
| | - Weizhong Zhao
- Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China
- School of Computer Science, Central China Normal University, Wuhan 430079, China
- National Language Resources Monitoring & Research Center for Network Media,Central China Normal University, Wuhan 430079, China
| | - Xingpeng Jiang
- Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China
- School of Computer Science, Central China Normal University, Wuhan 430079, China
- National Language Resources Monitoring & Research Center for Network Media,Central China Normal University, Wuhan 430079, China
| |
Collapse
|
10
|
Liu F, Zhao Z, Liu Y. PHPGAT: predicting phage hosts based on multimodal heterogeneous knowledge graph with graph attention network. Brief Bioinform 2024; 26:bbaf017. [PMID: 39833104 PMCID: PMC11745545 DOI: 10.1093/bib/bbaf017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2024] [Revised: 12/18/2024] [Accepted: 01/07/2025] [Indexed: 01/22/2025] Open
Abstract
Antibiotic resistance poses a significant threat to global health, making the development of alternative strategies to combat bacterial pathogens increasingly urgent. One such promising approach is the strategic use of bacteriophages (or phages) to specifically target and eradicate antibiotic-resistant bacteria. Phages, being among the most prevalent life forms on Earth, play a critical role in maintaining ecological balance by regulating bacterial communities and driving genetic diversity. Accurate prediction of phage hosts is essential for successfully applying phage therapy. However, existing prediction models may not fully encapsulate the complex dynamics of phage-host interactions in diverse microbial environments, indicating a need for improved accuracy through more sophisticated modeling techniques. In response to this challenge, this study introduces a novel phage-host prediction model, PHPGAT, which leverages a multimodal heterogeneous knowledge graph with the advanced GATv2 (Graph Attention Network v2) framework. The model first constructs a multimodal heterogeneous knowledge graph by integrating phage-phage, host-host, and phage-host interactions to capture the intricate connections between biological entities. GATv2 is then employed to extract deep node features and learn dynamic interdependencies, generating context-aware embeddings. Finally, an inner product decoder is designed to compute the likelihood of interaction between a phage and host pair based on the embedding vectors produced by GATv2. Evaluation results using two datasets demonstrate that PHPGAT achieves precise phage host predictions and outperforms other models. PHPGAT is available at https://github.com/ZhaoZMer/PHPGAT.
Collapse
Affiliation(s)
- Fu Liu
- College of Communication Engineering, Jilin University, No. 2699 Qianjin Street, Chaoyang District, Changchun 130012, China
| | - Zhimiao Zhao
- School of Artificial Intelligence, Jilin University, No. 5988 Renmin Street, Nanguan District, Changchun 130022, China
| | - Yun Liu
- College of Communication Engineering, Jilin University, No. 2699 Qianjin Street, Chaoyang District, Changchun 130012, China
| |
Collapse
|
11
|
McDonnell B, Parlindungan E, Vasiliauskaite E, Bottacini F, Coughlan K, Krishnaswami LP, Sassen T, Lugli GA, Ventura M, Mastroleo F, Mahony J, van Sinderen D. Viromic and Metagenomic Analyses of Commercial Spirulina Fermentations Reveal Remarkable Microbial Diversity. Viruses 2024; 16:1039. [PMID: 39066202 PMCID: PMC11281685 DOI: 10.3390/v16071039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 06/18/2024] [Accepted: 06/25/2024] [Indexed: 07/28/2024] Open
Abstract
Commercially produced cyanobacteria preparations sold under the name spirulina are widely consumed, due to their traditional use as a nutrient-rich foodstuff and subsequent marketing as a superfood. Despite their popularity, the microbial composition of ponds used to cultivate these bacteria is understudied. A total of 19 pond samples were obtained from small-scale spirulina farms and subjected to metagenome and/or virome sequencing, and the results were analysed. A remarkable level of prokaryotic and viral diversity was found to be present in the ponds, with Limnospira sp. and Arthrospira sp. sometimes being notably scarce. A detailed breakdown of prokaryotic and viral components of 15 samples is presented. Twenty putative Limnospira sp.-infecting bacteriophage contigs were identified, though no correlation between the performance of these cultures and the presence of phages was found. The high diversity of these samples prevented the identification of clear trends in sample performance over time, between ponds or when comparing successful and failed fermentations.
Collapse
Affiliation(s)
- Brian McDonnell
- School of Microbiology, University College Cork, T12 Y337 Cork, Ireland; (B.M.); (E.V.); (K.C.); (L.P.K.); (J.M.)
- APC Microbiome Ireland, University College Cork, T12 YT20 Cork, Ireland;
| | - Elvina Parlindungan
- School of Microbiology, University College Cork, T12 Y337 Cork, Ireland; (B.M.); (E.V.); (K.C.); (L.P.K.); (J.M.)
- APC Microbiome Ireland, University College Cork, T12 YT20 Cork, Ireland;
| | - Erika Vasiliauskaite
- School of Microbiology, University College Cork, T12 Y337 Cork, Ireland; (B.M.); (E.V.); (K.C.); (L.P.K.); (J.M.)
- APC Microbiome Ireland, University College Cork, T12 YT20 Cork, Ireland;
| | - Francesca Bottacini
- APC Microbiome Ireland, University College Cork, T12 YT20 Cork, Ireland;
- Biological Sciences, Munster Technological University, Bishopstown, T12 P928 Cork, Ireland
| | - Keith Coughlan
- School of Microbiology, University College Cork, T12 Y337 Cork, Ireland; (B.M.); (E.V.); (K.C.); (L.P.K.); (J.M.)
- APC Microbiome Ireland, University College Cork, T12 YT20 Cork, Ireland;
| | - Lakshmi Priyadarshini Krishnaswami
- School of Microbiology, University College Cork, T12 Y337 Cork, Ireland; (B.M.); (E.V.); (K.C.); (L.P.K.); (J.M.)
- APC Microbiome Ireland, University College Cork, T12 YT20 Cork, Ireland;
| | - Tom Sassen
- School of Microbiology, University College Cork, T12 Y337 Cork, Ireland; (B.M.); (E.V.); (K.C.); (L.P.K.); (J.M.)
- APC Microbiome Ireland, University College Cork, T12 YT20 Cork, Ireland;
- Microbiology Unit, Nuclear Medical Applications, Belgian Nuclear Research Centre, SCK CEN, 2400 Mol, Belgium;
| | - Gabriele Andrea Lugli
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental Sustainability, University of Parma, 43124 Parma, Italy; (G.A.L.); (M.V.)
- Interdepartmental Research Centre “Microbiome Research Hub”, University of Parma, 43124 Parma, Italy
| | - Marco Ventura
- Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental Sustainability, University of Parma, 43124 Parma, Italy; (G.A.L.); (M.V.)
- Interdepartmental Research Centre “Microbiome Research Hub”, University of Parma, 43124 Parma, Italy
| | - Felice Mastroleo
- Microbiology Unit, Nuclear Medical Applications, Belgian Nuclear Research Centre, SCK CEN, 2400 Mol, Belgium;
| | - Jennifer Mahony
- School of Microbiology, University College Cork, T12 Y337 Cork, Ireland; (B.M.); (E.V.); (K.C.); (L.P.K.); (J.M.)
- APC Microbiome Ireland, University College Cork, T12 YT20 Cork, Ireland;
| | - Douwe van Sinderen
- School of Microbiology, University College Cork, T12 Y337 Cork, Ireland; (B.M.); (E.V.); (K.C.); (L.P.K.); (J.M.)
- APC Microbiome Ireland, University College Cork, T12 YT20 Cork, Ireland;
| |
Collapse
|
12
|
Xie K, Lin B, Sun X, Zhu P, Liu C, Liu G, Cao X, Pan J, Qiu S, Yuan X, Liang M, Jiang J, Yuan L. Identification and classification of the genomes of novel microviruses in poultry slaughterhouse. Front Microbiol 2024; 15:1393153. [PMID: 38756731 PMCID: PMC11096546 DOI: 10.3389/fmicb.2024.1393153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 04/11/2024] [Indexed: 05/18/2024] Open
Abstract
Microviridae is a family of phages with circular ssDNA genomes and they are widely found in various environments and organisms. In this study, virome techniques were employed to explore potential members of Microviridae in a poultry slaughterhouse, leading to the identification of 98 novel and complete microvirus genomes. Using a similarity clustering network classification approach, these viruses were found to belong to at least 6 new subfamilies within Microviridae and 3 higher-level taxonomic units. Genome size, GC content and genome structure of these new taxa showed evident regularities, validating the rationality of our classification method. Our method can divide microviruses into about 45 additional detailed clusters, which may serve as a new standard for classifying Microviridae members. Furthermore, by addressing the scarcity of host information for microviruses, the current study significantly broadened their host range and discovered over 20 possible new hosts, including important pathogenic bacteria such as Helicobacter pylori and Vibrio cholerae, as well as different taxa demonstrated different host specificities. The findings of this study effectively expand the diversity of the Microviridae family, providing new insights for their classification and identification. Additionally, it offers a novel perspective for monitoring and controlling pathogenic microorganisms in poultry slaughterhouse environments.
Collapse
Affiliation(s)
- Keming Xie
- School of Life Sciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization, Ministry of Agriculture and Rural Affairs, South China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, Guangdong, China
| | - Benfu Lin
- Huadu District Animal Health Supervision Institution, Guangzhou, Guangdong, China
| | - Xinyu Sun
- School of Life Sciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| | - Peng Zhu
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization, Ministry of Agriculture and Rural Affairs, South China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, Guangdong, China
- College of Marine Ecology and Environment, Shanghai Ocean University, Shanghai, China
| | - Chang Liu
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization, Ministry of Agriculture and Rural Affairs, South China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, Guangdong, China
| | - Guangfeng Liu
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization, Ministry of Agriculture and Rural Affairs, South China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, Guangdong, China
| | - Xudong Cao
- Department of Chemical and Biological Engineering, University of Ottawa, Ottawa, ON, Canada
| | - Jingqi Pan
- School of Life Sciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| | - Suiping Qiu
- School of Life Sciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| | - Xiaoqi Yuan
- School of Life Sciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| | - Mengshi Liang
- School of Life Sciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| | - Jingzhe Jiang
- School of Life Sciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization, Ministry of Agriculture and Rural Affairs, South China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, Guangdong, China
| | - Lihong Yuan
- School of Life Sciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| |
Collapse
|
13
|
Wu Y, Gao N, Sun C, Feng T, Liu Q, Chen WH. A compendium of ruminant gastrointestinal phage genomes revealed a higher proportion of lytic phages than in any other environments. MICROBIOME 2024; 12:69. [PMID: 38576042 PMCID: PMC10993611 DOI: 10.1186/s40168-024-01784-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Accepted: 02/29/2024] [Indexed: 04/06/2024]
Abstract
BACKGROUND Ruminants are important livestock animals that have a unique digestive system comprising multiple stomach compartments. Despite significant progress in the study of microbiome in the gastrointestinal tract (GIT) sites of ruminants, we still lack an understanding of the viral community of ruminants. Here, we surveyed its viral ecology using 2333 samples from 10 sites along the GIT of 8 ruminant species. RESULTS We present the Unified Ruminant Phage Catalogue (URPC), a comprehensive survey of phages in the GITs of ruminants including 64,922 non-redundant phage genomes. We characterized the distributions of the phage genomes in different ruminants and GIT sites and found that most phages were organism-specific. We revealed that ~ 60% of the ruminant phages were lytic, which was the highest as compared with those in all other environments and certainly will facilitate their applications in microbial interventions. To further facilitate the future applications of the phages, we also constructed a comprehensive virus-bacteria/archaea interaction network and identified dozens of phages that may have lytic effects on methanogenic archaea. CONCLUSIONS The URPC dataset represents a useful resource for future microbial interventions to improve ruminant production and ecological environmental qualities. Phages have great potential for controlling pathogenic bacterial/archaeal species and reducing methane emissions. Our findings provide insights into the virome ecology research of the ruminant GIT and offer a starting point for future research on phage therapy in ruminants. Video Abstract.
Collapse
Affiliation(s)
- Yingjian Wu
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-Imaging, Center for Artificial Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Na Gao
- Department of Laboratory Medicine, Zhongnan Hospital of Wuhan University, Wuhan University, Wuhan, 430071, China
| | - Chuqing Sun
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-Imaging, Center for Artificial Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Tong Feng
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-Imaging, Center for Artificial Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Qingyou Liu
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding, School of Life Science and Engineering, Foshan University, Foshan, 528225, China.
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, 530005, China.
| | - Wei-Hua Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-Imaging, Center for Artificial Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China.
- Institution of Medical Artificial Intelligence, Binzhou Medical University, Yantai, 264003, China.
| |
Collapse
|
14
|
Liu X, Liu Y, Liu J, Zhang H, Shan C, Guo Y, Gong X, Cui M, Li X, Tang M. Correlation between the gut microbiome and neurodegenerative diseases: a review of metagenomics evidence. Neural Regen Res 2024; 19:833-845. [PMID: 37843219 PMCID: PMC10664138 DOI: 10.4103/1673-5374.382223] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 04/19/2023] [Accepted: 06/17/2023] [Indexed: 10/17/2023] Open
Abstract
A growing body of evidence suggests that the gut microbiota contributes to the development of neurodegenerative diseases via the microbiota-gut-brain axis. As a contributing factor, microbiota dysbiosis always occurs in pathological changes of neurodegenerative diseases, such as Alzheimer's disease, Parkinson's disease, and amyotrophic lateral sclerosis. High-throughput sequencing technology has helped to reveal that the bidirectional communication between the central nervous system and the enteric nervous system is facilitated by the microbiota's diverse microorganisms, and for both neuroimmune and neuroendocrine systems. Here, we summarize the bioinformatics analysis and wet-biology validation for the gut metagenomics in neurodegenerative diseases, with an emphasis on multi-omics studies and the gut virome. The pathogen-associated signaling biomarkers for identifying brain disorders and potential therapeutic targets are also elucidated. Finally, we discuss the role of diet, prebiotics, probiotics, postbiotics and exercise interventions in remodeling the microbiome and reducing the symptoms of neurodegenerative diseases.
Collapse
Affiliation(s)
- Xiaoyan Liu
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Yi Liu
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
- Institute of Animal Husbandry, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu Province, China
| | - Junlin Liu
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Hantao Zhang
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Chaofan Shan
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Yinglu Guo
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Xun Gong
- Department of Rheumatology & Immunology, Affiliated Hospital of Jiangsu University, Zhenjiang, Jiangsu Province, China
| | - Mengmeng Cui
- Department of Neurology, The Second Affiliated Hospital of Shandong First Medical University, Taian, Shandong Province, China
| | - Xiubin Li
- Department of Neurology, The Second Affiliated Hospital of Shandong First Medical University, Taian, Shandong Province, China
| | - Min Tang
- School of Life Sciences, Jiangsu University, Zhenjiang, Jiangsu Province, China
| |
Collapse
|
15
|
Nie W, Qiu T, Wei Y, Ding H, Guo Z, Qiu J. Advances in phage-host interaction prediction: in silico method enhances the development of phage therapies. Brief Bioinform 2024; 25:bbae117. [PMID: 38555471 PMCID: PMC10981677 DOI: 10.1093/bib/bbae117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Revised: 01/15/2024] [Accepted: 03/02/2024] [Indexed: 04/02/2024] Open
Abstract
Phages can specifically recognize and kill bacteria, which lead to important application value of bacteriophage in bacterial identification and typing, livestock aquaculture and treatment of human bacterial infection. Considering the variety of human-infected bacteria and the continuous discovery of numerous pathogenic bacteria, screening suitable therapeutic phages that are capable of infecting pathogens from massive phage databases has been a principal step in phage therapy design. Experimental methods to identify phage-host interaction (PHI) are time-consuming and expensive; high-throughput computational method to predict PHI is therefore a potential substitute. Here, we systemically review bioinformatic methods for predicting PHI, introduce reference databases and in silico models applied in these methods and highlight the strengths and challenges of current tools. Finally, we discuss the application scope and future research direction of computational prediction methods, which contribute to the performance improvement of prediction models and the development of personalized phage therapy.
Collapse
Affiliation(s)
- Wanchun Nie
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| | - Tianyi Qiu
- Institute of Clinical Science, Zhongshan Hospital; Intelligent Medicine Institute, Fudan University, Shanghai, 200032, China
- Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, 200032, China
| | - Yiwen Wei
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| | - Hao Ding
- Institute of Clinical Science, Zhongshan Hospital; Intelligent Medicine Institute, Fudan University, Shanghai, 200032, China
| | - Zhixiang Guo
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| | - Jingxuan Qiu
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| |
Collapse
|
16
|
Mahony J. Biological and bioinformatic tools for the discovery of unknown phage-host combinations. Curr Opin Microbiol 2024; 77:102426. [PMID: 38246125 DOI: 10.1016/j.mib.2024.102426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 12/21/2023] [Accepted: 01/02/2024] [Indexed: 01/23/2024]
Abstract
The field of microbial ecology has been transformed by metagenomics in recent decades and has culminated in vast datasets that facilitate the bioinformatic dissection of complex microbial communities. Recently, attention has turned from defining the microbiota composition to the interactions and relationships that occur between members of the microbiota. Within complex microbiota, the identification of bacteriophage-host combinations has been a major challenge. Recent developments in artificial intelligence tools to predict protein structure and function as well as the relationships between bacteria and their infecting bacteriophages allow a strategic approach to identifying and validating phage-host relationships. However, biological validation of these predictions remains essential and will serve to improve the existing predictive tools. In this review, I provide an overview of the most recent developments in both bioinformatic and experimental approaches to predicting and experimentally validating unknown phage-host combinations.
Collapse
Affiliation(s)
- Jennifer Mahony
- School of Microbiology & APC Microbiome Ireland, University College Cork, Western Road, T12 YT20 Cork, Ireland.
| |
Collapse
|
17
|
Liu GY, Yu D, Fan MM, Zhang X, Jin ZY, Tang C, Liu XF. Antimicrobial resistance crisis: could artificial intelligence be the solution? Mil Med Res 2024; 11:7. [PMID: 38254241 PMCID: PMC10804841 DOI: 10.1186/s40779-024-00510-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 01/08/2024] [Indexed: 01/24/2024] Open
Abstract
Antimicrobial resistance is a global public health threat, and the World Health Organization (WHO) has announced a priority list of the most threatening pathogens against which novel antibiotics need to be developed. The discovery and introduction of novel antibiotics are time-consuming and expensive. According to WHO's report of antibacterial agents in clinical development, only 18 novel antibiotics have been approved since 2014. Therefore, novel antibiotics are critically needed. Artificial intelligence (AI) has been rapidly applied to drug development since its recent technical breakthrough and has dramatically improved the efficiency of the discovery of novel antibiotics. Here, we first summarized recently marketed novel antibiotics, and antibiotic candidates in clinical development. In addition, we systematically reviewed the involvement of AI in antibacterial drug development and utilization, including small molecules, antimicrobial peptides, phage therapy, essential oils, as well as resistance mechanism prediction, and antibiotic stewardship.
Collapse
Affiliation(s)
- Guang-Yu Liu
- Department of Immunology and Pathogen Biology, School of Basic Medical Sciences, Hangzhou Normal University, Key Laboratory of Aging and Cancer Biology of Zhejiang Province, Key Laboratory of Inflammation and Immunoregulation of Hangzhou, Hangzhou Normal University, Hangzhou, 311121, China
| | - Dan Yu
- National Key Discipline of Pediatrics Key Laboratory of Major Diseases in Children Ministry of Education, Laboratory of Dermatology, Beijing Pediatric Research Institute, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health, Beijing, 100045, China
| | - Mei-Mei Fan
- Department of Immunology and Pathogen Biology, School of Basic Medical Sciences, Hangzhou Normal University, Key Laboratory of Aging and Cancer Biology of Zhejiang Province, Key Laboratory of Inflammation and Immunoregulation of Hangzhou, Hangzhou Normal University, Hangzhou, 311121, China
| | - Xu Zhang
- Robert and Arlene Kogod Center on Aging, Mayo Clinic, Rochester, MN, 55905, USA
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Ze-Yu Jin
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Christoph Tang
- Sir William Dunn School of Pathology, University of Oxford, Oxford, OX1 3RE, UK.
| | - Xiao-Fen Liu
- Institute of Antibiotics, Huashan Hospital, Fudan University, Key Laboratory of Clinical Pharmacology of Antibiotics, National Health Commission of the People's Republic of China, National Clinical Research Centre for Aging and Medicine, Huashan Hospital, Fudan University, Shanghai, 200040, China.
| |
Collapse
|
18
|
Zhou Y, Wang Y, Prangishvili D, Krupovic M. Exploring the Archaeal Virosphere by Metagenomics. Methods Mol Biol 2024; 2732:1-22. [PMID: 38060114 DOI: 10.1007/978-1-0716-3515-5_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023]
Abstract
During the past decade, environmental research has demonstrated that archaea are abundant and widespread in nature and play important ecological roles at a global scale. Currently, however, the majority of archaeal lineages cannot be cultivated under laboratory conditions and are known exclusively or nearly exclusively through metagenomics. A similar trend extends to the archaeal virosphere, where isolated representatives are available for a handful of model archaeal virus-host systems. Viral metagenomics provides an alternative way to circumvent the limitations of culture-based virus discovery and offers insight into the diversity, distribution, and environmental impact of uncultured archaeal viruses. Presently, metagenomics approaches have been successfully applied to explore the viromes associated with various lineages of extremophilic and mesophilic archaea, including Asgard archaea (Asgardarchaeota), ANME-1 archaea (Methanophagales), thaumarchaea (Nitrososphaeria), altiarchaea (Altiarchaeota), and marine group II archaea (Poseidoniales). Here, we provide an overview of methods widely used in archaeal virus metagenomics, covering metavirome preparation, genome annotation, phylogenetic and phylogenomic analyses, and archaeal host assignment. We hope that this summary will contribute to further exploration and characterization of the enigmatic archaeal virome lurking in diverse environments.
Collapse
Affiliation(s)
- Yifan Zhou
- Institut Pasteur, Université Paris Cité, Archaeal Virology Unit, Paris, France
- Sorbonne Université, Collège Doctoral, Paris, France
| | - Yongjie Wang
- College of Food Science and Technology, Shanghai Ocean University, Shanghai, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- Laboratory of Quality and Safety Risk Assessment for Aquatic Products on Storage and Preservation (Shanghai), Ministry of Agriculture, Shanghai, China
| | - David Prangishvili
- Institut Pasteur, Université Paris Cité, Archaeal Virology Unit, Paris, France
- Ivane Javakhishvili Tbilisi State University, Tbilisi, Georgia
| | - Mart Krupovic
- Institut Pasteur, Université Paris Cité, Archaeal Virology Unit, Paris, France.
| |
Collapse
|
19
|
Howell AA, Versoza CJ, Pfeifer SP. Computational host range prediction-The good, the bad, and the ugly. Virus Evol 2023; 10:vead083. [PMID: 38361822 PMCID: PMC10868548 DOI: 10.1093/ve/vead083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 12/05/2023] [Accepted: 12/19/2023] [Indexed: 02/17/2024] Open
Abstract
The rapid emergence and spread of antimicrobial resistance across the globe have prompted the usage of bacteriophages (i.e. viruses that infect bacteria) in a variety of applications ranging from agriculture to biotechnology and medicine. In order to effectively guide the application of bacteriophages in these multifaceted areas, information about their host ranges-that is the bacterial strains or species that a bacteriophage can successfully infect and kill-is essential. Utilizing sixteen broad-spectrum (polyvalent) bacteriophages with experimentally validated host ranges, we here benchmark the performance of eleven recently developed computational host range prediction tools that provide a promising and highly scalable supplement to traditional, but laborious, experimental procedures. We show that machine- and deep-learning approaches offer the highest levels of accuracy and precision-however, their predominant predictions at the species- or genus-level render them ill-suited for applications outside of an ecosystems metagenomics framework. In contrast, only moderate sensitivity (<80 per cent) could be reached at the strain-level, albeit at low levels of precision (<40 per cent). Taken together, these limitations demonstrate that there remains room for improvement in the active scientific field of in silico host prediction to combat the challenge of guiding experimental designs to identify the most promising bacteriophage candidates for any given application.
Collapse
Affiliation(s)
| | - Cyril J Versoza
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ 85281, USA
| | - Susanne P Pfeifer
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
20
|
Du ZH, Zhong JP, Liu Y, Li JQ. Prokaryotic virus host prediction with graph contrastive augmentaion. PLoS Comput Biol 2023; 19:e1011671. [PMID: 38039280 PMCID: PMC10691718 DOI: 10.1371/journal.pcbi.1011671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Accepted: 11/07/2023] [Indexed: 12/03/2023] Open
Abstract
Prokaryotic viruses, also known as bacteriophages, play crucial roles in regulating microbial communities and have the potential for phage therapy applications. Accurate prediction of phage-host interactions is essential for understanding the dynamics of these viruses and their impacts on bacterial populations. Numerous computational methods have been developed to tackle this challenging task. However, most existing prediction models can be constrained due to the substantial number of unknown interactions in comparison to the constrained diversity of available training data. To solve the problem, we introduce a model for prokaryotic virus host prediction with graph contrastive augmentation (PHPGCA). Specifically, we construct a comprehensive heterogeneous graph by integrating virus-virus protein similarity and virus-host DNA sequence similarity information. As the backbone encoder for learning node representations in the virus-prokaryote graph, we employ LGCN, a state-of-the-art graph embedding technique. Additionally, we apply graph contrastive learning to augment the node representations without the need for additional labels. We further conducted two case studies aimed at predicting the host range of multi-species phages, helping to understand the phage ecology and evolution.
Collapse
Affiliation(s)
- Zhi-Hua Du
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, Guang-dong, China
| | - Jun-Peng Zhong
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, Guang-dong, China
| | - Yun Liu
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, Guang-dong, China
| | - Jian-Qiang Li
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, Guang-dong, China
| |
Collapse
|
21
|
Cai H, Zhou Y, Li X, Xu T, Ni Y, Wu S, Yu Y, Wang Y. Genomic Analysis and Taxonomic Characterization of Seven Bacteriophage Genomes Metagenomic-Assembled from the Dishui Lake. Viruses 2023; 15:2038. [PMID: 37896815 PMCID: PMC10611076 DOI: 10.3390/v15102038] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 09/27/2023] [Accepted: 09/29/2023] [Indexed: 10/29/2023] Open
Abstract
Viruses in aquatic ecosystems exhibit remarkable abundance and diversity. However, scattered studies have been conducted to mine uncultured viruses and identify them taxonomically in lake water. Here, whole genomes (29-173 kbp) of seven uncultured dsDNA bacteriophages were discovered in Dishui Lake, the largest artificial lake in Shanghai. We analyzed their genomic signatures and found a series of viral auxiliary metabolic genes closely associated with protein synthesis and host metabolism. Dishui Lake phages shared more genes with uncultivated environmental viruses than with reference viruses based on the gene-sharing network classification. Phylogeny of proteomes and comparative genomics delineated three new genera within two known viral families of Kyanoviridae and Autographiviridae, and four new families in Caudoviricetes for these seven novel phages. Their potential hosts appeared to be from the dominant bacterial phyla in Dishui Lake. Altogether, our study provides initial insights into the composition and diversity of bacteriophage communities in Dishui Lake, contributing valuable knowledge to the ongoing research on the roles played by viruses in freshwater ecosystems.
Collapse
Affiliation(s)
- Haoyun Cai
- College of Food Science and Technology, Shanghai Ocean University, Shanghai 201306, China; (H.C.); (Y.Z.); (X.L.); (T.X.); (Y.N.); (S.W.); (Y.Y.)
| | - Yifan Zhou
- College of Food Science and Technology, Shanghai Ocean University, Shanghai 201306, China; (H.C.); (Y.Z.); (X.L.); (T.X.); (Y.N.); (S.W.); (Y.Y.)
| | - Xiefei Li
- College of Food Science and Technology, Shanghai Ocean University, Shanghai 201306, China; (H.C.); (Y.Z.); (X.L.); (T.X.); (Y.N.); (S.W.); (Y.Y.)
| | - Tianqi Xu
- College of Food Science and Technology, Shanghai Ocean University, Shanghai 201306, China; (H.C.); (Y.Z.); (X.L.); (T.X.); (Y.N.); (S.W.); (Y.Y.)
| | - Yimin Ni
- College of Food Science and Technology, Shanghai Ocean University, Shanghai 201306, China; (H.C.); (Y.Z.); (X.L.); (T.X.); (Y.N.); (S.W.); (Y.Y.)
| | - Shuang Wu
- College of Food Science and Technology, Shanghai Ocean University, Shanghai 201306, China; (H.C.); (Y.Z.); (X.L.); (T.X.); (Y.N.); (S.W.); (Y.Y.)
| | - Yongxin Yu
- College of Food Science and Technology, Shanghai Ocean University, Shanghai 201306, China; (H.C.); (Y.Z.); (X.L.); (T.X.); (Y.N.); (S.W.); (Y.Y.)
| | - Yongjie Wang
- College of Food Science and Technology, Shanghai Ocean University, Shanghai 201306, China; (H.C.); (Y.Z.); (X.L.); (T.X.); (Y.N.); (S.W.); (Y.Y.)
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266000, China
- Laboratory of Quality and Safety Risk Assessment for Aquatic Products on Storage and Preservation, Ministry of Agriculture and Rural Affairs, Shanghai 201306, China
| |
Collapse
|
22
|
Shang J, Peng C, Liao H, Tang X, Sun Y. PhaBOX: a web server for identifying and characterizing phage contigs in metagenomic data. BIOINFORMATICS ADVANCES 2023; 3:vbad101. [PMID: 37641717 PMCID: PMC10460485 DOI: 10.1093/bioadv/vbad101] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 07/05/2023] [Accepted: 08/01/2023] [Indexed: 08/31/2023]
Abstract
Motivation There is accumulating evidence showing the important roles of bacteriophages (phages) in regulating the structure and functions of the microbiome. However, lacking an easy-to-use and integrated phage analysis software hampers microbiome-related research from incorporating phages in the analysis. Results In this work, we developed a web server, PhaBOX, which can comprehensively identify and analyze phage contigs in metagenomic data. It supports integrated phage analysis, including phage contig identification from the metagenomic assembly, lifestyle prediction, taxonomic classification, and host prediction. Instead of treating the algorithms as a black box, PhaBOX also supports visualization of the essential features for making predictions. The web server is designed with a user-friendly graphical interface that enables both informatics-trained and nonspecialist users to analyze phages in microbiome data with ease. Availability and implementation The web server of PhaBOX is available via: https://phage.ee.cityu.edu.hk. The source code of PhaBOX is available at: https://github.com/KennthShang/PhaBOX.
Collapse
Affiliation(s)
- Jiayu Shang
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| | - Cheng Peng
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| | - Herui Liao
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| | - Xubo Tang
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| |
Collapse
|
23
|
Gonzales MEM, Ureta JC, Shrestha AMS. Protein embeddings improve phage-host interaction prediction. PLoS One 2023; 18:e0289030. [PMID: 37486915 PMCID: PMC10365317 DOI: 10.1371/journal.pone.0289030] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 07/07/2023] [Indexed: 07/26/2023] Open
Abstract
With the growing interest in using phages to combat antimicrobial resistance, computational methods for predicting phage-host interactions have been explored to help shortlist candidate phages. Most existing models consider entire proteomes and rely on manual feature engineering, which poses difficulty in selecting the most informative sequence properties to serve as input to the model. In this paper, we framed phage-host interaction prediction as a multiclass classification problem that takes as input the embeddings of a phage's receptor-binding proteins, which are known to be the key machinery for host recognition, and predicts the host genus. We explored different protein language models to automatically encode these protein sequences into dense embeddings without the need for additional alignment or structural information. We show that the use of embeddings of receptor-binding proteins presents improvements over handcrafted genomic and protein sequence features. The highest performance was obtained using the transformer-based protein language model ProtT5, resulting in a 3% to 4% increase in weighted F1 and recall scores across different prediction confidence thresholds, compared to using selected handcrafted sequence features.
Collapse
Affiliation(s)
- Mark Edward M Gonzales
- Bioinformatics Laboratory, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila, Philippines
- Department of Software Technology, College of Computer Studies, De La Salle University, Manila, Philippines
| | - Jennifer C Ureta
- Bioinformatics Laboratory, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila, Philippines
- Department of Software Technology, College of Computer Studies, De La Salle University, Manila, Philippines
| | - Anish M S Shrestha
- Bioinformatics Laboratory, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila, Philippines
- Systems and Computational Biology Research Unit, Center for Natural Sciences and Environmental Research, De La Salle University, Manila, Philippines
- Department of Software Technology, College of Computer Studies, De La Salle University, Manila, Philippines
| |
Collapse
|
24
|
Shang J, Peng C, Tang X, Sun Y. PhaVIP: Phage VIrion Protein classification based on chaos game representation and Vision Transformer. Bioinformatics 2023; 39:i30-i39. [PMID: 37387136 DOI: 10.1093/bioinformatics/btad229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open
Abstract
MOTIVATION As viruses that mainly infect bacteria, phages are key players across a wide range of ecosystems. Analyzing phage proteins is indispensable for understanding phages' functions and roles in microbiomes. High-throughput sequencing enables us to obtain phages in different microbiomes with low cost. However, compared to the fast accumulation of newly identified phages, phage protein classification remains difficult. In particular, a fundamental need is to annotate virion proteins, the structural proteins, such as major tail, baseplate, etc. Although there are experimental methods for virion protein identification, they are too expensive or time-consuming, leaving a large number of proteins unclassified. Thus, there is a great demand to develop a computational method for fast and accurate phage virion protein (PVP) classification. RESULTS In this work, we adapted the state-of-the-art image classification model, Vision Transformer, to conduct virion protein classification. By encoding protein sequences into unique images using chaos game representation, we can leverage Vision Transformer to learn both local and global features from sequence "images". Our method, PhaVIP, has two main functions: classifying PVP and non-PVP sequences and annotating the types of PVP, such as capsid and tail. We tested PhaVIP on several datasets with increasing difficulty and benchmarked it against alternative tools. The experimental results show that PhaVIP has superior performance. After validating the performance of PhaVIP, we investigated two applications that can use the output of PhaVIP: phage taxonomy classification and phage host prediction. The results showed the benefit of using classified proteins over all proteins. AVAILABILITY AND IMPLEMENTATION The web server of PhaVIP is available via: https://phage.ee.cityu.edu.hk/phavip. The source code of PhaVIP is available via: https://github.com/KennthShang/PhaVIP.
Collapse
Affiliation(s)
- Jiayu Shang
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| | - Cheng Peng
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| | - Xubo Tang
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| |
Collapse
|
25
|
Yuan X, Huang Z, Zhu Z, Zhang J, Wu Q, Xue L, Wang J, Ding Y. Recent advances in phage defense systems and potential overcoming strategies. Biotechnol Adv 2023; 65:108152. [PMID: 37037289 DOI: 10.1016/j.biotechadv.2023.108152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 04/05/2023] [Accepted: 04/06/2023] [Indexed: 04/12/2023]
Abstract
Bacteriophages are effective in the prevention and control of bacteria, and many phage products have been permitted and applied in the field. Because bacteriophages are expected to replace other antimicrobial agents like antibiotics, the antibacterial effect of bacteriophage has attracted widespread attention. Recently, the diversified defense systems discovered in the target host have become potential threats to the continued effective application of phages. Therefore, a systematic summary and in-depth illustration of the interaction between phages and bacteria is conducive to the development of this biological control approach. In this review, we introduce different defense systems in bacteria against phages and emphasize newly discovered defense mechanisms in recent years. Additionally, we draw attention to the striking resemblance between defense system genes and antibiotic resistance genes, which raises concerns about the potential transfer of phage defense systems within bacterial populations and its future impact on phage efficacy. Thus, attention should be given to the effects of phage defense genes in practical applications. This article is not exhaustive, but strategies to overcome phage defense systems are also discussed to further promote more efficient use of phages.
Collapse
Affiliation(s)
- Xiaoming Yuan
- State Key Laboratory of Applied Microbiology Southern China, Key Laboratory of Agricultural Microbiomics and Precision Application, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Microbial Safety and Health, Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou 510070, China; Department of Food Science & Engineering, Jinan University, Guangzhou 510632, China
| | - Zhichao Huang
- State Key Laboratory of Applied Microbiology Southern China, Key Laboratory of Agricultural Microbiomics and Precision Application, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Microbial Safety and Health, Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou 510070, China; Department of Food Science & Engineering, Jinan University, Guangzhou 510632, China
| | - Zhenjun Zhu
- Department of Food Science & Engineering, Jinan University, Guangzhou 510632, China
| | - Jumei Zhang
- State Key Laboratory of Applied Microbiology Southern China, Key Laboratory of Agricultural Microbiomics and Precision Application, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Microbial Safety and Health, Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou 510070, China
| | - Qingping Wu
- State Key Laboratory of Applied Microbiology Southern China, Key Laboratory of Agricultural Microbiomics and Precision Application, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Microbial Safety and Health, Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou 510070, China
| | - Liang Xue
- State Key Laboratory of Applied Microbiology Southern China, Key Laboratory of Agricultural Microbiomics and Precision Application, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Microbial Safety and Health, Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou 510070, China
| | - Juan Wang
- State Key Laboratory of Applied Microbiology Southern China, Key Laboratory of Agricultural Microbiomics and Precision Application, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Microbial Safety and Health, Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou 510070, China; College of Food Science, South China Agricultural University, Guangzhou 510432, China.
| | - Yu Ding
- Department of Food Science & Engineering, Jinan University, Guangzhou 510632, China.
| |
Collapse
|
26
|
Bajiya N, Dhall A, Aggarwal S, Raghava GPS. Advances in the field of phage-based therapy with special emphasis on computational resources. Brief Bioinform 2023; 24:6961791. [PMID: 36575815 DOI: 10.1093/bib/bbac574] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 11/07/2022] [Accepted: 11/25/2022] [Indexed: 12/29/2022] Open
Abstract
In the current era, one of the major challenges is to manage the treatment of drug/antibiotic-resistant strains of bacteria. Phage therapy, a century-old technique, may serve as an alternative to antibiotics in treating bacterial infections caused by drug-resistant strains of bacteria. In this review, a systematic attempt has been made to summarize phage-based therapy in depth. This review has been divided into the following two sections: general information and computer-aided phage therapy (CAPT). In the case of general information, we cover the history of phage therapy, the mechanism of action, the status of phage-based products (approved and clinical trials) and the challenges. This review emphasizes CAPT, where we have covered primary phage-associated resources, phage prediction methods and pipelines. This review covers a wide range of databases and resources, including viral genomes and proteins, phage receptors, host genomes of phages, phage-host interactions and lytic proteins. In the post-genomic era, identifying the most suitable phage for lysing a drug-resistant strain of bacterium is crucial for developing alternate treatments for drug-resistant bacteria and this remains a challenging problem. Thus, we compile all phage-associated prediction methods that include the prediction of phages for a bacterial strain, the host for a phage and the identification of interacting phage-host pairs. Most of these methods have been developed using machine learning and deep learning techniques. This review also discussed recent advances in the field of CAPT, where we briefly describe computational tools available for predicting phage virions, the life cycle of phages and prophage identification. Finally, we describe phage-based therapy's advantages, challenges and opportunities.
Collapse
Affiliation(s)
- Nisha Bajiya
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India
| | - Anjali Dhall
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India
| | - Suchet Aggarwal
- Department of Computer Science and Engineering, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India
| | - Gajendra P S Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India
| |
Collapse
|
27
|
Shang J, Tang X, Sun Y. PhaTYP: predicting the lifestyle for bacteriophages using BERT. Brief Bioinform 2023; 24:bbac487. [PMID: 36659812 PMCID: PMC9851330 DOI: 10.1093/bib/bbac487] [Citation(s) in RCA: 87] [Impact Index Per Article: 43.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 10/05/2022] [Accepted: 10/15/2022] [Indexed: 11/24/2022] Open
Abstract
Bacteriophages (or phages), which infect bacteria, have two distinct lifestyles: virulent and temperate. Predicting the lifestyle of phages helps decipher their interactions with their bacterial hosts, aiding phages' applications in fields such as phage therapy. Because experimental methods for annotating the lifestyle of phages cannot keep pace with the fast accumulation of sequenced phages, computational method for predicting phages' lifestyles has become an attractive alternative. Despite some promising results, computational lifestyle prediction remains difficult because of the limited known annotations and the sheer amount of sequenced phage contigs assembled from metagenomic data. In particular, most of the existing tools cannot precisely predict phages' lifestyles for short contigs. In this work, we develop PhaTYP (Phage TYPe prediction tool) to improve the accuracy of lifestyle prediction on short contigs. We design two different training tasks, self-supervised and fine-tuning tasks, to overcome lifestyle prediction difficulties. We rigorously tested and compared PhaTYP with four state-of-the-art methods: DeePhage, PHACTS, PhagePred and BACPHLIP. The experimental results show that PhaTYP outperforms all these methods and achieves more stable performance on short contigs. In addition, we demonstrated the utility of PhaTYP for analyzing the phage lifestyle on human neonates' gut data. This application shows that PhaTYP is a useful means for studying phages in metagenomic data and helps extend our understanding of microbial communities.
Collapse
Affiliation(s)
- Jiayu Shang
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China SAR
| | - Xubo Tang
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China SAR
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China SAR
| |
Collapse
|
28
|
Li P, Luo H, Ji B, Nielsen J. Machine learning for data integration in human gut microbiome. Microb Cell Fact 2022; 21:241. [PMID: 36419034 PMCID: PMC9685977 DOI: 10.1186/s12934-022-01973-4] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 11/15/2022] [Indexed: 11/25/2022] Open
Abstract
Recent studies have demonstrated that gut microbiota plays critical roles in various human diseases. High-throughput technology has been widely applied to characterize the microbial ecosystems, which led to an explosion of different types of molecular profiling data, such as metagenomics, metatranscriptomics and metabolomics. For analysis of such data, machine learning algorithms have shown to be useful for identifying key molecular signatures, discovering potential patient stratifications, and particularly for generating models that can accurately predict phenotypes. In this review, we first discuss how dysbiosis of the intestinal microbiota is linked to human disease development and how potential modulation strategies of the gut microbial ecosystem can be used for disease treatment. In addition, we introduce categories and workflows of different machine learning approaches, and how they can be used to perform integrative analysis of multi-omics data. Finally, we review advances of machine learning in gut microbiome applications and discuss related challenges. Based on this we conclude that machine learning is very well suited for analysis of gut microbiome and that these approaches can be useful for development of gut microbe-targeted therapies, which ultimately can help in achieving personalized and precision medicine.
Collapse
Affiliation(s)
- Peishun Li
- grid.5371.00000 0001 0775 6028Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Hao Luo
- grid.5371.00000 0001 0775 6028Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Boyang Ji
- grid.5371.00000 0001 0775 6028Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden ,grid.510909.4BioInnovation Institute, Ole Maaløes Vej 3, DK2200 Copenhagen, Denmark
| | - Jens Nielsen
- grid.5371.00000 0001 0775 6028Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden ,grid.510909.4BioInnovation Institute, Ole Maaløes Vej 3, DK2200 Copenhagen, Denmark
| |
Collapse
|
29
|
Shang J, Tang X, Guo R, Sun Y. Accurate identification of bacteriophages from metagenomic data using Transformer. Brief Bioinform 2022; 23:6620872. [PMID: 35769000 PMCID: PMC9294416 DOI: 10.1093/bib/bbac258] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 05/22/2022] [Accepted: 06/04/2022] [Indexed: 11/20/2022] Open
Abstract
Motivation Bacteriophages are viruses infecting bacteria. Being key players in microbial communities, they can regulate the composition/function of microbiome by infecting their bacterial hosts and mediating gene transfer. Recently, metagenomic sequencing, which can sequence all genetic materials from various microbiome, has become a popular means for new phage discovery. However, accurate and comprehensive detection of phages from the metagenomic data remains difficult. High diversity/abundance, and limited reference genomes pose major challenges for recruiting phage fragments from metagenomic data. Existing alignment-based or learning-based models have either low recall or precision on metagenomic data. Results In this work, we adopt the state-of-the-art language model, Transformer, to conduct contextual embedding for phage contigs. By constructing a protein-cluster vocabulary, we can feed both the protein composition and the proteins’ positions from each contig into the Transformer. The Transformer can learn the protein organization and associations using the self-attention mechanism and predicts the label for test contigs. We rigorously tested our developed tool named PhaMer on multiple datasets with increasing difficulty, including quality RefSeq genomes, short contigs, simulated metagenomic data, mock metagenomic data and the public IMG/VR dataset. All the experimental results show that PhaMer outperforms the state-of-the-art tools. In the real metagenomic data experiment, PhaMer improves the F1-score of phage detection by 27%.
Collapse
Affiliation(s)
- Jiayu Shang
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| | - Xubo Tang
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| | - Ruocheng Guo
- School of Data Science, City University of Hong Kong, Hong Kong (SAR), China
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China
| |
Collapse
|
30
|
Abstract
Motivation Phage–host associations play important roles in microbial communities. But in natural communities, as opposed to culture-based lab studies where phages are discovered and characterized metagenomically, their hosts are generally not known. Several programs have been developed for predicting which phage infects which host based on various sequence similarity measures or machine learning approaches. These are often based on whole viral and host genomes, but in metagenomics-based studies, we rarely have whole genomes but rather must rely on contigs that are sometimes as short as hundreds of bp long. Therefore, we need programs that predict hosts of phage contigs on the basis of these short contigs. Although most existing programs can be applied to metagenomic datasets for these predictions, their accuracies are generally low. Here, we develop ContigNet, a convolutional neural network-based model capable of predicting phage–host matches based on relatively short contigs, and compare it to previously published VirHostMatcher (VHM) and WIsH. Results On the validation set, ContigNet achieves 72–85% area under the receiver operating characteristic curve (AUROC) scores, compared to the maximum of 68% by VHM or WIsH for contigs of lengths between 200 bps to 50 kbps. We also apply the model to the Metagenomic Gut Virus (MGV) catalogue, a dataset containing a wide range of draft genomes from metagenomic samples and achieve 60–70% AUROC scores compared to that of VHM and WIsH of 52%. Surprisingly, ContigNet can also be used to predict plasmid-host contig associations with high accuracy, indicating a similar genetic exchange between mobile genetic elements and their hosts. Availability and implementation The source code of ContigNet and related datasets can be downloaded from https://github.com/tianqitang1/ContigNet.
Collapse
Affiliation(s)
- Tianqi Tang
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Shengwei Hou
- Department of Ocean Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Marine and Environmental Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Jed A Fuhrman
- Marine and Environmental Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Fengzhu Sun
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|