1
|
Xu C, Zhao LY, Ye CS, Xu KC, Xu KY. The application of machine learning in clinical microbiology and infectious diseases. Front Cell Infect Microbiol 2025; 15:1545646. [PMID: 40375898 PMCID: PMC12078339 DOI: 10.3389/fcimb.2025.1545646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2024] [Accepted: 04/08/2025] [Indexed: 05/18/2025] Open
Abstract
With the development of artificial intelligence(AI) in computer science and statistics, it has been further applied to the medical field. These applications include the management of infectious diseases, in which machine learning has created inroads in clinical microbiology, radiology, genomics, and the analysis of electronic health record data. Especially, the role of machine learning in microbiology has gradually become prominent, and it is used in etiological diagnosis, prediction of antibiotic resistance, association between human microbiome characteristics and complex host diseases, prognosis judgment, and prevention and control of infectious diseases. Machine learning in the field of microbiology mainly adopts supervised learning and unsupervised learning, involving algorithms from classification and regression to clustering and dimensionality reduction. This Review explains crucial concepts in machine learning for unfamiliar readers, describes machine learning's current applications in clinical microbiology and infectious diseases, and summarizes important approaches clinicians must be aware of when evaluating research using machine learning.
Collapse
Affiliation(s)
- Cheng Xu
- Clinical Laboratory of Chun’an First People’s Hospital, Zhejiang Provincial People’s Hospital Chun’an Branch, Hangzhou Medical College Affiliated Chun’an Hospital, Hangzhou, Zhejiang, China
| | - Ling-Yun Zhao
- Department of Medicine & Therapeutics, Prince of Wales Hospital, The Chinese University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Cun-Si Ye
- Department of Clinical Laboratory Medicine, Institution of Microbiology and Infectious Diseases, The First Affiliated Hospital, Hengyang Medical School, University of South China, Hengyang, Hunan, China
| | - Ke-Chen Xu
- School of Psychology, Zhejiang Normal University, Jinhua, China
- Key Laboratory of Intelligent Education Technology and Application of Zhejiang Province, Zhejiang Normal University, Jinhua, China
| | - Ke-Yang Xu
- Faculty of Chinese Medicine, and State Key Laboratory of Quality Research in Chinese Medicine, Macau University of Science and Technology, Macao SAR, China
| |
Collapse
|
2
|
Feng R, Ma L, Zhao J, Li X, Dong S, Wang Y, Lyu Y, Wang N, Kou B, Wang Y, Mu Y, Pan Y, Ma D. Mechanisms of efficient polyacrylamide degradation: From multi-omics analysis to structural characterization of two amidohydrolases. Int J Biol Macromol 2024; 281:136329. [PMID: 39383923 DOI: 10.1016/j.ijbiomac.2024.136329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Revised: 09/26/2024] [Accepted: 10/03/2024] [Indexed: 10/11/2024]
Abstract
Polyacrylamide (PAM) is a high molecular weight polymer with extensive applications. However, inefficient natural degradation of PAM results in its environmental accumulation. Here, using multi-omics analysis, we constructed the PAM biodegradation pathway in Klebsiella sp. PCX, an efficient PAM-degrading bacterium. Subsequently, two unclassified amidohydrolases (PCX00451 and PCX04581) were identified as key factors for rapid PAM biodegradation, both of which possessed much higher hydrolysis efficiency for PAM than for small molecule amide compounds. Besides, crystal structures of PCX00451 and PCX04581 were solved. Both two amidohydrolases were consisted with a twisted triosephosphateisomerase (TIM)-barrel and a smaller β-sandwich domain. And their binding pockets were in the conserved metal center of TIM-barrel domain. Moreover, Asp267 of PCX00451 and Asp282 of PCX04581 were examined as active sites for acid/base catalysis. Our research characterized the molecular mechanisms of two efficient amidohydrolases, providing theoretical basis and valuable tools for PAM bioremediation.
Collapse
Affiliation(s)
- Rui Feng
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China
| | - Lili Ma
- College of Chemistry and Chemical Engineering, Research Institute of Industrial Hazardous Waste Disposal and Resource Utilization, Southwest Petroleum University, Chengdu, Sichuan 610500, China
| | - Juyi Zhao
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China
| | - Xiaochen Li
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China
| | - Sijun Dong
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China
| | - Yingying Wang
- Key Laboratory of Pollution Processes and Environmental Criteria (Ministry of Education), Tianjin Key Laboratory of Environmental Remediation and Pollution Control, College of Environmental Science and Engineering, Nankai International Advanced Research Institute (Shenzhen Futian), Nankai University, Tianjin 300071, China
| | - Yang Lyu
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China
| | - Ning Wang
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China
| | - Boxiang Kou
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China
| | - Yujia Wang
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China
| | - Yao Mu
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China
| | - Ying Pan
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China
| | - Dan Ma
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei 071002, China.
| |
Collapse
|
3
|
Feng R, Zhao J, Li X, Dong S, Ma D. Structural and Mechanistic Insights into a Novel Monooxygenase for Poly(acrylic acid) Biodegradation. Int J Mol Sci 2024; 25:8871. [PMID: 39201558 PMCID: PMC11354265 DOI: 10.3390/ijms25168871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2024] [Revised: 08/09/2024] [Accepted: 08/11/2024] [Indexed: 09/02/2024] Open
Abstract
Polyacrylamide (PAM) is a high-molecular-weight polymer with extensive applications. However, the inefficient natural degradation of PAM results in environmental accumulation of the polymer. Biodegradation is an environmentally friendly approach in the field of PAM treatment. The first phase of PAM biodegradation is the deamination of PAM, forming the product poly(acrylic acid) (PAA). The second phase of PAM biodegradation involves the cleavage of PAA into small molecules, which is a crucial step in the degradation pathway of PAM. However, the enzyme that catalyzes the degradation of PAA and the molecular mechanism remain unclear. Here, a novel monooxygenase PCX02514 is identified as the key enzyme for PAA degradation. Through biochemical experiments, the monooxygenase PCX02514 oxidizes PAA with the participation of NADPH, causing the cleavage of carbon chains and a decrease in the molecular weight of PAA. In addition, the crystal structure of the monooxygenase PCX02514 is solved at a resolution of 1.97 Å. The active pocket is in a long cavity that extends from the C-terminus of the TIM barrel to the protein surface and exhibits positive electrostatic potential, thereby causing the migration of oxygen-negative ions into the active pocket and facilitating the reaction between the substrates and monooxygenase PCX02514. Moreover, Arg10-Arg125-Ser186-Arg187-His253 are proposed as potential active sites in monooxygenase PCX02514. Our research characterizes the molecular mechanism of this monooxygenase, providing a theoretical basis and valuable tools for PAM bioremediation.
Collapse
Affiliation(s)
| | | | | | | | - Dan Ma
- College of Life Sciences, Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding 071002, China; (R.F.); (J.Z.); (X.L.); (S.D.)
| |
Collapse
|
4
|
Amani K, Shivnauth V, Castroverde CDM. CBP60-DB: An AlphaFold-predicted plant kingdom-wide database of the CALMODULIN-BINDING PROTEIN 60 protein family with a novel structural clustering algorithm. PLANT DIRECT 2023; 7:e509. [PMID: 37435612 PMCID: PMC10331130 DOI: 10.1002/pld3.509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 04/17/2023] [Accepted: 05/23/2023] [Indexed: 07/13/2023]
Abstract
Molecular genetic analyses in the model species Arabidopsis thaliana have demonstrated the major roles of different CALMODULIN-BINDING PROTEIN 60 (CBP60) proteins in growth, stress signaling, and immune responses. Prominently, CBP60g and SARD1 are paralogous CBP60 transcription factors that regulate numerous components of the immune system, such as cell surface and intracellular immune receptors, MAP kinases, WRKY transcription factors, and biosynthetic enzymes for immunity-activating metabolites salicylic acid (SA) and N-hydroxypipecolic acid (NHP). However, their function, regulation, and diversification in most species remain unclear. Here, we have created CBP60-DB (https://cbp60db.wlu.ca/), a structural and bioinformatic database that comprehensively characterized 1052 CBP60 gene homologs (encoding 2376 unique transcripts and 1996 unique proteins) across 62 phylogenetically diverse genomes in the plant kingdom. We have employed deep learning-predicted structural analyses using AlphaFold2 and then generated dedicated web pages for all plant CBP60 proteins. Importantly, we have generated a novel clustering visualization algorithm to interrogate kingdom-wide structural similarities for more efficient inference of conserved functions across various plant taxa. Because well-characterized CBP60 proteins in Arabidopsis are known to be transcription factors with putative calmodulin-binding domains, we have integrated external bioinformatic resources to analyze protein domains and motifs. Collectively, we present a plant kingdom-wide identification of this important protein family in a user-friendly AlphaFold-anchored database, representing a novel and significant resource for the broader plant biology community.
Collapse
Affiliation(s)
- Keaun Amani
- Department of BiologyWilfrid Laurier UniversityWaterlooOntarioCanada
| | - Vanessa Shivnauth
- Department of BiologyWilfrid Laurier UniversityWaterlooOntarioCanada
| | | |
Collapse
|
5
|
Sykes J, Holland BR, Charleston MA. A review of visualisations of protein fold networks and their relationship with sequence and function. Biol Rev Camb Philos Soc 2023; 98:243-262. [PMID: 36210328 PMCID: PMC10092621 DOI: 10.1111/brv.12905] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 09/08/2022] [Accepted: 09/09/2022] [Indexed: 01/12/2023]
Abstract
Proteins form arguably the most significant link between genotype and phenotype. Understanding the relationship between protein sequence and structure, and applying this knowledge to predict function, is difficult. One way to investigate these relationships is by considering the space of protein folds and how one might move from fold to fold through similarity, or potential evolutionary relationships. The many individual characterisations of fold space presented in the literature can tell us a lot about how well the current Protein Data Bank represents protein fold space, how convergence and divergence may affect protein evolution, how proteins affect the whole of which they are part, and how proteins themselves function. A synthesis of these different approaches and viewpoints seems the most likely way to further our knowledge of protein structure evolution and thus, facilitate improved protein structure design and prediction.
Collapse
Affiliation(s)
- Janan Sykes
- School of Natural Sciences, University of Tasmania, Private Bag 37, Hobart, Tasmania, 7001, Australia
| | - Barbara R Holland
- School of Natural Sciences, University of Tasmania, Private Bag 37, Hobart, Tasmania, 7001, Australia
| | - Michael A Charleston
- School of Natural Sciences, University of Tasmania, Private Bag 37, Hobart, Tasmania, 7001, Australia
| |
Collapse
|
6
|
Singh K, Baird M, Fischer R, Chaitankar V, Seifuddin F, Chen YC, Tunc I, Waterman CM, Pirooznia M. Misregulation of ELK1, AP1, and E12 Transcription Factor Networks Is Associated with Melanoma Progression. Cancers (Basel) 2020; 12:E458. [PMID: 32079144 PMCID: PMC7072154 DOI: 10.3390/cancers12020458] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Revised: 02/10/2020] [Accepted: 02/12/2020] [Indexed: 01/17/2023] Open
Abstract
Melanoma is among the most malignant cutaneous cancers and when metastasized results in dramatically high mortality. Despite advances in high-throughput gene expression profiling in cancer transcriptomic studies, our understanding of mechanisms driving melanoma progression is still limited. We present here an in-depth bioinformatic analysis of the melanoma RNAseq, chromatin immunoprecipitation (ChIP)seq, and single-cell (sc)RNA seq data to understand cancer progression. Specifically, we have performed a consensus network analysis of RNA-seq data from clinically re-grouped melanoma samples to identify gene co-expression networks that are conserved in early (stage 1) and late (stage 4/invasive) stage melanoma. Overlaying the fold-change information on co-expression networks revealed several coordinately up or down-regulated subnetworks that may play a critical role in melanoma progression. Furthermore, by incorporating histone lysine-27 acetylation information and highly expressed genes identified from the single-cell RNA data from melanoma patient samples, we present a comprehensive list of pathways, putative protein-protein interactions (PPIs) and transcription factor (TF) networks that are driving cancer progression. From this analysis, we have identified Elk1, AP1 and E12 TF networks that coordinately change expression in late melanoma when compared to early melanoma, implicating these TFs in melanoma progression. Additionally, the sumoylation-associated interactome is upregulated in invasive melanoma. Together, this bioinformatic analysis potentially implicates a combination of TF networks and PPIs in melanoma progression, which if confirmed in the experimental systems, could be used as targets for drug intervention in melanoma.
Collapse
Affiliation(s)
- Komudi Singh
- Bioinformatics and Computational Biology Laboratory, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; (K.S.); (V.C.); (F.S.); (Y.-C.C.); (I.T.)
| | - Michelle Baird
- Cell and Developmental Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; (M.B.); (R.F.); (C.M.W.)
| | - Robert Fischer
- Cell and Developmental Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; (M.B.); (R.F.); (C.M.W.)
| | - Vijender Chaitankar
- Bioinformatics and Computational Biology Laboratory, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; (K.S.); (V.C.); (F.S.); (Y.-C.C.); (I.T.)
| | - Fayaz Seifuddin
- Bioinformatics and Computational Biology Laboratory, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; (K.S.); (V.C.); (F.S.); (Y.-C.C.); (I.T.)
| | - Yun-Ching Chen
- Bioinformatics and Computational Biology Laboratory, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; (K.S.); (V.C.); (F.S.); (Y.-C.C.); (I.T.)
| | - Ilker Tunc
- Bioinformatics and Computational Biology Laboratory, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; (K.S.); (V.C.); (F.S.); (Y.-C.C.); (I.T.)
| | - Clare M. Waterman
- Cell and Developmental Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; (M.B.); (R.F.); (C.M.W.)
| | - Mehdi Pirooznia
- Bioinformatics and Computational Biology Laboratory, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; (K.S.); (V.C.); (F.S.); (Y.-C.C.); (I.T.)
| |
Collapse
|
7
|
Hu GM, Secario MK, Chen CM. SeQuery: an interactive graph database for visualizing the GPCR superfamily. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2019:5522636. [PMID: 31236561 PMCID: PMC6591535 DOI: 10.1093/database/baz073] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Revised: 04/28/2019] [Accepted: 05/16/2019] [Indexed: 01/01/2023]
Abstract
The rate at which new protein and gene sequences are being discovered has grown explosively in the omics era, which has increasingly complicated the efficient characterization and analysis of their biological properties. In this study, we propose a web-based graphical database tool, SeQuery, for intuitively visualizing proteome/genome networks by integrating the sequential, structural and functional information of sequences. As a demonstration of our tool’s effectiveness, we constructed a graph database of G protein-coupled receptor (GPCR) sequences by integrating data from the UniProt, GPCRdb and RCSB PDB databases. Our tool attempts to achieve two goals: (i) given the sequence of a query protein, correctly and efficiently identify whether the protein is a GPCR, and, if so, define its sequential and functional roles in the GPCR superfamily; and (ii) present a panoramic view of the GPCR superfamily and its network centralities that allows users to explore the superfamily at various resolutions. Such a bottom-up-to-top-down view can provide the users with a comprehensive understanding of the GPCR superfamily through interactive navigation of the graph database. A test of SeQuery with the GPCR2841 dataset shows that it correctly identifies 99 out of 100 queried protein sequences. The developed tool is readily applicable to other biological networks, and we aim to expand SeQuery by including additional biological databases in the near future.
Collapse
Affiliation(s)
- Geng-Ming Hu
- Department of Physics, National Taiwan Normal University, 88 Sec. 4 Ting-Chou Rd., Taipei 11677, Taiwan
| | - M K Secario
- Department of Physics, National Taiwan Normal University, 88 Sec. 4 Ting-Chou Rd., Taipei 11677, Taiwan.,Department of Applied Chemistry, National Chiao Tung University, 1001 Ta Hsueh Rd., Hsinchu 300, Taiwan
| | - Chi-Ming Chen
- Department of Physics, National Taiwan Normal University, 88 Sec. 4 Ting-Chou Rd., Taipei 11677, Taiwan
| |
Collapse
|
8
|
Gimenes NC, Silveira E, Tambourgi EB. An Overview of Proteases: Production, Downstream Processes and Industrial Applications. SEPARATION & PURIFICATION REVIEWS 2019. [DOI: 10.1080/15422119.2019.1677249] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
| | - Edgar Silveira
- Biotechnology Institute, Federal University of Uberlandia, Uberlandia, Minas Gerais, Brazil
- Brazilian Savanna’s, Diversity Research Center, Federal University of Uberlandia, Uberlandia, Minas Gerais, Brazil
| | | |
Collapse
|
9
|
Chen W, Li W, Huang G, Flavel M. The Applications of Clustering Methods in Predicting Protein Functions. CURR PROTEOMICS 2019. [DOI: 10.2174/1570164616666181212114612] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
The understanding of protein function is essential to the study of biological
processes. However, the prediction of protein function has been a difficult task for bioinformatics to
overcome. This has resulted in many scholars focusing on the development of computational methods
to address this problem.
Objective:
In this review, we introduce the recently developed computational methods of protein function
prediction and assess the validity of these methods. We then introduce the applications of clustering
methods in predicting protein functions.
Collapse
Affiliation(s)
- Weiyang Chen
- College of Information, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
| | - Weiwei Li
- College of Information, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
| | - Guohua Huang
- College of Information Engineering, Shaoyang University, Shaoyang, Hunan 422000, China
| | - Matthew Flavel
- School of Life Sciences, La Trobe University, Bundoora, Vic 3083, Australia
| |
Collapse
|
10
|
Unsupervised cluster analyses of character networks in fiction: Community structure and centrality. Knowl Based Syst 2019. [DOI: 10.1016/j.knosys.2018.10.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
11
|
Sankar MG, Roy S, Tran TTN, Wittstein K, Bauer JO, Strohmann C, Ziegler S, Kumar K. Scaffold Diversity Synthesis Delivers Complex, Structurally, and Functionally Distinct Tetracyclic Benzopyrones. ChemistryOpen 2018; 7:302-309. [PMID: 29721402 PMCID: PMC5917230 DOI: 10.1002/open.201800025] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2018] [Indexed: 12/19/2022] Open
Abstract
Complexity-generating chemical transformations that afford novel molecular scaffolds enriched in sp3 character are highly desired. Here, we present a highly stereoselective scaffold diversity synthesis approach that utilizes cascade double-annulation reactions of diverse pairs of zwitterionic and non-zwitterionic partners with 3-formylchromones to generate highly complex tetracyclic benzopyrones. Each pair of annulation partners adds to the common chroman-4-one scaffold to build two new rings, supporting up to four contiguous chiral centers that include an all-carbon quaternary center. Differently ring-fused benzopyrones display different biological activities, thus demonstrating their immense potential in medicinal chemistry and chemical biology research.
Collapse
Affiliation(s)
- Muthukumar G. Sankar
- Department of Chemical BiologyMax Planck Institute of Molecular PhysiologyOtto-Hahn Str. 1144227DortmundGermany
| | - Sayantani Roy
- Department of Chemical BiologyMax Planck Institute of Molecular PhysiologyOtto-Hahn Str. 1144227DortmundGermany
| | - Tuyen Thi Ngoc Tran
- Department of Chemical BiologyMax Planck Institute of Molecular PhysiologyOtto-Hahn Str. 1144227DortmundGermany
- Faculty of Chemistry and Chemical BiologyTechnical University of DortmundOtto-Hahn Str. 644227DortmundGermany
| | - Kathrin Wittstein
- Department of Chemical BiologyMax Planck Institute of Molecular PhysiologyOtto-Hahn Str. 1144227DortmundGermany
- Faculty of Chemistry and Chemical BiologyTechnical University of DortmundOtto-Hahn Str. 644227DortmundGermany
| | - Jonathan O. Bauer
- Faculty of Chemistry and Chemical BiologyTechnical University of DortmundOtto-Hahn Str. 644227DortmundGermany
| | - Carsten Strohmann
- Faculty of Chemistry and Chemical BiologyTechnical University of DortmundOtto-Hahn Str. 644227DortmundGermany
| | - Slava Ziegler
- Department of Chemical BiologyMax Planck Institute of Molecular PhysiologyOtto-Hahn Str. 1144227DortmundGermany
| | - Kamal Kumar
- Department of Chemical BiologyMax Planck Institute of Molecular PhysiologyOtto-Hahn Str. 1144227DortmundGermany
- Faculty of Chemistry and Chemical BiologyTechnical University of DortmundOtto-Hahn Str. 644227DortmundGermany
| |
Collapse
|
12
|
Gaded V, Anand R. Nucleobase deaminases: a potential enzyme system for new therapies. RSC Adv 2018; 8:23567-23577. [PMID: 35540270 PMCID: PMC9081823 DOI: 10.1039/c8ra04112a] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Accepted: 06/11/2018] [Indexed: 11/21/2022] Open
Abstract
This review presents an overview of the structure, function and mechanism of CDA deaminases and their potential as enzyme systems for development of new antimicrobial therapies.
Collapse
Affiliation(s)
- Vandana Gaded
- Department of Chemistry
- Indian Institute of Technology Bombay
- Mumbai
- India
| | - Ruchi Anand
- Department of Chemistry
- Indian Institute of Technology Bombay
- Mumbai
- India
| |
Collapse
|
13
|
Holliday GL, Brown SD, Akiva E, Mischel D, Hicks MA, Morris JH, Huang CC, Meng EC, Pegg SCH, Ferrin TE, Babbitt PC. Biocuration in the structure-function linkage database: the anatomy of a superfamily. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2017; 2017:3074783. [PMID: 28365730 PMCID: PMC5467563 DOI: 10.1093/database/bax006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Accepted: 01/23/2017] [Indexed: 12/11/2022]
Abstract
With ever-increasing amounts of sequence data available in both the primary literature and sequence repositories, there is a bottleneck in annotating molecular function to a sequence. This article describes the biocuration process and methods used in the structure-function linkage database (SFLD) to help address some of the challenges. We discuss how the hierarchy within the SFLD allows us to infer detailed functional properties for functionally diverse enzyme superfamilies in which all members are homologous, conserve an aspect of their chemical function and have associated conserved structural features that enable the chemistry. Also presented is the Enzyme Structure-Function Ontology (ESFO), which has been designed to capture the relationships between enzyme sequence, structure and function that underlie the SFLD and is used to guide the biocuration processes within the SFLD. Database URL:http://sfld.rbvi.ucsf.edu/
Collapse
Affiliation(s)
- Gemma L Holliday
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA
| | - Shoshana D Brown
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA
| | - Eyal Akiva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA
| | - David Mischel
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA
| | - Michael A Hicks
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA.,Human Longevity, Inc, San Diego, CA 92121, USA
| | - John H Morris
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94143, USA
| | - Conrad C Huang
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94143, USA
| | - Elaine C Meng
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94143, USA
| | | | - Thomas E Ferrin
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94143, USA.,California Institute for Quantitative Biosciences, University of California, San Francisco, CA 94158, USA
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA.,Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94143, USA.,California Institute for Quantitative Biosciences, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
14
|
Abstract
In this study, we delineate an unsupervised clustering algorithm, minimum span clustering (MSC), and apply it to detect G-protein coupled receptor (GPCR) sequences and to study the GPCR network using a base dataset of 2770 GPCR and 652 non-GPCR sequences. High detection accuracy can be achieved with a proper dataset. The clustering results of GPCRs derived from MSC show a strong correlation between their sequences and functions. By comparing our level 1 MSC results with the GPCRdb classification, the consistency is 87.9% for the fourth level of GPCRdb, 89.2% for the third level, 98.4% for the second level, and 100% for the top level (the lowest resolution level of GPCRdb). The MSC results of GPCRs can be well explained by estimating the selective pressure of GPCRs, as exemplified by investigating the largest two subfamilies, peptide receptors (PRs) and olfactory receptors (ORs), in class A GPCRs. PRs are decomposed into three groups due to a positive selective pressure, whilst ORs remain as a single group due to a negative selective pressure. Finally, we construct and compare phylogenetic trees using distance-based and character-based methods, a combination of which could convey more comprehensive information about the evolution of GPCRs.
Collapse
|