1
|
Aplakidou E, Vergoulidis N, Chasapi M, Venetsianou NK, Kokoli M, Panagiotopoulou E, Iliopoulos I, Karatzas E, Pafilis E, Georgakopoulos-Soares I, Kyrpides NC, Pavlopoulos GA, Baltoumas FA. Visualizing metagenomic and metatranscriptomic data: A comprehensive review. Comput Struct Biotechnol J 2024; 23:2011-2033. [PMID: 38765606 PMCID: PMC11101950 DOI: 10.1016/j.csbj.2024.04.060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Revised: 04/25/2024] [Accepted: 04/25/2024] [Indexed: 05/22/2024] Open
Abstract
The fields of Metagenomics and Metatranscriptomics involve the examination of complete nucleotide sequences, gene identification, and analysis of potential biological functions within diverse organisms or environmental samples. Despite the vast opportunities for discovery in metagenomics, the sheer volume and complexity of sequence data often present challenges in processing analysis and visualization. This article highlights the critical role of advanced visualization tools in enabling effective exploration, querying, and analysis of these complex datasets. Emphasizing the importance of accessibility, the article categorizes various visualizers based on their intended applications and highlights their utility in empowering bioinformaticians and non-bioinformaticians to interpret and derive insights from meta-omics data effectively.
Collapse
Affiliation(s)
- Eleni Aplakidou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Nikolaos Vergoulidis
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Maria Chasapi
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Nefeli K. Venetsianou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Maria Kokoli
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Eleni Panagiotopoulou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Ioannis Iliopoulos
- Department of Basic Sciences, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Evangelos Pafilis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Heraklion, Greece
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Nikos C. Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Center of New Biotechnologies & Precision Medicine, Department of Medicine, School of Health Sciences, National and Kapodistrian University of Athens, Greece
- Hellenic Army Academy, 16673 Vari, Greece
| | - Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| |
Collapse
|
2
|
Butterfield ER, Obado SO, Scutts SR, Zhang W, Chait BT, Rout MP, Field MC. A lineage-specific protein network at the trypanosome nuclear envelope. Nucleus 2024; 15:2310452. [PMID: 38605598 PMCID: PMC11018031 DOI: 10.1080/19491034.2024.2310452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 01/18/2024] [Indexed: 04/13/2024] Open
Abstract
The nuclear envelope (NE) separates translation and transcription and is the location of multiple functions, including chromatin organization and nucleocytoplasmic transport. The molecular basis for many of these functions have diverged between eukaryotic lineages. Trypanosoma brucei, a member of the early branching eukaryotic lineage Discoba, highlights many of these, including a distinct lamina and kinetochore composition. Here, we describe a cohort of proteins interacting with both the lamina and NPC, which we term lamina-associated proteins (LAPs). LAPs represent a diverse group of proteins, including two candidate NPC-anchoring pore membrane proteins (POMs) with architecture conserved with S. cerevisiae and H. sapiens, and additional peripheral components of the NPC. While many of the LAPs are Kinetoplastid specific, we also identified broadly conserved proteins, indicating an amalgam of divergence and conservation within the trypanosome NE proteome, highlighting the diversity of nuclear biology across the eukaryotes, increasing our understanding of eukaryotic and NPC evolution.
Collapse
Affiliation(s)
| | - Samson O. Obado
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY, USA
| | - Simon R. Scutts
- Department of Pathology, University of Cambridge, Cambridge, UK
| | - Wenzhu Zhang
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY, USA
| | - Brian T. Chait
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY, USA
| | - Michael P. Rout
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY, USA
| | - Mark C. Field
- School of Life Sciences, University of Dundee, Dundee, UK
- Biology Centre, Czech Academy of Sciences, Institute of Parasitology, České Budějovice, Czech Republic
| |
Collapse
|
3
|
Xia Y, Pan X, Shen HB. A comprehensive survey on protein-ligand binding site prediction. Curr Opin Struct Biol 2024; 86:102793. [PMID: 38447285 DOI: 10.1016/j.sbi.2024.102793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 02/18/2024] [Accepted: 02/18/2024] [Indexed: 03/08/2024]
Abstract
Protein-ligand binding site prediction is critical for protein function annotation and drug discovery. Biological experiments are time-consuming and require significant equipment, materials, and labor resources. Developing accurate and efficient computational methods for protein-ligand interaction prediction is essential. Here, we summarize the key challenges associated with ligand binding site (LBS) prediction and introduce recently published methods from their input features, computational algorithms, and ligand types. Furthermore, we investigate the specificity of allosteric site identification as a particular LBS type. Finally, we discuss the prospective directions for machine learning-based LBS prediction in the near future.
Collapse
Affiliation(s)
- Ying Xia
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| | - Xiaoyong Pan
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China.
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China.
| |
Collapse
|
4
|
Gaschignard G, Millet M, Bruley A, Benzerara K, Dezi M, Skouri-Panet F, Duprat E, Callebaut I. AlphaFold2-guided description of CoBaHMA, a novel family of bacterial domains within the heavy-metal-associated superfamily. Proteins 2024; 92:776-794. [PMID: 38258321 DOI: 10.1002/prot.26668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 12/22/2023] [Accepted: 01/01/2024] [Indexed: 01/24/2024]
Abstract
Three-dimensional (3D) structure information, now available at the proteome scale, may facilitate the detection of remote evolutionary relationships in protein superfamilies. Here, we illustrate this with the identification of a novel family of protein domains related to the ferredoxin-like superfold, by combining (i) transitive sequence similarity searches, (ii) clustering approaches, and (iii) the use of AlphaFold2 3D structure models. Domains of this family were initially identified in relation with the intracellular biomineralization of calcium carbonates by Cyanobacteria. They are part of the large heavy-metal-associated (HMA) superfamily, departing from the latter by specific sequence and structural features. In particular, most of them share conserved basic amino acids (hence their name CoBaHMA for Conserved Basic residues HMA), forming a positively charged surface, which is likely to interact with anionic partners. CoBaHMA domains are found in diverse modular organizations in bacteria, existing in the form of monodomain proteins or as part of larger proteins, some of which are membrane proteins involved in transport or lipid metabolism. This suggests that the CoBaHMA domains may exert a regulatory function, involving interactions with anionic lipids. This hypothesis might have a particular resonance in the context of the compartmentalization observed for cyanobacterial intracellular calcium carbonates.
Collapse
Affiliation(s)
- Geoffroy Gaschignard
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Maxime Millet
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Apolline Bruley
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Karim Benzerara
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Manuela Dezi
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Feriel Skouri-Panet
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Elodie Duprat
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| |
Collapse
|
5
|
von Kügelgen A, Cassidy CK, van Dorst S, Pagani LL, Batters C, Ford Z, Löwe J, Alva V, Stansfeld PJ, Bharat TAM. Membraneless channels sieve cations in ammonia-oxidizing marine archaea. Nature 2024:10.1038/s41586-024-07462-5. [PMID: 38811725 DOI: 10.1038/s41586-024-07462-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 04/24/2024] [Indexed: 05/31/2024]
Abstract
Nitrosopumilus maritimus is an ammonia-oxidizing archaeon that is crucial to the global nitrogen cycle1,2. A critical step for nitrogen oxidation is the entrapment of ammonium ions from a dilute marine environment at the cell surface and their subsequent channelling to the cell membrane of N. maritimus. Here we elucidate the structure of the molecular machinery responsible for this process, comprising the surface layer (S-layer), using electron cryotomography and subtomogram averaging from cells. We supplemented our in situ structure of the ammonium-binding S-layer array with a single-particle electron cryomicroscopy structure, revealing detailed features of this immunoglobulin-rich and glycan-decorated S-layer. Biochemical analyses showed strong ammonium binding by the cell surface, which was lost after S-layer disassembly. Sensitive bioinformatic analyses identified similar S-layers in many ammonia-oxidizing archaea, with conserved sequence and structural characteristics. Moreover, molecular simulations and structure determination of ammonium-enriched specimens enabled us to examine the cation-binding properties of the S-layer, revealing how it concentrates ammonium ions on its cell-facing side, effectively acting as a multichannel sieve on the cell membrane. This in situ structural study illuminates the biogeochemically essential process of ammonium binding and channelling, common to many marine microorganisms that are fundamental to the nitrogen cycle.
Collapse
Affiliation(s)
- Andriko von Kügelgen
- Structural Studies Division, MRC Laboratory of Molecular Biology, Cambridge, UK
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - C Keith Cassidy
- Department of Physics and Astronomy, University of Missouri-Columbia, Columbia, MO, USA
| | - Sofie van Dorst
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Lennart L Pagani
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Christopher Batters
- Protein and Nucleic Acid Chemistry Division, MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Zephyr Ford
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Jan Löwe
- Structural Studies Division, MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Vikram Alva
- Department of Protein Evolution, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Phillip J Stansfeld
- School of Life Sciences and Department of Chemistry, University of Warwick, Coventry, UK
| | - Tanmay A M Bharat
- Structural Studies Division, MRC Laboratory of Molecular Biology, Cambridge, UK.
| |
Collapse
|
6
|
Medvedev KE, Zhang J, Schaeffer RD, Kinch LN, Cong Q, Grishin NV. Structure classification of the proteins from Salmonella enterica pangenome revealed novel potential pathogenicity islands. Sci Rep 2024; 14:12260. [PMID: 38806511 PMCID: PMC11133325 DOI: 10.1038/s41598-024-60991-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Accepted: 04/30/2024] [Indexed: 05/30/2024] Open
Abstract
Salmonella enterica is a pathogenic bacterium known for causing severe typhoid fever in humans, making it important to study due to its potential health risks and significant impact on public health. This study provides evolutionary classification of proteins from Salmonella enterica pangenome. We classified 17,238 domains from 13,147 proteins from 79,758 Salmonella enterica strains and studied in detail domains of 272 proteins from 14 characterized Salmonella pathogenicity islands (SPIs). Among SPIs-related proteins, 90 proteins function in the secretion machinery. 41% domains of SPI proteins have no previous sequence annotation. By comparing clinical and environmental isolates, we identified 3682 proteins that are overrepresented in clinical group that we consider as potentially pathogenic. Among domains of potentially pathogenic proteins only 50% domains were annotated by sequence methods previously. Moreover, 36% (1330 out of 3682) of potentially pathogenic proteins cannot be classified into Evolutionary Classification of Protein Domains database (ECOD). Among classified domains of potentially pathogenic proteins the most populated homology groups include helix-turn-helix (HTH), Immunoglobulin-related, and P-loop domains-related. Functional analysis revealed overrepresentation of these protein in biological processes related to viral entry into host cell, antibiotic biosynthesis, DNA metabolism and conformation change, and underrepresentation in translational processes. Analysis of the potentially pathogenic proteins indicates that they form 119 clusters or novel potential pathogenicity islands (NPPIs) within the Salmonella genome, suggesting their potential contribution to the bacterium's virulence. One of the NPPIs revealed significant overrepresentation of potentially pathogenic proteins. Overall, our analysis revealed that identified potentially pathogenic proteins are poorly studied.
Collapse
Affiliation(s)
- Kirill E Medvedev
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA.
| | - Jing Zhang
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - R Dustin Schaeffer
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Lisa N Kinch
- Department of Molecular Biology, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Qian Cong
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Nick V Grishin
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| |
Collapse
|
7
|
Bryant P, Kelkar A, Guljas A, Clementi C, Noé F. Structure prediction of protein-ligand complexes from sequence information with Umol. Nat Commun 2024; 15:4536. [PMID: 38806453 PMCID: PMC11133481 DOI: 10.1038/s41467-024-48837-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 05/15/2024] [Indexed: 05/30/2024] Open
Abstract
Protein-ligand docking is an established tool in drug discovery and development to narrow down potential therapeutics for experimental testing. However, a high-quality protein structure is required and often the protein is treated as fully or partially rigid. Here we develop an AI system that can predict the fully flexible all-atom structure of protein-ligand complexes directly from sequence information. We find that classical docking methods are still superior, but depend upon having crystal structures of the target protein. In addition to predicting flexible all-atom structures, predicted confidence metrics (plDDT) can be used to select accurate predictions as well as to distinguish between strong and weak binders. The advances presented here suggest that the goal of AI-based drug discovery is one step closer, but there is still a way to go to grasp the complexity of protein-ligand interactions fully. Umol is available at: https://github.com/patrickbryant1/Umol .
Collapse
Affiliation(s)
- Patrick Bryant
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany.
- The Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Svante Arrhenius väg 20C, 114 18, Stockholm, Sweden.
- Science for Life Laboratory, 172 21, Solna, Sweden.
| | - Atharva Kelkar
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Andrea Guljas
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Cecilia Clementi
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
- Microsoft Research AI4Science, Karl-Liebknecht Str. 32, 10178, Berlin, Germany
| |
Collapse
|
8
|
Gao Y, Zhong Z, Zhang D, Zhang J, Li YX. Exploring the roles of ribosomal peptides in prokaryote-phage interactions through deep learning-enabled metagenome mining. MICROBIOME 2024; 12:94. [PMID: 38790030 PMCID: PMC11118758 DOI: 10.1186/s40168-024-01807-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Accepted: 04/04/2024] [Indexed: 05/26/2024]
Abstract
BACKGROUND Microbial secondary metabolites play a crucial role in the intricate interactions within the natural environment. Among these metabolites, ribosomally synthesized and post-translationally modified peptides (RiPPs) are becoming a promising source of therapeutic agents due to their structural diversity and functional versatility. However, their biosynthetic capacity and ecological functions remain largely underexplored. RESULTS Here, we aim to explore the biosynthetic profile of RiPPs and their potential roles in the interactions between microbes and viruses in the ocean, which encompasses a vast diversity of unique biomes that are rich in interactions and remains chemically underexplored. We first developed TrRiPP to identify RiPPs from ocean metagenomes, a deep learning method that detects RiPP precursors in a hallmark gene-independent manner to overcome the limitations of classic methods in processing highly fragmented metagenomic data. Applying this method to metagenomes from the global ocean microbiome, we uncover a diverse array of previously uncharacterized putative RiPP families with great novelty and diversity. Through correlation analysis based on metatranscriptomic data, we observed a high prevalence of antiphage defense-related and phage-related protein families that were co-expressed with RiPP families. Based on this putative association between RiPPs and phage infection, we constructed an Ocean Virus Database (OVD) and established a RiPP-involving host-phage interaction network through host prediction and co-expression analysis, revealing complex connectivities linking RiPP-encoding prokaryotes, RiPP families, viral protein families, and phages. These findings highlight the potential of RiPP families involved in prokaryote-phage interactions and coevolution, providing insights into their ecological functions in the ocean microbiome. CONCLUSIONS This study provides a systematic investigation of the biosynthetic potential of RiPPs from the ocean microbiome at a global scale, shedding light on the essential insights into the ecological functions of RiPPs in prokaryote-phage interactions through the integration of deep learning approaches, metatranscriptomic data, and host-phage connectivity. This study serves as a valuable example of exploring the ecological functions of bacterial secondary metabolites, particularly their associations with unexplored microbial interactions. Video Abstract.
Collapse
Affiliation(s)
- Ying Gao
- CYM305, Department of Chemistry and The Swire Institute of Marine Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region, 999077, China
| | - Zheng Zhong
- CYM305, Department of Chemistry and The Swire Institute of Marine Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region, 999077, China
| | - Dengwei Zhang
- CYM305, Department of Chemistry and The Swire Institute of Marine Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region, 999077, China
| | - Jian Zhang
- CYM305, Department of Chemistry and The Swire Institute of Marine Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region, 999077, China
| | - Yong-Xin Li
- CYM305, Department of Chemistry and The Swire Institute of Marine Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region, 999077, China.
| |
Collapse
|
9
|
Bai H, Lewitus E, Li Y, Thomas PV, Zemil M, Merbah M, Peterson CE, Thuraisamy T, Rees PA, Hajduczki A, Dussupt V, Slike B, Mendez-Rivera L, Schmid A, Kavusak E, Rao M, Smith G, Frey J, Sims A, Wieczorek L, Polonis V, Krebs SJ, Ake JA, Vasan S, Bolton DL, Joyce MG, Townsley S, Rolland M. Contemporary HIV-1 consensus Env with AI-assisted redesigned hypervariable loops promote antibody binding. Nat Commun 2024; 15:3924. [PMID: 38724518 PMCID: PMC11082178 DOI: 10.1038/s41467-024-48139-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 04/19/2024] [Indexed: 05/12/2024] Open
Abstract
An effective HIV-1 vaccine must elicit broadly neutralizing antibodies (bnAbs) against highly diverse Envelope glycoproteins (Env). Since Env with the longest hypervariable (HV) loops is more resistant to the cognate bnAbs than Env with shorter HV loops, we redesigned hypervariable loops for updated Env consensus sequences of subtypes B and C and CRF01_AE. Using modeling with AlphaFold2, we reduced the length of V1, V2, and V5 HV loops while maintaining the integrity of the Env structure and glycan shield, and modified the V4 HV loop. Spacers are designed to limit strain-specific targeting. All updated Env are infectious as pseudoviruses. Preliminary structural characterization suggests that the modified HV loops have a limited impact on Env's conformation. Binding assays show improved binding to modified subtype B and CRF01_AE Env but not to subtype C Env. Neutralization assays show increases in sensitivity to bnAbs, although not always consistently across clades. Strikingly, the HV loop modification renders the resistant CRF01_AE Env sensitive to 10-1074 despite the absence of a glycan at N332.
Collapse
Affiliation(s)
- Hongjun Bai
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Eric Lewitus
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Yifan Li
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Paul V Thomas
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
- Emerging Infectious Disease Branch, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Michelle Zemil
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Mélanie Merbah
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Caroline E Peterson
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
- Emerging Infectious Disease Branch, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Thujitha Thuraisamy
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Phyllis A Rees
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
- Emerging Infectious Disease Branch, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Agnes Hajduczki
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
- Emerging Infectious Disease Branch, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Vincent Dussupt
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Bonnie Slike
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Letzibeth Mendez-Rivera
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Annika Schmid
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Erin Kavusak
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Mekhala Rao
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Gabriel Smith
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Jessica Frey
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Alicea Sims
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Lindsay Wieczorek
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Victoria Polonis
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Shelly J Krebs
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Julie A Ake
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Sandhya Vasan
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Diane L Bolton
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - M Gordon Joyce
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
- Emerging Infectious Disease Branch, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Samantha Townsley
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Morgane Rolland
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA.
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA.
| |
Collapse
|
10
|
Tominaga K, Ozaki S, Sato S, Katayama T, Nishimura Y, Omae K, Iwasaki W. Frequent nonhomologous replacement of replicative helicase loaders by viruses in Vibrionaceae. Proc Natl Acad Sci U S A 2024; 121:e2317954121. [PMID: 38683976 PMCID: PMC11087808 DOI: 10.1073/pnas.2317954121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 03/14/2024] [Indexed: 05/02/2024] Open
Abstract
Several microbial genomes lack textbook-defined essential genes. If an essential gene is absent from a genome, then an evolutionarily independent gene of unknown function complements its function. Here, we identified frequent nonhomologous replacement of an essential component of DNA replication initiation, a replicative helicase loader gene, in Vibrionaceae. Our analysis of Vibrionaceae genomes revealed two genes with unknown function, named vdhL1 and vdhL2, that were substantially enriched in genomes without the known helicase-loader genes. These genes showed no sequence similarities to genes with known function but encoded proteins structurally similar with a viral helicase loader. Analyses of genomic syntenies and coevolution with helicase genes suggested that vdhL1/2 encodes a helicase loader. The in vitro assay showed that Vibrio harveyi VdhL1 and Vibrio ezurae VdhL2 promote the helicase activity of DnaB. Furthermore, molecular phylogenetics suggested that vdhL1/2 were derived from phages and replaced an intrinsic helicase loader gene of Vibrionaceae over 20 times. This high replacement frequency implies the host's advantage in acquiring a viral helicase loader gene.
Collapse
Affiliation(s)
- Kento Tominaga
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba277-0882, Japan
| | - Shogo Ozaki
- Department of Molecular Biology, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka812-8582, Japan
| | - Shohei Sato
- Department of Molecular Biology, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka812-8582, Japan
| | - Tsutomu Katayama
- Department of Molecular Biology, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka812-8582, Japan
| | - Yuki Nishimura
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba277-0882, Japan
| | - Kimiho Omae
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba277-0882, Japan
| | - Wataru Iwasaki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba277-0882, Japan
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo113-0032, Japan
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba277-0882, Japan
- Atmosphere and Ocean Research Institute, The University of Tokyo, Chiba277-8564, Japan
- Institute for Quantitative Biosciences, The University of Tokyo, Tokyo113-0032, Japan
- Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Tokyo113-8657, Japan
| |
Collapse
|
11
|
Wallner ES, Mair A, Handler D, McWhite C, Xu SL, Dolan L, Bergmann DC. Spatially resolved proteomics of the Arabidopsis stomatal lineage identifies polarity complexes for cell divisions and stomatal pores. Dev Cell 2024; 59:1096-1109.e5. [PMID: 38518768 DOI: 10.1016/j.devcel.2024.03.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 02/23/2024] [Accepted: 03/05/2024] [Indexed: 03/24/2024]
Abstract
Cell polarity is used to guide asymmetric divisions and create morphologically diverse cells. We find that two oppositely oriented cortical polarity domains present during the asymmetric divisions in the Arabidopsis stomatal lineage are reconfigured into polar domains marking ventral (pore-forming) and outward-facing domains of maturing stomatal guard cells. Proteins that define these opposing polarity domains were used as baits in miniTurboID-based proximity labeling. Among differentially enriched proteins, we find kinases, putative microtubule-interacting proteins, and polar SOSEKIs with their effector ANGUSTIFOLIA. Using AI-facilitated protein structure prediction models, we identify potential protein-protein interaction interfaces among them. Functional and localization analyses of the polarity protein OPL2 and its putative interaction partners suggest a positive interaction with mitotic microtubules and a role in cytokinesis. This combination of proteomics and structural modeling with live-cell imaging provides insights into how polarity is rewired in different cell types and cell-cycle stages.
Collapse
Affiliation(s)
- Eva-Sophie Wallner
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA; Gregor Mendel Institute, Dr. Bohr-Gasse 3, 1030 Wien, Austria; Howard Hughes Medical Institute, Stanford, CA 94305, USA.
| | - Andrea Mair
- Howard Hughes Medical Institute, Stanford, CA 94305, USA
| | | | - Claire McWhite
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Shou-Ling Xu
- Carnegie Institution for Science, Stanford, CA 94305, USA; Carnegie Mass Spectrometry Facility, Carnegie Institution for Science, Stanford, CA 94305, USA
| | - Liam Dolan
- Gregor Mendel Institute, Dr. Bohr-Gasse 3, 1030 Wien, Austria
| | - Dominique C Bergmann
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA; Howard Hughes Medical Institute, Stanford, CA 94305, USA.
| |
Collapse
|
12
|
Dubey A, Baxter M, Hendargo KJ, Medrano-Soto A, Saier MH. The Pentameric Ligand-Gated Ion Channel Family: A New Member of the Voltage Gated Ion Channel Superfamily? Int J Mol Sci 2024; 25:5005. [PMID: 38732224 PMCID: PMC11084639 DOI: 10.3390/ijms25095005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 04/28/2024] [Accepted: 04/29/2024] [Indexed: 05/13/2024] Open
Abstract
In this report we present seven lines of bioinformatic evidence supporting the conclusion that the Pentameric Ligand-gated Ion Channel (pLIC) Family is a member of the Voltage-gated Ion Channel (VIC) Superfamily. In our approach, we used the Transporter Classification Database (TCDB) as a reference and applied a series of bioinformatic methods to search for similarities between the pLIC family and members of the VIC superfamily. These include: (1) sequence similarity, (2) compatibility of topology and hydropathy profiles, (3) shared domains, (4) conserved motifs, (5) similarity of Hidden Markov Model profiles between families, (6) common 3D structural folds, and (7) clustering analysis of all families. Furthermore, sequence and structural comparisons as well as the identification of a 3-TMS repeat unit in the VIC superfamily suggests that the sixth transmembrane segment evolved into a re-entrant loop. This evidence suggests that the voltage-sensor domain and the channel domain have a common origin. The classification of the pLIC family within the VIC superfamily sheds light onto the topological origins of this family and its evolution, which will facilitate experimental verification and further research into this superfamily by the scientific community.
Collapse
Affiliation(s)
| | | | | | - Arturo Medrano-Soto
- Department of Molecular Biology, School of Biological Sciences, University of California San Diego, La Jolla, CA 92093-0116, USA; (A.D.); (M.B.); (K.J.H.)
| | - Milton H. Saier
- Department of Molecular Biology, School of Biological Sciences, University of California San Diego, La Jolla, CA 92093-0116, USA; (A.D.); (M.B.); (K.J.H.)
| |
Collapse
|
13
|
Lee S, Kim G, Karin EL, Mirdita M, Park S, Chikhi R, Babaian A, Kryshtafovych A, Steinegger M. Petabase-Scale Homology Search for Structure Prediction. Cold Spring Harb Perspect Biol 2024; 16:a041465. [PMID: 38316555 PMCID: PMC11065157 DOI: 10.1101/cshperspect.a041465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2024]
Abstract
The recent CASP15 competition highlighted the critical role of multiple sequence alignments (MSAs) in protein structure prediction, as demonstrated by the success of the top AlphaFold2-based prediction methods. To push the boundaries of MSA utilization, we conducted a petabase-scale search of the Sequence Read Archive (SRA), resulting in gigabytes of aligned homologs for CASP15 targets. These were merged with default MSAs produced by ColabFold-search and provided to ColabFold-predict. By using SRA data, we achieved highly accurate predictions (GDT_TS > 70) for 66% of the non-easy targets, whereas using ColabFold-search default MSAs scored highly in only 52%. Next, we tested the effect of deep homology search and ColabFold's advanced features, such as more recycles, on prediction accuracy. While SRA homologs were most significant for improving ColabFold's CASP15 ranking from 11th to 3rd place, other strategies contributed too. We analyze these in the context of existing strategies to improve prediction.
Collapse
Affiliation(s)
- Sewon Lee
- School of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
| | - Gyuri Kim
- School of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
| | | | - Milot Mirdita
- School of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
| | - Sukhwan Park
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, South Korea
| | - Rayan Chikhi
- Institut Pasteur, Université Paris Cité, G5 Sequence Bioinformatics, 75015 Paris, France
| | - Artem Babaian
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | | | - Martin Steinegger
- School of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, South Korea
- Artificial Intelligence Institute, Seoul National University, Seoul 08826, South Korea
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul 08826, South Korea
| |
Collapse
|
14
|
Wienhausen G, Moraru C, Bruns S, Tran DQ, Sultana S, Wilkes H, Dlugosch L, Azam F, Simon M. Ligand cross-feeding resolves bacterial vitamin B 12 auxotrophies. Nature 2024; 629:886-892. [PMID: 38720071 DOI: 10.1038/s41586-024-07396-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Accepted: 04/08/2024] [Indexed: 05/24/2024]
Abstract
Cobalamin (vitamin B12, herein referred to as B12) is an essential cofactor for most marine prokaryotes and eukaryotes1,2. Synthesized by a limited number of prokaryotes, its scarcity affects microbial interactions and community dynamics2-4. Here we show that two bacterial B12 auxotrophs can salvage different B12 building blocks and cooperate to synthesize B12. A Colwellia sp. synthesizes and releases the activated lower ligand α-ribazole, which is used by another B12 auxotroph, a Roseovarius sp., to produce the corrin ring and synthesize B12. Release of B12 by Roseovarius sp. happens only in co-culture with Colwellia sp. and only coincidently with the induction of a prophage encoded in Roseovarius sp. Subsequent growth of Colwellia sp. in these conditions may be due to the provision of B12 by lysed cells of Roseovarius sp. Further evidence is required to support a causative role for prophage induction in the release of B12. These complex microbial interactions of ligand cross-feeding and joint B12 biosynthesis seem to be widespread in marine pelagic ecosystems. In the western and northern tropical Atlantic Ocean, bacteria predicted to be capable of salvaging cobinamide and synthesizing only the activated lower ligand outnumber B12 producers. These findings add new players to our understanding of B12 supply to auxotrophic microorganisms in the ocean and possibly in other ecosystems.
Collapse
Affiliation(s)
- Gerrit Wienhausen
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany.
- Scripps Institution of Oceanography, Marine Biology Research Division, University of California San Diego, La Jolla, CA, USA.
| | - Cristina Moraru
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, Essen, Germany
| | - Stefan Bruns
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | - Den Quoc Tran
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | - Sabiha Sultana
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | - Heinz Wilkes
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | - Leon Dlugosch
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | - Farooq Azam
- Scripps Institution of Oceanography, Marine Biology Research Division, University of California San Diego, La Jolla, CA, USA
| | - Meinhard Simon
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany.
- Helmholtz Institute for Functional Marine Biodiversity at the University of Oldenburg (HIFMB), Oldenburg, Germany.
| |
Collapse
|
15
|
Moreno-Manuel AI, Macías Á, Cruz FM, Gutiérrez LK, Martínez F, González-Guerra A, Martínez Carrascoso I, Bermúdez-Jimenez FJ, Sánchez-Pérez P, Vera-Pedrosa ML, Ruiz-Robles JM, Bernal JA, Jalife J. The Kir2.1E299V mutation increases atrial fibrillation vulnerability while protecting the ventricles against arrhythmias in a mouse model of short QT syndrome type 3. Cardiovasc Res 2024; 120:490-505. [PMID: 38261726 PMCID: PMC11060485 DOI: 10.1093/cvr/cvae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 10/24/2023] [Accepted: 12/12/2023] [Indexed: 01/25/2024] Open
Abstract
AIMS Short QT syndrome type 3 (SQTS3) is a rare arrhythmogenic disease caused by gain-of-function mutations in KCNJ2, the gene coding the inward rectifier potassium channel Kir2.1. We used a multidisciplinary approach and investigated arrhythmogenic mechanisms in an in-vivo model of de-novo mutation Kir2.1E299V identified in a patient presenting an extremely abbreviated QT interval and paroxysmal atrial fibrillation. METHODS AND RESULTS We used intravenous adeno-associated virus-mediated gene transfer to generate mouse models, and confirmed cardiac-specific expression of Kir2.1WT or Kir2.1E299V. On ECG, the Kir2.1E299V mouse recapitulated the QT interval shortening and the atrial-specific arrhythmia of the patient. The PR interval was also significantly shorter in Kir2.1E299V mice. Patch-clamping showed extremely abbreviated action potentials in both atrial and ventricular Kir2.1E299V cardiomyocytes due to a lack of inward-going rectification and increased IK1 at voltages positive to -80 mV. Relative to Kir2.1WT, atrial Kir2.1E299V cardiomyocytes had a significantly reduced slope conductance at voltages negative to -80 mV. After confirming a higher proportion of heterotetrameric Kir2.x channels containing Kir2.2 subunits in the atria, in-silico 3D simulations predicted an atrial-specific impairment of polyamine block and reduced pore diameter in the Kir2.1E299V-Kir2.2WT channel. In ventricular cardiomyocytes, the mutation increased excitability by shifting INa activation and inactivation in the hyperpolarizing direction, which protected the ventricle against arrhythmia. Moreover, Purkinje myocytes from Kir2.1E299V mice manifested substantially higher INa density than Kir2.1WT, explaining the abbreviation in the PR interval. CONCLUSION The first in-vivo mouse model of cardiac-specific SQTS3 recapitulates the electrophysiological phenotype of a patient with the Kir2.1E299V mutation. Kir2.1E299V eliminates rectification in both cardiac chambers but protects against ventricular arrhythmias by increasing excitability in both Purkinje-fiber network and ventricles. Consequently, the predominant arrhythmias are supraventricular likely due to the lack of inward rectification and atrial-specific reduced pore diameter of the Kir2.1E299V-Kir2.2WT heterotetramer.
Collapse
MESH Headings
- Animals
- Humans
- Mice
- Action Potentials
- Arrhythmias, Cardiac/genetics
- Arrhythmias, Cardiac/physiopathology
- Arrhythmias, Cardiac/metabolism
- Atrial Fibrillation/genetics
- Atrial Fibrillation/physiopathology
- Atrial Fibrillation/metabolism
- Disease Models, Animal
- Genetic Predisposition to Disease
- Heart Rate/genetics
- Heart Ventricles/metabolism
- Heart Ventricles/physiopathology
- Mice, Inbred C57BL
- Mice, Transgenic
- Mutation
- Myocytes, Cardiac/metabolism
- Myocytes, Cardiac/pathology
- Phenotype
- Potassium Channels, Inwardly Rectifying/genetics
- Potassium Channels, Inwardly Rectifying/metabolism
Collapse
Affiliation(s)
- Ana I Moreno-Manuel
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Álvaro Macías
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Francisco M Cruz
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Lilian K Gutiérrez
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Fernando Martínez
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
- CIBER de Enfermedades Cardiovasculares (CIBERCV), Madrid, Spain
| | - Andrés González-Guerra
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Isabel Martínez Carrascoso
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Francisco José Bermúdez-Jimenez
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
- Department of Cardiology, Hospital Universitario Virgen de las Nieves, 18014 Granada, Spain
| | - Patricia Sánchez-Pérez
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | | | - Juan Manuel Ruiz-Robles
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Juan A Bernal
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
- CIBER de Enfermedades Cardiovasculares (CIBERCV), Madrid, Spain
| | - José Jalife
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
- CIBER de Enfermedades Cardiovasculares (CIBERCV), Madrid, Spain
- Departments of Internal Medicine and Molecular and Integrative Physiology, Center for Arrhythmia Research, University of Michigan, Ann Arbor, MI 4810, USA
| |
Collapse
|
16
|
Kshirsagar M, Meller A, Humphreys I, Sledzieski S, Xu Y, Dodhia R, Horvitz E, Berger B, Bowman G, Ferres JL, Baker D, Baek M. Rapid and accurate prediction of protein homo-oligomer symmetry with Seq2Symm. RESEARCH SQUARE 2024:rs.3.rs-4215086. [PMID: 38746169 PMCID: PMC11092833 DOI: 10.21203/rs.3.rs-4215086/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
The majority of proteins must form higher-order assemblies to perform their biological functions. Despite the importance of protein quaternary structure, there are few machine learning models that can accurately and rapidly predict the symmetry of assemblies involving multiple copies of the same protein chain. Here, we address this gap by training several classes of protein foundation models, including ESM-MSA, ESM2, and RoseTTAFold2, to predict homo-oligomer symmetry. Our best model named Seq2Symm, which utilizes ESM2, outperforms existing template-based and deep learning methods. It achieves an average PR-AUC of 0.48 and 0.44 across homo-oligomer symmetries on two different held-out test sets compared to 0.32 and 0.23 for the template-based method. Because Seq2Symm can rapidly predict homo-oligomer symmetries using a single sequence as input (~ 80,000 proteins/hour), we have applied it to 5 entire proteomes and ~ 3.5 million unlabeled protein sequences to identify patterns in protein assembly complexity across biological kingdoms and species.
Collapse
Affiliation(s)
| | | | | | | | - Yixi Xu
- Microsoft AI for Good research lab
| | | | | | | | | | | | | | | |
Collapse
|
17
|
Krishna R, Wang J, Ahern W, Sturmfels P, Venkatesh P, Kalvet I, Lee GR, Morey-Burrows FS, Anishchenko I, Humphreys IR, McHugh R, Vafeados D, Li X, Sutherland GA, Hitchcock A, Hunter CN, Kang A, Brackenbrough E, Bera AK, Baek M, DiMaio F, Baker D. Generalized biomolecular modeling and design with RoseTTAFold All-Atom. Science 2024; 384:eadl2528. [PMID: 38452047 DOI: 10.1126/science.adl2528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 02/27/2024] [Indexed: 03/09/2024]
Abstract
Deep-learning methods have revolutionized protein structure prediction and design but are presently limited to protein-only systems. We describe RoseTTAFold All-Atom (RFAA), which combines a residue-based representation of amino acids and DNA bases with an atomic representation of all other groups to model assemblies that contain proteins, nucleic acids, small molecules, metals, and covalent modifications, given their sequences and chemical structures. By fine-tuning on denoising tasks, we developed RFdiffusion All-Atom (RFdiffusionAA), which builds protein structures around small molecules. Starting from random distributions of amino acid residues surrounding target small molecules, we designed and experimentally validated, through crystallography and binding measurements, proteins that bind the cardiac disease therapeutic digoxigenin, the enzymatic cofactor heme, and the light-harvesting molecule bilin.
Collapse
Affiliation(s)
- Rohith Krishna
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Jue Wang
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Woody Ahern
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA 98105, USA
| | - Pascal Sturmfels
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA 98105, USA
| | - Preetham Venkatesh
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA 98105, USA
| | - Indrek Kalvet
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98105, USA
| | - Gyu Rie Lee
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98105, USA
| | | | - Ivan Anishchenko
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Ian R Humphreys
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Ryan McHugh
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA 98105, USA
| | - Dionne Vafeados
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Xinting Li
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | | | - Andrew Hitchcock
- School of Biosciences, University of Sheffield, Sheffield S10 2TN, UK
| | - C Neil Hunter
- School of Biosciences, University of Sheffield, Sheffield S10 2TN, UK
| | - Alex Kang
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Evans Brackenbrough
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Asim K Bera
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Minkyung Baek
- School of Biological Sciences, Seoul National University, Seoul 08826, Republic of Korea
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98105, USA
| |
Collapse
|
18
|
Ackmann J, Brüge A, Gotina L, Lim S, Jahreis K, Vollbrecht AL, Kim YK, Pae AN, Labus J, Ponimaskin E. Structural determinants for activation of the Tau kinase CDK5 by the serotonin receptor 5-HT7R. Cell Commun Signal 2024; 22:233. [PMID: 38641599 PMCID: PMC11031989 DOI: 10.1186/s12964-024-01612-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 04/11/2024] [Indexed: 04/21/2024] Open
Abstract
BACKGROUND Multiple neurodegenerative diseases are induced by the formation and deposition of protein aggregates. In particular, the microtubule-associated protein Tau leads to the development of so-called tauopathies characterized by the aggregation of hyperphosphorylated Tau within neurons. We recently showed that the constitutive activity of the serotonin receptor 7 (5-HT7R) is required for Tau hyperphosphorylation and aggregation through activation of the cyclin-dependent kinase 5 (CDK5). We also demonstrated physical interaction between 5-HT7R and CDK5 at the plasma membrane suggesting that the 5-HT7R/CDK5 complex is an integral part of the signaling network involved in Tau-mediated pathology. METHODS Using biochemical, microscopic, molecular biological, computational and AI-based approaches, we investigated structural requirements for the formation of 5-HT7R/CDK5 complex. RESULTS We demonstrated that 5-HT7R domains responsible for coupling to Gs proteins are not involved in receptor interaction with CDK5. We also created a structural model of the 5-HT7R/CDK5 complex and refined the interaction interface. The model predicted two conserved phenylalanine residues, F278 and F281, within the third intracellular loop of 5-HT7R to be potentially important for complex formation. While site-directed mutagenesis of these residues did not influence Gs protein-mediated receptor signaling, replacement of both phenylalanines by alanine residues significantly reduced 5-HT7R/CDK5 interaction and receptor-mediated CDK5 activation, leading to reduced Tau hyperphosphorylation and aggregation. Molecular dynamics simulations of 5-HT7R/CDK5 complex for wild-type and receptor mutants confirmed binding interface stability of the initial model. CONCLUSIONS Our results provide a structural basis for the development of novel drugs targeting the 5-HT7R/CDK5 interaction interface for the selective treatment of Tau-related disorders, including frontotemporal dementia and Alzheimer's disease.
Collapse
Affiliation(s)
- Jana Ackmann
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Alina Brüge
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Lizaveta Gotina
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul, Republic of Korea
- Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology (UST), Daejeon, Republic of Korea
| | - Sungsu Lim
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul, Republic of Korea
| | - Kathrin Jahreis
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Anna-Lena Vollbrecht
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Yun Kyung Kim
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul, Republic of Korea
- Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology (UST), Daejeon, Republic of Korea
| | - Ae Nim Pae
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul, Republic of Korea
- Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology (UST), Daejeon, Republic of Korea
| | - Josephine Labus
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Evgeni Ponimaskin
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany.
| |
Collapse
|
19
|
Wu LY, Wijesekara Y, Piedade GJ, Pappas N, Brussaard CPD, Dutilh BE. Benchmarking bioinformatic virus identification tools using real-world metagenomic data across biomes. Genome Biol 2024; 25:97. [PMID: 38622738 PMCID: PMC11020464 DOI: 10.1186/s13059-024-03236-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 04/01/2024] [Indexed: 04/17/2024] Open
Abstract
BACKGROUND As most viruses remain uncultivated, metagenomics is currently the main method for virus discovery. Detecting viruses in metagenomic data is not trivial. In the past few years, many bioinformatic virus identification tools have been developed for this task, making it challenging to choose the right tools, parameters, and cutoffs. As all these tools measure different biological signals, and use different algorithms and training and reference databases, it is imperative to conduct an independent benchmarking to give users objective guidance. RESULTS We compare the performance of nine state-of-the-art virus identification tools in thirteen modes on eight paired viral and microbial datasets from three distinct biomes, including a new complex dataset from Antarctic coastal waters. The tools have highly variable true positive rates (0-97%) and false positive rates (0-30%). PPR-Meta best distinguishes viral from microbial contigs, followed by DeepVirFinder, VirSorter2, and VIBRANT. Different tools identify different subsets of the benchmarking data and all tools, except for Sourmash, find unique viral contigs. Performance of tools improved with adjusted parameter cutoffs, indicating that adjustment of parameter cutoffs before usage should be considered. CONCLUSIONS Together, our independent benchmarking facilitates selecting choices of bioinformatic virus identification tools and gives suggestions for parameter adjustments to viromics researchers.
Collapse
Affiliation(s)
- Ling-Yi Wu
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
| | - Yasas Wijesekara
- Institute of Bioinformatics, University Medicine Greifswald, Felix Hausdorff Str. 8, 17475, Greifswald, Germany
| | - Gonçalo J Piedade
- Department Marine Microbiology and Biogeochemistry, NIOZ Royal Netherlands Institute for Sea Research, Den Burg, PO Box 59, Texel, 1790 AB, The Netherlands
- Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Nikolaos Pappas
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
| | - Corina P D Brussaard
- Department Marine Microbiology and Biogeochemistry, NIOZ Royal Netherlands Institute for Sea Research, Den Burg, PO Box 59, Texel, 1790 AB, The Netherlands
- Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Bas E Dutilh
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands.
- Institute of Biodiversity, Faculty of Biological Sciences, Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, 07743, Jena, Germany.
| |
Collapse
|
20
|
Gucwa K, Wons E, Wisniewska A, Jakalski M, Dubiak Z, Kozlowski LP, Mruk I. Lethal perturbation of an Escherichia coli regulatory network is triggered by a restriction-modification system's regulator and can be mitigated by excision of the cryptic prophage Rac. Nucleic Acids Res 2024; 52:2942-2960. [PMID: 38153127 PMCID: PMC11014345 DOI: 10.1093/nar/gkad1234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 12/08/2023] [Accepted: 12/13/2023] [Indexed: 12/29/2023] Open
Abstract
Bacterial gene regulatory networks orchestrate responses to environmental challenges. Horizontal gene transfer can bring in genes with regulatory potential, such as new transcription factors (TFs), and this can disrupt existing networks. Serious regulatory perturbations may even result in cell death. Here, we show the impact on Escherichia coli of importing a promiscuous TF that has adventitious transcriptional effects within the cryptic Rac prophage. A cascade of regulatory network perturbations occurred on a global level. The TF, a C regulatory protein, normally controls a Type II restriction-modification system, but in E. coli K-12 interferes with expression of the RacR repressor gene, resulting in de-repression of the normally-silent Rac ydaT gene. YdaT is a prophage-encoded TF with pleiotropic effects on E. coli physiology. In turn, YdaT alters expression of a variety of bacterial regulons normally controlled by the RcsA TF, resulting in deficient lipopolysaccharide biosynthesis and cell division. At the same time, insufficient RacR repressor results in Rac DNA excision, halting Rac gene expression due to loss of the replication-defective Rac prophage. Overall, Rac induction appears to counteract the lethal toxicity of YdaT. We show here that E. coli rewires its regulatory network, so as to minimize the adverse regulatory effects of the imported C TF. This complex set of interactions may reflect the ability of bacteria to protect themselves by having robust mechanisms to maintain their regulatory networks, and/or suggest that regulatory C proteins from mobile operons are under selection to manipulate their host's regulatory networks for their own benefit.
Collapse
Affiliation(s)
- Katarzyna Gucwa
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Ewa Wons
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Aleksandra Wisniewska
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Marcin Jakalski
- 3P-Medicine Laboratory, Medical University of Gdansk, Debinki 7, 80-211 Gdansk, Poland
| | - Zuzanna Dubiak
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Lukasz Pawel Kozlowski
- Institute of Informatics, Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Banacha 2, 02-097 Warsaw, Poland
| | - Iwona Mruk
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| |
Collapse
|
21
|
Ding X, Chen X, Sullivan EE, Shay TF, Gradinaru V. Fast, accurate ranking of engineered proteins by target-binding propensity using structure modeling. Mol Ther 2024:S1525-0016(24)00219-3. [PMID: 38582966 DOI: 10.1016/j.ymthe.2024.04.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 02/08/2024] [Accepted: 04/03/2024] [Indexed: 04/08/2024] Open
Abstract
Deep-learning-based methods for protein structure prediction have achieved unprecedented accuracy, yet their utility in the engineering of protein-based binders remains constrained due to a gap between the ability to predict the structures of candidate proteins and the ability toprioritize proteins by their potential to bind to a target. To bridge this gap, we introduce Automated Pairwise Peptide-Receptor Analysis for Screening Engineered proteins (APPRAISE), a method for predicting the target-binding propensity of engineered proteins. After generating structural models of engineered proteins competing for binding to a target using an established structure prediction tool such as AlphaFold-Multimer or ESMFold, APPRAISE performs a rapid (under 1 CPU second per model) scoring analysis that takes into account biophysical and geometrical constraints. As proof-of-concept cases, we demonstrate that APPRAISE can accurately classify receptor-dependent vs. receptor-independent adeno-associated viral vectors and diverse classes of engineered proteins such as miniproteins targeting the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike, nanobodies targeting a G-protein-coupled receptor, and peptides that specifically bind to transferrin receptor or programmed death-ligand 1 (PD-L1). APPRAISE is accessible through a web-based notebook interface using Google Colaboratory (https://tiny.cc/APPRAISE). With its accuracy, interpretability, and generalizability, APPRAISE promises to expand the utility of protein structure prediction and accelerate protein engineering for biomedical applications.
Collapse
Affiliation(s)
- Xiaozhe Ding
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California, Boulevard, Pasadena, CA 91125, USA.
| | - Xinhong Chen
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California, Boulevard, Pasadena, CA 91125, USA
| | - Erin E Sullivan
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California, Boulevard, Pasadena, CA 91125, USA
| | - Timothy F Shay
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California, Boulevard, Pasadena, CA 91125, USA
| | - Viviana Gradinaru
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California, Boulevard, Pasadena, CA 91125, USA.
| |
Collapse
|
22
|
Si Y, Yan C. Protein language model-embedded geometric graphs power inter-protein contact prediction. eLife 2024; 12:RP92184. [PMID: 38564241 PMCID: PMC10987090 DOI: 10.7554/elife.92184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2024] Open
Abstract
Accurate prediction of contacting residue pairs between interacting proteins is very useful for structural characterization of protein-protein interactions. Although significant improvement has been made in inter-protein contact prediction recently, there is still a large room for improving the prediction accuracy. Here we present a new deep learning method referred to as PLMGraph-Inter for inter-protein contact prediction. Specifically, we employ rotationally and translationally invariant geometric graphs obtained from structures of interacting proteins to integrate multiple protein language models, which are successively transformed by graph encoders formed by geometric vector perceptrons and residual networks formed by dimensional hybrid residual blocks to predict inter-protein contacts. Extensive evaluation on multiple test sets illustrates that PLMGraph-Inter outperforms five top inter-protein contact prediction methods, including DeepHomo, GLINTER, CDPred, DeepHomo2, and DRN-1D2D_Inter, by large margins. In addition, we also show that the prediction of PLMGraph-Inter can complement the result of AlphaFold-Multimer. Finally, we show leveraging the contacts predicted by PLMGraph-Inter as constraints for protein-protein docking can dramatically improve its performance for protein complex structure prediction.
Collapse
Affiliation(s)
- Yunda Si
- School of Physics, Huazhong University of Science and TechnologyWuhanChina
| | - Chengfei Yan
- School of Physics, Huazhong University of Science and TechnologyWuhanChina
| |
Collapse
|
23
|
Wu JS, Liu Y, Ge F, Yu DJ. Prediction of protein-ATP binding residues using multi-view feature learning via contextual-based co-attention network. Comput Biol Med 2024; 172:108227. [PMID: 38460308 DOI: 10.1016/j.compbiomed.2024.108227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 01/17/2024] [Accepted: 02/25/2024] [Indexed: 03/11/2024]
Abstract
Accurately predicting protein-ATP binding residues is critical for protein function annotation and drug discovery. Computational methods dedicated to the prediction of binding residues based on protein sequence information have exhibited notable advancements in predictive accuracy. Nevertheless, these methods continue to grapple with several formidable challenges, including limited means of extracting more discriminative features and inadequate algorithms for integrating protein and residue information. To address the problems, we propose ATP-Deep, a novel protein-ATP binding residues predictor. ATP-Deep harnesses the capabilities of unsupervised pre-trained language models and incorporates domain-specific evolutionary context information from homologous sequences. It further refines the embedding at the residue level through integration with corresponding protein-level information and employs a contextual-based co-attention mechanism to adeptly fuse multiple sources of features. The performance evaluation results on the benchmark datasets reveal that ATP-Deep achieves an AUC of 0.954 and 0.951, respectively, surpassing the performance of the state-of-the-art model. These findings underscore the effectiveness of assimilating protein-level information and deploying a contextual-based co-attention mechanism grounded in context to bolster the prediction performance of protein-ATP binding residues.
Collapse
Affiliation(s)
- Jia-Shun Wu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing, 210094, China
| | - Yan Liu
- School of Information Engineering, Yangzhou University, 196 West Huayang, Yangzhou, 225100, China
| | - Fang Ge
- State Key Laboratory of Organic Electronics and Information Displays & Institute of Advanced Materials (IAM), Nanjing University of Posts & Telecommunications, 9 Wenyuan Road, Nanjing 210023, China
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing, 210094, China.
| |
Collapse
|
24
|
Shallangwa GA, Mahmud AW, Uzairu A, Ibrahim MT. 2,4-disubstituted 6-fluoroquinolines as potent antiplasmodial agents: QSAR, homology modeling, molecular docking and ADMET studies. J Taibah Univ Med Sci 2024; 19:233-247. [PMID: 38179257 PMCID: PMC10762476 DOI: 10.1016/j.jtumed.2023.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 09/29/2023] [Accepted: 11/09/2023] [Indexed: 01/06/2024] Open
Abstract
Objective This work was designed to study 2,4-disubstituted 6-fluoroquinolines as antiplasmodial agents by using in silico techniques, to aid in the design of novel analogs with high potency against malaria and high inhibition of Plasmodium falciparum translation elongation factor 2 (PfeEF2), a novel drug target. Methods Quantitative structure-activity relationships (QSAR) of 2,4-disubstituted 6-fluoroquinolines were studied with the genetic function approximation technique in Material Studio software. The 3D structure of PfeEF2 was modeled in the SWISS-MODEL workspace through homology modeling. A molecular docking study of the modeled PfeEF2 and 2,4-disubstituted 6-fluoroquinolines was conducted with Autodock Vina in Pyrx software. Furthermore, the in silico pharmacokinetic properties of selected compounds were investigated. Results A robust, reliable and predictive QSAR model was developed that related the chemical structures of 2,4-disubstituted 6-fluoroquinolines to their antiplasmodium activities. The model had an internal squared correlation coefficient R2 of 0.921, adjusted squared correlation coefficient R2adj of 0.878, leave-one-out cross-validation coefficient Q2cv of 0.801 and predictive squared correlation coefficient R2pred of 0.901. The antiplasmodium activity of 6-fluoroquinolines was found to depend on the n5Ring, GGI9, TDB7u, TDB8u and RDF75i physicochemical properties: n5Ring, TDB8u and RDF75i were positively associated, whereas GGI9 and TDB7u were negatively associated, with the antiplasmodium activity of the compounds. Stable complexes formed between the compounds and modeled PfeEF2, with binding affinity ranging from -8.200 to -10.700 kcal/mol. Compounds 5, 11, 16, 22 and 24 had better binding affinities than quinoline-4-carboxamide (DDD107498), as well as good pharmacokinetic properties, and therefore may be better inhibitors of this novel target. Conclusion QSAR and docking studies provided insight into designing novel 2,4-disubstituted 6-fluoroquinolines with high antiplasmodial activity and good structural properties for inhibiting a novel antimalarial drug target.
Collapse
Affiliation(s)
| | - Aliyu W. Mahmud
- Department of Applied Chemistry, Kaduna Polytechnic, P.M.B 2021, Kaduna, Nigeria
| | - Adamu Uzairu
- Chemistry Department, Ahmadu Bello University, Zaria, Nigeria
| | | |
Collapse
|
25
|
Liu W, Wang Z, You R, Xie C, Wei H, Xiong Y, Yang J, Zhu S. PLMSearch: Protein language model powers accurate and fast sequence search for remote homology. Nat Commun 2024; 15:2775. [PMID: 38555371 PMCID: PMC10981738 DOI: 10.1038/s41467-024-46808-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Accepted: 03/08/2024] [Indexed: 04/02/2024] Open
Abstract
Homologous protein search is one of the most commonly used methods for protein annotation and analysis. Compared to structure search, detecting distant evolutionary relationships from sequences alone remains challenging. Here we propose PLMSearch (Protein Language Model), a homologous protein search method with only sequences as input. PLMSearch uses deep representations from a pre-trained protein language model and trains the similarity prediction model with a large number of real structure similarity. This enables PLMSearch to capture the remote homology information concealed behind the sequences. Extensive experimental results show that PLMSearch can search millions of query-target protein pairs in seconds like MMseqs2 while increasing the sensitivity by more than threefold, and is comparable to state-of-the-art structure search methods. In particular, unlike traditional sequence search methods, PLMSearch can recall most remote homology pairs with dissimilar sequences but similar structures. PLMSearch is freely available at https://dmiip.sjtu.edu.cn/PLMSearch .
Collapse
Affiliation(s)
- Wei Liu
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, 200433, Shanghai, China
| | - Ziye Wang
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, 200433, Shanghai, China
| | - Ronghui You
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, 200433, Shanghai, China
| | - Chenghan Xie
- School of Mathematical Sciences, Fudan University, 200433, Shanghai, China
| | - Hong Wei
- School of Mathematical Sciences, Nankai University, 300071, Tianjin, China
| | - Yi Xiong
- Department of Bioinformatics and Biostatistics, Shanghai Jiao Tong University, 200240, Shanghai, China
| | - Jianyi Yang
- Ministry of Education Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Science, Shandong University, 266237, Qingdao, China.
| | - Shanfeng Zhu
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, 200433, Shanghai, China.
- Shanghai Qi Zhi Institute, Shanghai, China.
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University), Ministry of Education, Shanghai, China.
- Shanghai Key Lab of Intelligent Information Processing and Shanghai Institute of Artificial Intelligence Algorithm, Fudan University, Shanghai, China.
- Zhangjiang Fudan International Innovation Center, Shanghai, China.
| |
Collapse
|
26
|
Bibik P, Alibai S, Pandini A, Dantu SC. PyCoM: a python library for large-scale analysis of residue-residue coevolution data. Bioinformatics 2024; 40:btae166. [PMID: 38532297 PMCID: PMC11009027 DOI: 10.1093/bioinformatics/btae166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 02/02/2024] [Accepted: 03/25/2024] [Indexed: 03/28/2024] Open
Abstract
MOTIVATION Computational methods to detect correlated amino acid positions in proteins have become a valuable tool to predict intra- and inter-residue protein contacts, protein structures, and effects of mutation on protein stability and function. While there are many tools and webservers to compute coevolution scoring matrices, there is no central repository of alignments and coevolution matrices for large-scale studies and pattern detection leveraging on biological and structural annotations already available in UniProt. RESULTS We present a Python library, PyCoM, which enables users to query and analyze coevolution matrices and sequence alignments of 457 622 proteins, selected from UniProtKB/Swiss-Prot database (length ≤ 500 residues), from a precompiled coevolution matrix database (PyCoMdb). PyCoM facilitates the development of statistical analyses of residue coevolution patterns using filters on biological and structural annotations from UniProtKB/Swiss-Prot, with simple access to PyCoMdb for both novice and advanced users, supporting Jupyter Notebooks, Python scripts, and a web API access. The resource is open source and will help in generating data-driven computational models and methods to study and understand protein structures, stability, function, and design. AVAILABILITY AND IMPLEMENTATION PyCoM code is freely available from https://github.com/scdantu/pycom and PyCoMdb and the Jupyter Notebook tutorials are freely available from https://pycom.brunel.ac.uk.
Collapse
Affiliation(s)
- Philipp Bibik
- Department of Computer Science, Brunel University London, Uxbridge UB8 3PH, United Kingdom
| | - Sabriyeh Alibai
- Department of Computer Science, Brunel University London, Uxbridge UB8 3PH, United Kingdom
| | - Alessandro Pandini
- Department of Computer Science, Brunel University London, Uxbridge UB8 3PH, United Kingdom
| | - Sarath Chandra Dantu
- Department of Computer Science, Brunel University London, Uxbridge UB8 3PH, United Kingdom
| |
Collapse
|
27
|
Luthringer R, Raphalen M, Guerra C, Colin S, Martinho C, Zheng M, Hoshino M, Badis Y, Lipinska AP, Haas FB, Barrera-Redondo J, Alva V, Coelho SM. Repeated co-option of HMG-box genes for sex determination in brown algae and animals. Science 2024; 383:eadk5466. [PMID: 38513029 DOI: 10.1126/science.adk5466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 01/31/2024] [Indexed: 03/23/2024]
Abstract
In many eukaryotes, genetic sex determination is not governed by XX/XY or ZW/ZZ systems but by a specialized region on the poorly studied U (female) or V (male) sex chromosomes. Previous studies have hinted at the existence of a dominant male-sex factor on the V chromosome in brown algae, a group of multicellular eukaryotes distantly related to animals and plants. The nature of this factor has remained elusive. Here, we demonstrate that an HMG-box gene acts as the male-determining factor in brown algae, mirroring the role HMG-box genes play in sex determination in animals. Over a billion-year evolutionary timeline, these lineages have independently co-opted the HMG box for male determination, representing a paradigm for evolution's ability to recurrently use the same genetic "toolkit" to accomplish similar tasks.
Collapse
Affiliation(s)
- Rémy Luthringer
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Morgane Raphalen
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Carla Guerra
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Sébastien Colin
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Claudia Martinho
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Min Zheng
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Masakazu Hoshino
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
- Research Center for Inland Seas, Kobe University, Kobe 658-0022, Japan
| | - Yacine Badis
- Roscoff Biological Station, CNRS-Sorbonne University, Place Georges Teissier, 29680 Roscoff, France
| | - Agnieszka P Lipinska
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Fabian B Haas
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Josué Barrera-Redondo
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Vikram Alva
- Department of Protein Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Susana M Coelho
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| |
Collapse
|
28
|
Billman ZP, Kovacs SB, Wei B, Kang K, Cissé OH, Miao EA. Caspase-1 activates gasdermin A in non-mammals. eLife 2024; 12:RP92362. [PMID: 38497531 PMCID: PMC10948149 DOI: 10.7554/elife.92362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2024] Open
Abstract
Gasdermins oligomerize to form pores in the cell membrane, causing regulated lytic cell death called pyroptosis. Mammals encode five gasdermins that can trigger pyroptosis: GSDMA, B, C, D, and E. Caspase and granzyme proteases cleave the linker regions of and activate GSDMB, C, D, and E, but no endogenous activation pathways are yet known for GSDMA. Here, we perform a comprehensive evolutionary analysis of the gasdermin family. A gene duplication of GSDMA in the common ancestor of caecilian amphibians, reptiles, and birds gave rise to GSDMA-D in mammals. Uniquely in our tree, amphibian, reptile, and bird GSDMA group in a separate clade than mammal GSDMA. Remarkably, GSDMA in numerous bird species contain caspase-1 cleavage sites like YVAD or FASD in the linker. We show that GSDMA from birds, amphibians, and reptiles are all cleaved by caspase-1. Thus, GSDMA was originally cleaved by the host-encoded protease caspase-1. In mammals the caspase-1 cleavage site in GSDMA is disrupted; instead, a new protein, GSDMD, is the target of caspase-1. Mammal caspase-1 uses exosite interactions with the GSDMD C-terminal domain to confer the specificity of this interaction, whereas we show that bird caspase-1 uses a stereotypical tetrapeptide sequence to confer specificity for bird GSDMA. Our results reveal an evolutionarily stable association between caspase-1 and the gasdermin family, albeit a shifting one. Caspase-1 repeatedly changes its target gasdermin over evolutionary time at speciation junctures, initially cleaving GSDME in fish, then GSDMA in amphibians/reptiles/birds, and finally GSDMD in mammals.
Collapse
Affiliation(s)
- Zachary Paul Billman
- Department of Integrative Immunobiology; Molecular Genetics and Microbiology; Pathology; and Cell Biology, Duke University School of MedicineDurhamUnited States
- Department of Microbiology and Immunology, University of North Carolina at Chapel HillChapel HillUnited States
| | - Stephen Bela Kovacs
- Department of Integrative Immunobiology; Molecular Genetics and Microbiology; Pathology; and Cell Biology, Duke University School of MedicineDurhamUnited States
- Department of Microbiology and Immunology, University of North Carolina at Chapel HillChapel HillUnited States
| | - Bo Wei
- Department of Integrative Immunobiology; Molecular Genetics and Microbiology; Pathology; and Cell Biology, Duke University School of MedicineDurhamUnited States
| | - Kidong Kang
- Department of Integrative Immunobiology; Molecular Genetics and Microbiology; Pathology; and Cell Biology, Duke University School of MedicineDurhamUnited States
| | - Ousmane H Cissé
- Critical Care Medicine Department, National Institutes of Health Clinical CenterBethesdaUnited States
| | - Edward A Miao
- Department of Integrative Immunobiology; Molecular Genetics and Microbiology; Pathology; and Cell Biology, Duke University School of MedicineDurhamUnited States
| |
Collapse
|
29
|
Johnson SR, Peshwa M, Sun Z. Sensitive remote homology search by local alignment of small positional embeddings from protein language models. eLife 2024; 12:RP91415. [PMID: 38488154 PMCID: PMC10942778 DOI: 10.7554/elife.91415] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/17/2024] Open
Abstract
Accurately detecting distant evolutionary relationships between proteins remains an ongoing challenge in bioinformatics. Search methods based on primary sequence struggle to accurately detect homology between sequences with less than 20% amino acid identity. Profile- and structure-based strategies extend sensitive search capabilities into this twilight zone of sequence similarity but require slow pre-processing steps. Recently, whole-protein and positional embeddings from deep neural networks have shown promise for providing sensitive sequence comparison and annotation at long evolutionary distances. Embeddings are generally faster to compute than profiles and predicted structures but still suffer several drawbacks related to the ability of whole-protein embeddings to discriminate domain-level homology, and the database size and search speed of methods using positional embeddings. In this work, we show that low-dimensionality positional embeddings can be used directly in speed-optimized local search algorithms. As a proof of concept, we use the ESM2 3B model to convert primary sequences directly into the 3D interaction (3Di) alphabet or amino acid profiles and use these embeddings as input to the highly optimized Foldseek, HMMER3, and HH-suite search algorithms. Our results suggest that positional embeddings as small as a single byte can provide sufficient information for dramatically improved sensitivity over amino acid sequence searches without sacrificing search speed.
Collapse
Affiliation(s)
| | | | - Zhiyi Sun
- New England Biolabs IncIpswichUnited States
| |
Collapse
|
30
|
Bollinger KW, Müh U, Ocius KL, Apostolos AJ, Pires MM, Helm RF, Popham DL, Weiss DS, Ellermeier CD. Identification of a new family of peptidoglycan transpeptidases reveals atypical crosslinking is essential for viability in Clostridioides difficile. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.14.584917. [PMID: 38559057 PMCID: PMC10980060 DOI: 10.1101/2024.03.14.584917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Clostridioides difficile, the leading cause of antibiotic-associated diarrhea, relies primarily on 3-3 crosslinks created by L,D-transpeptidases (LDTs) to fortify its peptidoglycan (PG) cell wall. This is unusual, as in most bacteria the vast majority of PG crosslinks are 4-3 crosslinks, which are created by penicillin-binding proteins (PBPs). Here we report the unprecedented observation that 3-3 crosslinking is essential for viability in C. difficile. We also report the discovery of a new family of LDTs that use a VanW domain to catalyze 3-3 crosslinking rather than a YkuD domain as in all previously known LDTs. Bioinformatic analyses indicate VanW domain LDTs are less common than YkuD domain LDTs and are largely restricted to Gram-positive bacteria. Our findings suggest that LDTs might be exploited as targets for antibiotics that kill C. difficile without disrupting the intestinal microbiota that is important for keeping C. difficile in check.
Collapse
Affiliation(s)
- Kevin W. Bollinger
- Department of Microbiology and Immunology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Ute Müh
- Department of Microbiology and Immunology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
| | - Karl L. Ocius
- Department of Chemistry, University of Virginia, Charlottesville, VA, USA
| | - Alexis J. Apostolos
- Department of Chemistry, University of Virginia, Charlottesville, VA, USA
- Present address: Haleon, 1211 Sherwood Ave, Richmond, VA 23220
| | - Marcos M. Pires
- Department of Chemistry, University of Virginia, Charlottesville, VA, USA
| | - Richard F. Helm
- Department of Biochemistry, Virginia Tech, Blacksburg, VA, USA
| | - David L. Popham
- Department of Biological Sciences, Virginia Tech, Blacksburg, VA, USA
| | - David S. Weiss
- Department of Microbiology and Immunology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
- Graduate Program in Genetics, University of Iowa, Iowa City, IA USA
| | - Craig D. Ellermeier
- Department of Microbiology and Immunology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
- Graduate Program in Genetics, University of Iowa, Iowa City, IA USA
| |
Collapse
|
31
|
Makarova KS, Tobiasson V, Wolf YI, Lu Z, Liu Y, Zhang S, Krupovic M, Li M, Koonin EV. Diversity, origin, and evolution of the ESCRT systems. mBio 2024; 15:e0033524. [PMID: 38380930 PMCID: PMC10936438 DOI: 10.1128/mbio.00335-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 02/06/2024] [Indexed: 02/22/2024] Open
Abstract
Endosomal sorting complexes required for transport (ESCRT) play key roles in protein sorting between membrane-bounded compartments of eukaryotic cells. Homologs of many ESCRT components are identifiable in various groups of archaea, especially in Asgardarchaeota, the archaeal phylum that is currently considered to include the closest relatives of eukaryotes, but not in bacteria. We performed a comprehensive search for ESCRT protein homologs in archaea and reconstructed ESCRT evolution using the phylogenetic tree of Vps4 ATPase (ESCRT IV) as a scaffold and using sensitive protein sequence analysis and comparison of structural models to identify previously unknown ESCRT proteins. Several distinct groups of ESCRT systems in archaea outside of Asgard were identified, including proteins structurally similar to ESCRT-I and ESCRT-II, and several other domains involved in protein sorting in eukaryotes, suggesting an early origin of these components. Additionally, distant homologs of CdvA proteins were identified in Thermoproteales which are likely components of the uncharacterized cell division system in these archaea. We propose an evolutionary scenario for the origin of eukaryotic and Asgard ESCRT complexes from ancestral building blocks, namely, the Vps4 ATPase, ESCRT-III components, wH (winged helix-turn-helix fold) and possibly also coiled-coil, and Vps28-like domains. The last archaeal common ancestor likely encompassed a complex ESCRT system that was involved in protein sorting. Subsequent evolution involved either simplification, as in the TACK superphylum, where ESCRT was co-opted for cell division, or complexification as in Asgardarchaeota. In Asgardarchaeota, the connection between ESCRT and the ubiquitin system that was previously considered a eukaryotic signature was already established.IMPORTANCEAll eukaryotic cells possess complex intracellular membrane organization. Endosomal sorting complexes required for transport (ESCRT) play a central role in membrane remodeling which is essential for cellular functionality in eukaryotes. Recently, it has been shown that Asgard archaea, the archaeal phylum that includes the closest known relatives of eukaryotes, encode homologs of many components of the ESCRT systems. We employed protein sequence and structure comparisons to reconstruct the evolution of ESCRT systems in archaea and identified several previously unknown homologs of ESCRT subunits, some of which can be predicted to participate in cell division. The results of this reconstruction indicate that the last archaeal common ancestor already encoded a complex ESCRT system that was involved in protein sorting. In Asgard archaea, ESCRT systems evolved toward greater complexity, and in particular, the connection between ESCRT and the ubiquitin system that was previously considered a eukaryotic signature was established.
Collapse
Affiliation(s)
- Kira S. Makarova
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland, USA
| | - Victor Tobiasson
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland, USA
| | - Yuri I. Wolf
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland, USA
| | - Zhongyi Lu
- Archaeal Biology Center, Institute for Advanced Study, Shenzhen University, Shenzhen, China
- Shenzhen Key Laboratory of Marine Microbiome Engineering, Institute for Advanced Study, Shenzhen University, Shenzhen, China
| | - Yang Liu
- Archaeal Biology Center, Institute for Advanced Study, Shenzhen University, Shenzhen, China
- Shenzhen Key Laboratory of Marine Microbiome Engineering, Institute for Advanced Study, Shenzhen University, Shenzhen, China
| | - Siyu Zhang
- Archaeal Biology Center, Institute for Advanced Study, Shenzhen University, Shenzhen, China
- Shenzhen Key Laboratory of Marine Microbiome Engineering, Institute for Advanced Study, Shenzhen University, Shenzhen, China
| | - Mart Krupovic
- Archaeal Virology Unit, Institut Pasteur, Université de Paris, Paris, France
| | - Meng Li
- Archaeal Biology Center, Institute for Advanced Study, Shenzhen University, Shenzhen, China
- Shenzhen Key Laboratory of Marine Microbiome Engineering, Institute for Advanced Study, Shenzhen University, Shenzhen, China
| | - Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland, USA
| |
Collapse
|
32
|
Meitil IKS, Gippert GP, Barrett K, Hunt CJ, Henrissat B. Diversity of sugar-diphospholipid-utilizing glycosyltransferase families. Commun Biol 2024; 7:285. [PMID: 38454040 PMCID: PMC10920833 DOI: 10.1038/s42003-024-05930-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 02/16/2024] [Indexed: 03/09/2024] Open
Abstract
Peptidoglycan polymerases, enterobacterial common antigen polymerases, O-antigen ligases, and other bacterial polysaccharide polymerases (BP-Pols) are glycosyltransferases (GTs) that build bacterial surface polysaccharides. These integral membrane enzymes share the particularity of using diphospholipid-activated sugars and were previously missing in the carbohydrate-active enzymes database (CAZy; www.cazy.org ). While the first three classes formed well-defined families of similar proteins, the sequences of BP-Pols were so diverse that a single family could not be built. To address this, we developed a new clustering method using a combination of a sequence similarity network and hidden Markov model comparisons. Overall, we have defined 17 new GT families including 14 of BP-Pols. We find that the reaction stereochemistry appears to be conserved in each of the defined BP-Pol families, and that the BP-Pols within the families transfer similar sugars even across Gram-negative and Gram-positive bacteria. Comparison of the new GT families reveals three clans of distantly related families, which also conserve the reaction stereochemistry.
Collapse
Affiliation(s)
- Ida K S Meitil
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Garry P Gippert
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Kristian Barrett
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Cameron J Hunt
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Bernard Henrissat
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kgs. Lyngby, Denmark.
- Architecture et Fonction des Macromolécules Biologiques, CNRS, Aix-Marseille Université, Marseille, France.
- Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia.
| |
Collapse
|
33
|
Figueroa III JL, Dhungel E, Bellanger M, Brouwer CR, White III RA. MetaCerberus: distributed highly parallelized HMM-based processing for robust functional annotation across the tree of life. Bioinformatics 2024; 40:btae119. [PMID: 38426351 PMCID: PMC10955254 DOI: 10.1093/bioinformatics/btae119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 01/22/2024] [Accepted: 02/27/2024] [Indexed: 03/02/2024] Open
Abstract
MOTIVATION MetaCerberus is a massively parallel, fast, low memory, scalable annotation tool for inference gene function across genomes to metacommunities. MetaCerberus provides an elusive HMM/HMMER-based tool at a rapid scale with low memory. It offers scalable gene elucidation to major public databases, including KEGG (KO), COGs, CAZy, FOAM, and specific databases for viruses, including VOGs and PHROGs, from single genomes to metacommunities. RESULTS MetaCerberus is 1.3× as fast on a single node than eggNOG-mapper v2 on 5× less memory using an exclusively HMM/HMMER mode. In a direct comparison, MetaCerberus provides better annotation of viruses, phages, and archaeal viruses than DRAM, Prokka, or InterProScan. MetaCerberus annotates more KOs across domains when compared to DRAM, with a 186× smaller database, and with 63× less memory. MetaCerberus is fully integrated for automatic analysis of statistics and pathways using differential statistic tools (i.e. DESeq2 and edgeR), pathway enrichment (GAGE R), and pathview R. MetaCerberus provides a novel tool for unlocking the biosphere across the tree of life at scale. AVAILABILITY AND IMPLEMENTATION MetaCerberus is written in Python and distributed under a BSD-3 license. The source code of MetaCerberus is freely available at https://github.com/raw-lab/metacerberus compatible with Python 3 and works on both Mac OS X and Linux. MetaCerberus can also be easily installed using bioconda: mamba create -n metacerberus -c bioconda -c conda-forge metacerberus.
Collapse
Affiliation(s)
- Jose L Figueroa III
- North Carolina Research Campus (NCRC), Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Kannapolis, NC 28081, United States
- Computational Intelligence to Predict Health and Environmental Risks (CIPHER) Research Center, Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Charlotte, NC 28223, United States
| | - Eliza Dhungel
- North Carolina Research Campus (NCRC), Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Kannapolis, NC 28081, United States
| | - Madeline Bellanger
- North Carolina Research Campus (NCRC), Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Kannapolis, NC 28081, United States
- Computational Intelligence to Predict Health and Environmental Risks (CIPHER) Research Center, Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Charlotte, NC 28223, United States
| | - Cory R Brouwer
- North Carolina Research Campus (NCRC), Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Kannapolis, NC 28081, United States
| | - Richard Allen White III
- North Carolina Research Campus (NCRC), Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Kannapolis, NC 28081, United States
- Computational Intelligence to Predict Health and Environmental Risks (CIPHER) Research Center, Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Charlotte, NC 28223, United States
| |
Collapse
|
34
|
Qin Z, Wang R, Hou P, Zhang Y, Yuan Q, Wang Y, Yang Y, Xu T. TCR signaling induces STAT3 phosphorylation to promote TH17 cell differentiation. J Exp Med 2024; 221:e20230683. [PMID: 38324068 PMCID: PMC10849914 DOI: 10.1084/jem.20230683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 08/11/2023] [Accepted: 01/18/2024] [Indexed: 02/08/2024] Open
Abstract
TH17 differentiation is critically controlled by "signal 3" of cytokines (IL-6/IL-23) through STAT3. However, cytokines alone induced only a moderate level of STAT3 phosphorylation. Surprisingly, TCR stimulation alone induced STAT3 phosphorylation through Lck/Fyn, and synergistically with IL-6/IL-23 induced robust and optimal STAT3 phosphorylation at Y705. Inhibition of Lck/Fyn kinase activity by Srci1 or disrupting the interaction between Lck/Fyn and STAT3 by disease-causing STAT3 mutations selectively impaired TCR stimulation, but not cytokine-induced STAT3 phosphorylation, which consequently abolished TH17 differentiation and converted them to FOXP3+ Treg cells. Srci1 administration or disrupting the interaction between Lck/Fyn and STAT3 significantly ameliorated TH17 cell-mediated EAE disease. These findings uncover an unexpected deterministic role of TCR signaling in fate determination between TH17 and Treg cells through Lck/Fyn-dependent phosphorylation of STAT3, which can be exploited to develop therapeutics selectively against TH17-related autoimmune diseases. Our study thus provides insight into how TCR signaling could integrate with cytokine signal to direct T cell differentiation.
Collapse
Affiliation(s)
- Zhen Qin
- Department of Immunology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Ruining Wang
- Department of Immunology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Ping Hou
- Department of Immunology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Yuanyuan Zhang
- Department of Immunology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Qianmu Yuan
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Ying Wang
- Department of Immunology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Yuedong Yang
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Tao Xu
- Department of Immunology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
35
|
Waksman T, Astin E, Fisher SR, Hunter WN, Bos JIB. Computational Prediction of Structure, Function, and Interaction of Myzus persicae (Green Peach Aphid) Salivary Effector Proteins. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2024; 37:338-346. [PMID: 38171380 DOI: 10.1094/mpmi-10-23-0154-fi] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Similar to plant pathogens, phloem-feeding insects such as aphids deliver effector proteins inside their hosts that act to promote host susceptibility and enable feeding and infestation. Despite exciting progress toward identifying and characterizing effector proteins from these insects, their functions remain largely unknown. The recent groundbreaking development in protein structure prediction algorithms, combined with the availability of proteomics and transcriptomic datasets for agriculturally important pests, provides new opportunities to explore the structural and functional diversity of effector repertoires. In this study, we sought to gain insight into the infection strategy used by the Myzus persicae (green peach aphid) by predicting and analyzing the structures of a set of 71 effector candidate proteins. We used two protein structure prediction methods, AlphaFold and OmegaFold, that produced mutually consistent results. We observed a wide continuous spectrum of structures among the effector candidates, from disordered proteins to globular enzymes. We made use of the structural information and state-of-the-art computational methods to predict M. persicae effector protein properties, including function and interaction with host plant proteins. Overall, our investigation provides novel insights into prediction of structure, function, and interaction of M. persicae effector proteins and will guide the necessary experimental characterization to address new hypotheses. [Formula: see text] Copyright © 2024 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Collapse
Affiliation(s)
- Thomas Waksman
- Division of Plant Sciences, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, U.K
| | - Edmund Astin
- Division of Plant Sciences, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, U.K
| | - S Ronan Fisher
- Division of Plant Sciences, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, U.K
| | - William N Hunter
- Biological Chemistry and Drug Discovery, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, U.K
| | - Jorunn I B Bos
- Division of Plant Sciences, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, U.K
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, U.K
| |
Collapse
|
36
|
Ouasti F, Audin M, Fréon K, Quivy JP, Tachekort M, Cesard E, Thureau A, Ropars V, Fernández Varela P, Moal G, Soumana-Amadou I, Uryga A, Legrand P, Andreani J, Guerois R, Almouzni G, Lambert S, Ochsenbein F. Disordered regions and folded modules in CAF-1 promote histone deposition in Schizosaccharomyces pombe. eLife 2024; 12:RP91461. [PMID: 38376141 PMCID: PMC10942606 DOI: 10.7554/elife.91461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2024] Open
Abstract
Genome and epigenome integrity in eukaryotes depends on the proper coupling of histone deposition with DNA synthesis. This process relies on the evolutionary conserved histone chaperone CAF-1 for which the links between structure and functions are still a puzzle. While studies of the Saccharomyces cerevisiae CAF-1 complex enabled to propose a model for the histone deposition mechanism, we still lack a framework to demonstrate its generality and in particular, how its interaction with the polymerase accessory factor PCNA is operating. Here, we reconstituted a complete SpCAF-1 from fission yeast. We characterized its dynamic structure using NMR, SAXS and molecular modeling together with in vitro and in vivo functional studies on rationally designed interaction mutants. Importantly, we identify the unfolded nature of the acidic domain which folds up when binding to histones. We also show how the long KER helix mediates DNA binding and stimulates SpCAF-1 association with PCNA. Our study highlights how the organization of CAF-1 comprising both disordered regions and folded modules enables the dynamics of multiple interactions to promote synthesis-coupled histone deposition essential for its DNA replication, heterochromatin maintenance, and genome stability functions.
Collapse
Affiliation(s)
- Fouad Ouasti
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Institute JoliotGif-sur-YvetteFrance
| | - Maxime Audin
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Institute JoliotGif-sur-YvetteFrance
| | - Karine Fréon
- Institut Curie, PSL Research University, CNRS UMR 3348, INSERM U1278, Université Paris-Saclay, Equipe labellisée Ligue contre le CancerOrsayFrance
| | - Jean-Pierre Quivy
- Institut Curie, PSL Research University, CNRS, Sorbonne Université,CNRS UMR3664, Nuclear Dynamics Unit, Équipe Labellisée Ligue contre le CancerParisFrance
| | - Mehdi Tachekort
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Institute JoliotGif-sur-YvetteFrance
| | - Elizabeth Cesard
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Institute JoliotGif-sur-YvetteFrance
| | - Aurélien Thureau
- Synchrotron SOLEIL, HelioBio group, l'Orme des MerisiersSaint-AubinFrance
| | - Virginie Ropars
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Institute JoliotGif-sur-YvetteFrance
| | - Paloma Fernández Varela
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Institute JoliotGif-sur-YvetteFrance
| | - Gwenaelle Moal
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Institute JoliotGif-sur-YvetteFrance
| | - Ibrahim Soumana-Amadou
- Institut Curie, PSL Research University, CNRS UMR 3348, INSERM U1278, Université Paris-Saclay, Equipe labellisée Ligue contre le CancerOrsayFrance
| | - Aleksandra Uryga
- Institut Curie, PSL Research University, CNRS UMR 3348, INSERM U1278, Université Paris-Saclay, Equipe labellisée Ligue contre le CancerOrsayFrance
| | - Pierre Legrand
- Synchrotron SOLEIL, HelioBio group, l'Orme des MerisiersSaint-AubinFrance
| | - Jessica Andreani
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Institute JoliotGif-sur-YvetteFrance
| | - Raphaël Guerois
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Institute JoliotGif-sur-YvetteFrance
| | - Geneviève Almouzni
- Institut Curie, PSL Research University, CNRS, Sorbonne Université,CNRS UMR3664, Nuclear Dynamics Unit, Équipe Labellisée Ligue contre le CancerParisFrance
| | - Sarah Lambert
- Institut Curie, PSL Research University, CNRS UMR 3348, INSERM U1278, Université Paris-Saclay, Equipe labellisée Ligue contre le CancerOrsayFrance
| | - Francoise Ochsenbein
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Institute JoliotGif-sur-YvetteFrance
| |
Collapse
|
37
|
Qi H, Yu T, Yu W, Liu C. Drug-target affinity prediction with extended graph learning-convolutional networks. BMC Bioinformatics 2024; 25:75. [PMID: 38365583 PMCID: PMC10874073 DOI: 10.1186/s12859-024-05698-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 02/12/2024] [Indexed: 02/18/2024] Open
Abstract
BACKGROUND High-performance computing plays a pivotal role in computer-aided drug design, a field that holds significant promise in pharmaceutical research. The prediction of drug-target affinity (DTA) is a crucial stage in this process, potentially accelerating drug development through rapid and extensive preliminary compound screening, while also minimizing resource utilization and costs. Recently, the incorporation of deep learning into DTA prediction and the enhancement of its accuracy have emerged as key areas of interest in the research community. Drugs and targets can be characterized through various methods, including structure-based, sequence-based, and graph-based representations. Despite the progress in structure and sequence-based techniques, they tend to provide limited feature information. Conversely, graph-based approaches have risen to prominence, attracting considerable attention for their comprehensive data representation capabilities. Recent studies have focused on constructing protein and drug molecular graphs using sequences and SMILES, subsequently deriving representations through graph neural networks. However, these graph-based approaches are limited by the use of a fixed adjacent matrix of protein and drug molecular graphs for graph convolution. This limitation restricts the learning of comprehensive feature representations from intricate compound and protein structures, consequently impeding the full potential of graph-based feature representation in DTA prediction. This, in turn, significantly impacts the models' generalization capabilities in the complex realm of drug discovery. RESULTS To tackle these challenges, we introduce GLCN-DTA, a model specifically designed for proficiency in DTA tasks. GLCN-DTA innovatively integrates a graph learning module into the existing graph architecture. This module is designed to learn a soft adjacent matrix, which effectively and efficiently refines the contextual structure of protein and drug molecular graphs. This advancement allows for learning richer structural information from protein and drug molecular graphs via graph convolution, specifically tailored for DTA tasks, compared to the conventional fixed adjacent matrix approach. A series of experiments have been conducted to validate the efficacy of the proposed GLCN-DTA method across diverse scenarios. The results demonstrate that GLCN-DTA possesses advantages in terms of robustness and high accuracy. CONCLUSIONS The proposed GLCN-DTA model enhances DTA prediction performance by introducing a novel framework that synergizes graph learning operations with graph convolution operations, thereby achieving richer representations. GLCN-DTA does not distinguish between different protein classifications, including structurally ordered and intrinsically disordered proteins, focusing instead on improving feature representation. Therefore, its applicability scope may be more effective in scenarios involving structurally ordered proteins, while potentially being limited in contexts with intrinsically disordered proteins.
Collapse
Affiliation(s)
- Haiou Qi
- Nursing Department, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, 310016, China
| | - Ting Yu
- Operating Room Department, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, 310016, China.
| | - Wenwen Yu
- School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, 430074, China
| | - Chenxi Liu
- School of Medicine and Health Management, Tongji Medical School, Huazhong University of Science and Technology, Wuhan, 430030, China
| |
Collapse
|
38
|
Hendricks AR, Cohen RS, McEwen GA, Tien T, Guilliams BF, Alspach A, Snow CD, Ackerson CJ. Laboratory Evolution of Metalloid Reductase Substrate Recognition and Nanoparticle Product Size. ACS Chem Biol 2024; 19:289-299. [PMID: 38295274 DOI: 10.1021/acschembio.3c00493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2024]
Abstract
Glutathione reductase-like metalloid reductase (GRLMR) is an enzyme that reduces selenodiglutathione (GS-Se-SG), forming zerovalent Se nanoparticles (SeNPs). Error-prone polymerase chain reaction was used to create a library of ∼10,000 GRLMR variants. The library was expressed in BL21Escherichia coli in liquid culture with 50 mM of SeO32- present, under the hypothesis that the enzyme variants with improved GS-Se-SG reduction kinetics would emerge. The selection resulted in a GRLMR variant with two mutations. One of the mutations (D-E) lacks an obvious functional role, whereas the other mutation is L-H within 5 Å of the enzyme active site. This mutation places a second H residue within 5 Å of an active site dicysteine. This GRLMR variant was characterized for NADPH-dependent reduction of GS-Se-SG, GSSG, SeO32-, SeO42-, GS-Te-SG, and TeO32-. The evolved enzyme demonstrated enhanced reduction of SeO32- and gained the ability to reduce SeO42-. This variant is named selenium reductase (SeR) because of its emergent broad activity for a wide variety of Se substrates, whereas the parent enzyme was specific for GS-Se-SG. This study overall suggests that new biosynthetic routes are possible for inorganic nanomaterials using laboratory-directed evolution methods.
Collapse
Affiliation(s)
- Alexander R Hendricks
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, United States
| | - Rachel S Cohen
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, United States
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, Colorado 80523, United States
| | - Gavin A McEwen
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, United States
| | - Tony Tien
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, United States
- School of Biomedical Engineering, Colorado State University, Fort Collins, Colorado 80523, United States
| | - Bradley F Guilliams
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, United States
| | - Audrey Alspach
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, United States
| | - Christopher D Snow
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, United States
- School of Biomedical Engineering, Colorado State University, Fort Collins, Colorado 80523, United States
| | - Christopher J Ackerson
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, United States
| |
Collapse
|
39
|
Leuzzi G, Vasciaveo A, Taglialatela A, Chen X, Firestone TM, Hickman AR, Mao W, Thakar T, Vaitsiankova A, Huang JW, Cuella-Martin R, Hayward SB, Kesner JS, Ghasemzadeh A, Nambiar TS, Ho P, Rialdi A, Hebrard M, Li Y, Gao J, Gopinath S, Adeleke OA, Venters BJ, Drake CG, Baer R, Izar B, Guccione E, Keogh MC, Guerois R, Sun L, Lu C, Califano A, Ciccia A. SMARCAL1 is a dual regulator of innate immune signaling and PD-L1 expression that promotes tumor immune evasion. Cell 2024; 187:861-881.e32. [PMID: 38301646 PMCID: PMC10980358 DOI: 10.1016/j.cell.2024.01.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 07/23/2023] [Accepted: 01/05/2024] [Indexed: 02/03/2024]
Abstract
Genomic instability can trigger cancer-intrinsic innate immune responses that promote tumor rejection. However, cancer cells often evade these responses by overexpressing immune checkpoint regulators, such as PD-L1. Here, we identify the SNF2-family DNA translocase SMARCAL1 as a factor that favors tumor immune evasion by a dual mechanism involving both the suppression of innate immune signaling and the induction of PD-L1-mediated immune checkpoint responses. Mechanistically, SMARCAL1 limits endogenous DNA damage, thereby suppressing cGAS-STING-dependent signaling during cancer cell growth. Simultaneously, it cooperates with the AP-1 family member JUN to maintain chromatin accessibility at a PD-L1 transcriptional regulatory element, thereby promoting PD-L1 expression in cancer cells. SMARCAL1 loss hinders the ability of tumor cells to induce PD-L1 in response to genomic instability, enhances anti-tumor immune responses and sensitizes tumors to immune checkpoint blockade in a mouse melanoma model. Collectively, these studies uncover SMARCAL1 as a promising target for cancer immunotherapy.
Collapse
Affiliation(s)
- Giuseppe Leuzzi
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Alessandro Vasciaveo
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Systems Biology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Angelo Taglialatela
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Xiao Chen
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | | | | | - Wendy Mao
- Columbia Center for Translational Immunology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Tanay Thakar
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Alina Vaitsiankova
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Jen-Wei Huang
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Raquel Cuella-Martin
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Samuel B Hayward
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Jordan S Kesner
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Systems Biology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Ali Ghasemzadeh
- Columbia Center for Translational Immunology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Tarun S Nambiar
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Patricia Ho
- Columbia Center for Translational Immunology, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Medicine, Division of Hematology and Oncology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Alexander Rialdi
- Center for OncoGenomics and Innovative Therapeutics (COGIT), Center for Therapeutics Discovery, Department of Oncological Sciences and Pharmacological Sciences, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Maxime Hebrard
- Institute of Molecular and Cell Biology (IMCB), Agency for Science, Technology and Research (ASTAR), Singapore, Singapore
| | - Yinglu Li
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Jinmei Gao
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | | | | | | | - Charles G Drake
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA; Columbia Center for Translational Immunology, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Urology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Richard Baer
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA; Institute for Cancer Genetics, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Benjamin Izar
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Systems Biology, Columbia University Irving Medical Center, New York, NY 10032, USA; Columbia Center for Translational Immunology, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Medicine, Division of Hematology and Oncology, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Ernesto Guccione
- Center for OncoGenomics and Innovative Therapeutics (COGIT), Center for Therapeutics Discovery, Department of Oncological Sciences and Pharmacological Sciences, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | | | - Raphael Guerois
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Lu Sun
- EpiCypher Inc., Durham, NC 27709, USA
| | - Chao Lu
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Andrea Califano
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Systems Biology, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Biochemistry and Molecular Biophysics, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Medicine, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Alberto Ciccia
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY 10032, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY 10032, USA; Institute for Cancer Genetics, Columbia University Irving Medical Center, New York, NY 10032, USA.
| |
Collapse
|
40
|
Flamholz ZN, Biller SJ, Kelly L. Large language models improve annotation of prokaryotic viral proteins. Nat Microbiol 2024; 9:537-549. [PMID: 38287147 DOI: 10.1038/s41564-023-01584-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Accepted: 12/08/2023] [Indexed: 01/31/2024]
Abstract
Viral genomes are poorly annotated in metagenomic samples, representing an obstacle to understanding viral diversity and function. Current annotation approaches rely on alignment-based sequence homology methods, which are limited by the paucity of characterized viral proteins and divergence among viral sequences. Here we show that protein language models can capture prokaryotic viral protein function, enabling new portions of viral sequence space to be assigned biologically meaningful labels. When applied to global ocean virome data, our classifier expanded the annotated fraction of viral protein families by 29%. Among previously unannotated sequences, we highlight the identification of an integrase defining a mobile element in marine picocyanobacteria and a capsid protein that anchors globally widespread viral elements. Furthermore, improved high-level functional annotation provides a means to characterize similarities in genomic organization among diverse viral sequences. Protein language models thus enhance remote homology detection of viral proteins, serving as a useful complement to existing approaches.
Collapse
Affiliation(s)
- Zachary N Flamholz
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Steven J Biller
- Department of Biological Sciences, Wellesley College, Wellesley, MA, USA
| | - Libusha Kelly
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA.
- Department of Microbiology and Immunology, Albert Einstein College of Medicine, Bronx, NY, USA.
| |
Collapse
|
41
|
Kalogeropoulos K, Bohn MF, Jenkins DE, Ledergerber J, Sørensen CV, Hofmann N, Wade J, Fryer T, Thi Tuyet Nguyen G, Auf dem Keller U, Laustsen AH, Jenkins TP. A comparative study of protein structure prediction tools for challenging targets: Snake venom toxins. Toxicon 2024; 238:107559. [PMID: 38113945 DOI: 10.1016/j.toxicon.2023.107559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 12/06/2023] [Accepted: 12/08/2023] [Indexed: 12/21/2023]
Abstract
Protein structure determination is a critical aspect of biological research, enabling us to understand protein function and potential applications. Recent advances in deep learning and artificial intelligence have led to the development of several protein structure prediction tools, such as AlphaFold2 and ColabFold. However, their performance has primarily been evaluated on well-characterised proteins and their ability to predict sturtctures of proteins lacking experimental structures, such as many snake venom toxins, has been less scrutinised. In this study, we evaluated three modelling tools on their prediction of over 1000 snake venom toxin structures for which no experimental structures exist. Our findings show that AlphaFold2 (AF2) performed the best across all assessed parameters. We also observed that ColabFold (CF) only scored slightly worse than AF2, while being computationally less intensive. All tools struggled with regions of intrinsic disorder, such as loops and propeptide regions, and performed well in predicting the structure of functional domains. Overall, our study highlights the importance of exercising caution when working with proteins with no experimental structures available, particularly those that are large and contain flexible regions. Nonetheless, leveraging computational structure prediction tools can provide valuable insights into the modelling of protein interactions with different targets and reveal potential binding sites, active sites, and conformational changes, as well as into the design of potential molecular binders for reagent, diagnostic, or therapeutic purposes.
Collapse
Affiliation(s)
| | - Markus-Frederik Bohn
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby, Denmark
| | | | - Jann Ledergerber
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby, Denmark; Department of Chemistry and Applied Bioscience, ETH Zurich, Zurich, Switzerland
| | - Christoffer V Sørensen
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Nils Hofmann
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Jack Wade
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Thomas Fryer
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Giang Thi Tuyet Nguyen
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Ulrich Auf dem Keller
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Andreas H Laustsen
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Timothy P Jenkins
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby, Denmark.
| |
Collapse
|
42
|
Du Z, Peng Z, Yang J. RNA threading with secondary structure and sequence profile. Bioinformatics 2024; 40:btae080. [PMID: 38341662 PMCID: PMC10893584 DOI: 10.1093/bioinformatics/btae080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 01/05/2024] [Accepted: 02/09/2024] [Indexed: 02/12/2024] Open
Abstract
MOTIVATION RNA threading aims to identify remote homologies for template-based modeling of RNA 3D structure. Existing RNA alignment methods primarily rely on secondary structure alignment. They are often time- and memory-consuming, limiting large-scale applications. In addition, the accuracy is far from satisfactory. RESULTS Using RNA secondary structure and sequence profile, we developed a novel RNA threading algorithm, named RNAthreader. To enhance the alignment process and minimize memory usage, a novel approach has been introduced to simplify RNA secondary structures into compact diagrams. RNAthreader employs a two-step methodology. Initially, integer programming and dynamic programming are combined to create an initial alignment for the simplified diagram. Subsequently, the final alignment is obtained using dynamic programming, taking into account the initial alignment derived from the previous step. The benchmark test on 80 RNAs illustrates that RNAthreader generates more accurate alignments than other methods, especially for RNAs with pseudoknots. Another benchmark, involving 30 RNAs from the RNA-Puzzles experiments, exhibits that the models constructed using RNAthreader templates have a lower average RMSD than those created by alternative methods. Remarkably, RNAthreader takes less than two hours to complete alignments with ∼5000 RNAs, which is 3-40 times faster than other methods. These compelling results suggest that RNAthreader is a promising algorithm for RNA template detection. AVAILABILITY AND IMPLEMENTATION https://yanglab.qd.sdu.edu.cn/RNAthreader.
Collapse
Affiliation(s)
- Zongyang Du
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
| | - Zhenling Peng
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao 266237, China
| | - Jianyi Yang
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao 266237, China
| |
Collapse
|
43
|
van Kempen M, Kim SS, Tumescheit C, Mirdita M, Lee J, Gilchrist CLM, Söding J, Steinegger M. Fast and accurate protein structure search with Foldseek. Nat Biotechnol 2024; 42:243-246. [PMID: 37156916 PMCID: PMC10869269 DOI: 10.1038/s41587-023-01773-0] [Citation(s) in RCA: 196] [Impact Index Per Article: 196.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 03/30/2023] [Indexed: 05/10/2023]
Abstract
As structure prediction methods are generating millions of publicly available protein structures, searching these databases is becoming a bottleneck. Foldseek aligns the structure of a query protein against a database by describing tertiary amino acid interactions within proteins as sequences over a structural alphabet. Foldseek decreases computation times by four to five orders of magnitude with 86%, 88% and 133% of the sensitivities of Dali, TM-align and CE, respectively.
Collapse
Affiliation(s)
- Michel van Kempen
- Quantitative and Computational Biology Group, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Stephanie S Kim
- School of Biological Sciences, Seoul National University, Seoul, South Korea
| | | | - Milot Mirdita
- Quantitative and Computational Biology Group, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
- School of Biological Sciences, Seoul National University, Seoul, South Korea
| | - Jeongjae Lee
- School of Biological Sciences, Seoul National University, Seoul, South Korea
| | | | - Johannes Söding
- Quantitative and Computational Biology Group, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany.
- Campus Institute Data Science (CIDAS), Göttingen, Germany.
| | - Martin Steinegger
- School of Biological Sciences, Seoul National University, Seoul, South Korea.
- Artificial Intelligence Institute, Seoul National University, Seoul, South Korea.
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul, South Korea.
| |
Collapse
|
44
|
Zhao C, Wang S. AttCON: With better MSAs and attention mechanism for accurate protein contact map prediction. Comput Biol Med 2024; 169:107822. [PMID: 38091726 DOI: 10.1016/j.compbiomed.2023.107822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 11/19/2023] [Accepted: 12/04/2023] [Indexed: 02/08/2024]
Abstract
Protein contact map prediction is a critical and vital step in protein structure prediction, and its accuracy is highly contingent upon the feature representations of protein sequence information and the efficacy of deep learning models. In this paper, we propose an algorithm, DeepMSA+, to generate protein multiple sequence alignments (MSAs) and to construct feature representations based on co-evolutionary information and sequence information derived from MSAs. We also propose an improved deep learning model, AttCON, for training input features to predict protein contact map. The model incorporates an attention module, and by comparing different attention modules, we find a parameter-free attention module suitable for contact map prediction. Additionally, we use the Focal Loss function to better address the data imbalance issue in protein contact map. We also developed a weighted evaluation index (W score) for model evaluation, which takes into account a wide range of metrics. W score is comprehensive in its scope, with a particular focus on the precision of predictions for medium-range and long-range contacts. Experimental results show that AttCON achieves good precision results on datasets from CASP11 to CASP15. Compared to some state-of-the-art methods, it achieves an average improvement of over 5% in both medium-range and long-range predictions, and W score is improved by an average of 2 points.
Collapse
Affiliation(s)
- Che Zhao
- Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University, Kunming, 650504, Yunnan, China
| | - Shunfang Wang
- Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University, Kunming, 650504, Yunnan, China; Yunnan Key Laboratory of Intelligent Systems and Computing, Yunnan University, Kunming, 650504, Yunnan, China.
| |
Collapse
|
45
|
Sürmeli Y, Tekedar HC, Şanlı-Mohamed G. Sequence identification and in silico characterization of novel thermophilic lipases from Geobacillus species. Biotechnol Appl Biochem 2024; 71:162-175. [PMID: 37908087 DOI: 10.1002/bab.2529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Accepted: 10/07/2023] [Indexed: 11/02/2023]
Abstract
Microbial lipases are utilized in various biotechnological areas, including pharmaceuticals, food, biodiesel, and detergents. In this study, we cloned and sequenced Lip21 and Lip33 genes from Geobacillus sp. GS21 and Geobacillus sp. GS33, then we in silico and experimentally analyzed the encoded lipases. For this purpose, Lip21 and Lip33 were cloned, sequenced, and their amino acid sequences were investigated for determination of biophysicochemical characteristics, evolutionary relationships, and sequence similarities. 3D models were built and computationally affirmed by various bioinformatics tools, and enzyme-ligand interactions were investigated by docking analysis using six ligands. Biophysicochemical property of Lip21 and Lip33 was also determined experimentally and the results demonstrated that they had similar isoelectric point (pI) (6.21) and Tm (75.5°C) values as Tm was revealed by denatured protein analysis of the circular dichroism spectrum and pI was obtained by isoelectric focusing. Phylogeny analysis indicated that Lip21 and Lip33 were the closest to lipases from Geobacillus sp. SBS-4S and Geobacillus thermoleovorans, respectively. Alignment analysis demonstrated that S144-D348-H389 was catalytic triad residues in Lip21 and Lip33, and enzymes possessed a conserved Gly-X-Ser-X-Gly motif containing catalytic serine. 3D structure analysis indicated that Lip21 and Lip33 highly resembled each other and they were α/β hydrolase-fold enzymes with large lid domains. BANΔIT analysis results showed that Lip21 and Lip33 had higher thermal stability, compared to other thermostable Geobacillus lipases. Docking results revealed that Lip21- and Lip33-docked complexes possessed common residues (H112, K115, Q162, E163, and S141) that interacted with the substrates, except paranitrophenyl (pNP)-C10 and pNP-C12, indicating that these residues might have a significant action on medium and short-chain fatty acid esters. Thus, Lip21 and Lip33 can be potential candidates for different industrial applications.
Collapse
Affiliation(s)
- Yusuf Sürmeli
- Department of Agricultural Biotechnology, Tekirdağ Namık Kemal University, Tekirdağ, Turkey
- Department of Biotechnology and Bioengineering, İzmir Institute of Technology, İzmir, Turkey
| | - Hasan Cihad Tekedar
- Department of Biotechnology and Bioengineering, İzmir Institute of Technology, İzmir, Turkey
- College of Veterinary Medicine, Mississippi State University, Mississippi State, Mississippi, USA
| | - Gülşah Şanlı-Mohamed
- Department of Biotechnology and Bioengineering, İzmir Institute of Technology, İzmir, Turkey
- Department of Chemistry, İzmir Institute of Technology, İzmir, Turkey
| |
Collapse
|
46
|
Schaeffer RD, Zhang J, Medvedev KE, Kinch LN, Cong Q, Grishin NV. ECOD domain classification of 48 whole proteomes from AlphaFold Structure Database using DPAM2. PLoS Comput Biol 2024; 20:e1011586. [PMID: 38416793 PMCID: PMC10927120 DOI: 10.1371/journal.pcbi.1011586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 03/11/2024] [Accepted: 02/20/2024] [Indexed: 03/01/2024] Open
Abstract
Protein structure prediction has now been deployed widely across several different large protein sets. Large-scale domain annotation of these predictions can aid in the development of biological insights. Using our Evolutionary Classification of Protein Domains (ECOD) from experimental structures as a basis for classification, we describe the detection and cataloging of domains from 48 whole proteomes deposited in the AlphaFold Database. On average, we can provide positive classification (either of domains or other identifiable non-domain regions) for 90% of residues in all proteomes. We classified 746,349 domains from 536,808 proteins comprised of over 226,424,000 amino acid residues. We examine the varying populations of homologous groups in both eukaryotes and bacteria. In addition to containing a higher fraction of disordered regions and unassigned domains, eukaryotes show a higher proportion of repeated proteins, both globular and small repeats. We enumerate those highly populated domains that are shared in both eukaryotes and bacteria, such as the Rossmann domains, TIM barrels, and P-loop domains. Additionally, we compare the sampling of homologous groups from this whole proteome set against our stable ECOD reference and discuss groups that have been enriched by structure predictions. Finally, we discuss the implication of these results for protein target selection for future classification strategies for very large protein sets.
Collapse
Affiliation(s)
- R. Dustin Schaeffer
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - Jing Zhang
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - Kirill E. Medvedev
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - Lisa N. Kinch
- Department of Molecular Biology, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - Qian Cong
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - Nick V. Grishin
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| |
Collapse
|
47
|
Simpson GA, Rezende IF, da Silva Peixoto A, de Oliveira Soares IB, Barbosa JARG, de Freitas SM, Valadares NF. Crystal structure and interconversion of monomers and domain-swapped dimers of the walnut tree phytocystatin. BIOCHIMICA ET BIOPHYSICA ACTA. PROTEINS AND PROTEOMICS 2024; 1872:140975. [PMID: 38056804 DOI: 10.1016/j.bbapap.2023.140975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/08/2023]
Abstract
Biotechnological applications of phytocystatins have garnered significant interest due to their potential applications in crop protection and improve crop resistance to abiotic stress factors. Cof1 and Wal1 are phytocystatins derived from Coffea arabica and Juglans regia, respectively. These plants hold significant economic value due to coffee's global demand and the walnut tree's production of valuable timber and widely consumed walnuts with culinary and nutritional benefits. The study involved the heterologous expression in E. coli Lemo 21(DE3), purification by immobilized metal ion affinity and size exclusion chromatography, and biophysical characterization of both phytocystatins, focusing on isolating and interconverting their monomers and dimers. The crystal structure of the domain-swapped dimer of Wal1 was determined revealing two domain-swapped dimers in the asymmetric unit, an arrangement reminiscent of the human cystatin C structure. Alphafold models of monomers and Alphafold-Multimer models of domain-swapped dimers of Cof1 and Wal1 were analyzed in the context of the crystal structure. The methodology and data presented here contribute to a deeper understanding of the oligomerization mechanisms of phytocystatins and their potential biotechnological applications in agriculture.
Collapse
Affiliation(s)
- Gisele Alvarenga Simpson
- Laboratory of Molecular Biophysics, Department of Cellular Biology, University of Brasília, Brasília, Brazil
| | - Isabela Fernandes Rezende
- Laboratory of Molecular Biophysics, Department of Cellular Biology, University of Brasília, Brasília, Brazil
| | - Alencar da Silva Peixoto
- Laboratory of Molecular Biophysics, Department of Cellular Biology, University of Brasília, Brasília, Brazil
| | | | | | - Sônia Maria de Freitas
- Laboratory of Molecular Biophysics, Department of Cellular Biology, University of Brasília, Brasília, Brazil
| | - Napoleão Fonseca Valadares
- Laboratory of Molecular Biophysics, Department of Cellular Biology, University of Brasília, Brasília, Brazil..
| |
Collapse
|
48
|
Butenko A, Lukeš J, Speijer D, Wideman JG. Mitochondrial genomes revisited: why do different lineages retain different genes? BMC Biol 2024; 22:15. [PMID: 38273274 PMCID: PMC10809612 DOI: 10.1186/s12915-024-01824-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 01/11/2024] [Indexed: 01/27/2024] Open
Abstract
The mitochondria contain their own genome derived from an alphaproteobacterial endosymbiont. From thousands of protein-coding genes originally encoded by their ancestor, only between 1 and about 70 are encoded on extant mitochondrial genomes (mitogenomes). Thanks to a dramatically increasing number of sequenced and annotated mitogenomes a coherent picture of why some genes were lost, or relocated to the nucleus, is emerging. In this review, we describe the characteristics of mitochondria-to-nucleus gene transfer and the resulting varied content of mitogenomes across eukaryotes. We introduce a 'burst-upon-drift' model to best explain nuclear-mitochondrial population genetics with flares of transfer due to genetic drift.
Collapse
Affiliation(s)
- Anzhelika Butenko
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice (Budweis), Czech Republic
- Faculty of Science, University of Ostrava, Ostrava, Czech Republic
- Faculty of Sciences, University of South Bohemia, České Budějovice (Budweis), Czech Republic
| | - Julius Lukeš
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice (Budweis), Czech Republic
- Faculty of Sciences, University of South Bohemia, České Budějovice (Budweis), Czech Republic
| | - Dave Speijer
- Medical Biochemistry, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - Jeremy G Wideman
- Center for Mechanisms of Evolution, Biodesign Institute, School of Life Sciences, Arizona State University, Tempe, USA.
| |
Collapse
|
49
|
Bret H, Gao J, Zea DJ, Andreani J, Guerois R. From interaction networks to interfaces, scanning intrinsically disordered regions using AlphaFold2. Nat Commun 2024; 15:597. [PMID: 38238291 PMCID: PMC10796318 DOI: 10.1038/s41467-023-44288-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 12/07/2023] [Indexed: 01/22/2024] Open
Abstract
The revolution brought about by AlphaFold2 opens promising perspectives to unravel the complexity of protein-protein interaction networks. The analysis of interaction networks obtained from proteomics experiments does not systematically provide the delimitations of the interaction regions. This is of particular concern in the case of interactions mediated by intrinsically disordered regions, in which the interaction site is generally small. Using a dataset of protein-peptide complexes involving intrinsically disordered regions that are non-redundant with the structures used in AlphaFold2 training, we show that when using the full sequences of the proteins, AlphaFold2-Multimer only achieves 40% success rate in identifying the correct site and structure of the interface. By delineating the interaction region into fragments of decreasing size and combining different strategies for integrating evolutionary information, we manage to raise this success rate up to 90%. We obtain similar success rates using a much larger dataset of protein complexes taken from the ELM database. Beyond the correct identification of the interaction site, our study also explores specificity issues. We show the advantages and limitations of using the AlphaFold2 confidence score to discriminate between alternative binding partners, a task that can be particularly challenging in the case of small interaction motifs.
Collapse
Affiliation(s)
- Hélène Bret
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Jinmei Gao
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Diego Javier Zea
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Jessica Andreani
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France.
| | - Raphaël Guerois
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France.
| |
Collapse
|
50
|
Kawato S, Nozaki R, Kondo H, Hirono I. Integrase-associated niche differentiation of endogenous large DNA viruses in crustaceans. Microbiol Spectr 2024; 12:e0055923. [PMID: 38063384 PMCID: PMC10871703 DOI: 10.1128/spectrum.00559-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 11/15/2023] [Indexed: 01/13/2024] Open
Abstract
IMPORTANCE Crustacean genomes harbor sequences originating from a family of large DNA viruses called nimaviruses, but it is unclear why they are present. We show that endogenous nimaviruses selectively insert into repetitive sequences within the host genome, and this insertion specificity was correlated with different types of integrases, which are DNA recombination enzymes encoded by the nimaviruses themselves. This suggests that endogenous nimaviruses have colonized various genomic niches through the acquisition of integrases with different insertion specificities. Our results point to a novel survival strategy of endogenous large DNA viruses colonizing the host genomes. These findings may clarify the evolution and spread of nimaviruses in crustaceans and lead to measures to control and prevent the spread of pathogenic nimaviruses in aquaculture settings.
Collapse
Affiliation(s)
- Satoshi Kawato
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo, Japan
| | - Reiko Nozaki
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo, Japan
| | - Hidehiro Kondo
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo, Japan
| | - Ikuo Hirono
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo, Japan
| |
Collapse
|