1
|
Li H, Nithin C, Kmiecik S, Huang SY. Computational methods for modeling protein-protein interactions in the AI era: Current status and future directions. Drug Discov Today 2025:104382. [PMID: 40398752 DOI: 10.1016/j.drudis.2025.104382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2025] [Revised: 04/30/2025] [Accepted: 05/14/2025] [Indexed: 05/23/2025]
Abstract
The modeling of protein-protein interactions (PPIs) has been revolutionized by artificial intelligence, with deep learning and end-to-end frameworks such as AlphaFold and its derivatives now dominating the field. This review surveys the current computational landscape for predicting protein complex structures, outlining the role of traditional docking approaches as well as focusing on recent advances in AI-driven methods. We discuss key challenges, including protein flexibility, reliance on co-evolutionary signals, modeling of large assemblies, and interactions involving intrinsically disordered regions (IDRs). Recent innovations aimed at improving sampling diversity, integrating experimental data, and enhancing robustness are also highlighted. Although classical methods remain relevant in specific contexts, the continued evolution of AI-based tools offers transformative potential for structural biology. These advances are poised to deepen our understanding of biomolecular interactions and accelerate the design of therapeutic interventions.
Collapse
Affiliation(s)
- Hao Li
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Chandran Nithin
- University of Warsaw, Biological and Chemical Research Centre, Faculty of Chemistry, Warsaw, Poland
| | - Sebastian Kmiecik
- University of Warsaw, Biological and Chemical Research Centre, Faculty of Chemistry, Warsaw, Poland
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| |
Collapse
|
2
|
Maurino VG. Next generation technologies for protein structure determination: challenges and breakthroughs in plant biology applications. JOURNAL OF PLANT PHYSIOLOGY 2025; 310:154522. [PMID: 40382917 DOI: 10.1016/j.jplph.2025.154522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2025] [Revised: 05/13/2025] [Accepted: 05/14/2025] [Indexed: 05/20/2025]
Abstract
Advancements in structural biology have significantly deepened our understanding of plant proteins, which are central to critical biological functions such as photosynthesis, metabolism, signal transduction, and structural architechture. Gaining insights into their structures is crucial for unraveling their functions and mechanisms, which in turn has profound implications for agriculture, biotechnology, and environmental sustainability. Traditional methods in protein structural biology often fall short in addressing large protein assemblies and membrane proteins, and, in particular the dynamics and structural features of proteins in the native cellular context. This paper explores how next-generation technologies are transforming the field of plant protein structural biology, offering powerful tools to overcome longstanding obstacles and enabling remarkable scientific breakthroughs. Key technologies discussed include advanced X-ray crystallography, Cryo-Electron microscopy, Nuclear Magnetic Resonance spectroscopy, Cross-linking mass spectrometry, and Artificial Intelligence-driven approaches. These technologies are examined in terms of their challenges, innovations, and application with particular emphasis on their relevance to plant systems. Future directions in plant protein structural biology are also discussed. Although technical details are not covered in depth, readers are referred to the primary literature for more comprehensive information.
Collapse
Affiliation(s)
- Veronica G Maurino
- Molecular Plant Physiology, Institute for Cellular and Molecular Botany (IZMB), University of Bonn, Kirschallee 1, 53115, Bonn, Germany.
| |
Collapse
|
3
|
Zhang Y, Jindal M, Viswanath S, Sitharam M. A New Discrete-Geometry Approach for Integrative Docking of Proteins Using Chemical Cross-Links. J Chem Inf Model 2025; 65:4576-4592. [PMID: 40299996 DOI: 10.1021/acs.jcim.4c02412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2025]
Abstract
The structures of protein complexes allow us to understand and modulate the biological functions of the proteins. Integrative docking is a computational method to obtain the structures of a protein complex, given the atomic structures of the constituent proteins along with other experimental data on the complex, such as chemical cross-links or SAXS profiles. Here, we develop a new discrete geometry-based method, wall-EASAL, for integrative rigid docking of protein pairs given the structures of the constituent proteins and chemical cross-links. The method is an adaptation of efficient atlasing and search of assembly landscapes (EASAL), a state-of-the-art discrete geometry method for efficient and exhaustive sampling of macromolecular configurations under pairwise intermolecular distance constraints. We provide a mathematical proof that the method finds a structure satisfying the cross-link constraints under a natural condition satisfied by energy landscapes. We compare wall-EASAL with integrative modeling platform (IMP), a commonly used integrative modeling method, on a benchmark, varying the numbers, types, and sources of input cross-links, and sources of monomer structures. The wall-EASAL method performs similarly to IMP in terms of the average satisfaction of the configurations to the input cross-links and the average similarity of the configurations to their corresponding native structures. But wall-EASAL is more efficient than IMP and more robust against false positive cross-links in the context of binary integrative rigid docking. Although the current study uses cross-links, the method is general and any source of distance constraints can be used for integrative docking with wall-EASAL. However, the current implementation only supports binary rigid protein docking, i.e., assumes that the monomer structures are known and remain rigid. Additionally, the current implementation is deterministic, i.e., it does not account for some uncertainties in the cross-linking data, such as noise in the cross-link distances. Neither of these appears to be a theoretical or algorithmic limitation of the EASAL methodology. Structures from wall-EASAL can be incorporated in methods for modeling large macromolecular assemblies, for example by suggesting rigid bodies or restraints for use in these methods. This will facilitate the characterization of assemblies and cellular neighborhoods at increased efficiency, accuracy, and precision. The wall-EASAL method is available at https://bitbucket.org/geoplexity/easal-dev/src/Crosslink and the benchmark is available at https://github.com/isblab/Integrative_docking_benchmark.
Collapse
Affiliation(s)
- Yichi Zhang
- CISE Department, University of Florida, Gainesville 32611-6120, Florida, United States
| | - Muskaan Jindal
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru 560065, India
| | - Shruthi Viswanath
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru 560065, India
| | - Meera Sitharam
- CISE Department, University of Florida, Gainesville 32611-6120, Florida, United States
| |
Collapse
|
4
|
Armeev GA, Moiseenko AV, Motorin NA, Afonin DA, Zhao L, Vasilev VA, Oleinikov PD, Glukhov GS, Peters GS, Studitsky VM, Feofanov AV, Shaytan AK, Shi X, Sokolova OS. Structure and dynamics of a nucleosome core particle based on Widom 603 DNA sequence. Structure 2025; 33:948-959.e5. [PMID: 40101710 DOI: 10.1016/j.str.2025.02.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2024] [Revised: 01/03/2025] [Accepted: 02/19/2025] [Indexed: 03/20/2025]
Abstract
Nucleosomes are fundamental elements of chromatin organization that participate in compacting genomic DNA and serve as targets for the binding of numerous regulatory proteins. Currently, over 500 different nucleosome structures are known. Despite the large number of nucleosome structures, all of them were formed on only about twenty different DNA sequences. Using cryo-electron microscopy, we determined the structure of the nucleosome formed on a high-affinity Widom 603 DNA sequence at 4 Å resolution; an atomic model was built. We proposed an integrative modeling approach to study the nucleosomal DNA unwrapping based on the cryoelectron microscopy (cryo-EM) data. We also demonstrated the DNA unwrapping of the Widom 603 nucleosome using small angle X-ray scattering and single particle Förster resonance energy transfer measurements. Our results are consistent with the asymmetry of nucleosomal DNA unwrapping. Our data revealed the dependence of nucleosome structure and dynamics on the sequence of nucleosomal DNA.
Collapse
Affiliation(s)
- Grigoriy A Armeev
- Department of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia.
| | - Andrey V Moiseenko
- Department of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia
| | - Nikita A Motorin
- Department of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia
| | - Dmitriy A Afonin
- Department of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia; Fox Chase Cancer Center, Philadelphia, PA 19111, USA
| | - Lei Zhao
- Faculty of Biology, Shenzhen MSU-BIT University, Shenzhen, Guangdong 518172, China
| | - Veniamin A Vasilev
- Department of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia
| | - Pavel D Oleinikov
- Department of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia
| | - Grigory S Glukhov
- Department of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia; Faculty of Biology, Shenzhen MSU-BIT University, Shenzhen, Guangdong 518172, China
| | - Georgy S Peters
- Faculty of Physics, Lomonosov Moscow State University, 119234 Moscow, Russia
| | - Vasily M Studitsky
- Department of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia; Fox Chase Cancer Center, Philadelphia, PA 19111, USA
| | - Alexey V Feofanov
- Department of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia; Faculty of Biology, Shenzhen MSU-BIT University, Shenzhen, Guangdong 518172, China
| | - Alexey K Shaytan
- Department of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia
| | - Xiangyan Shi
- Faculty of Biology, Shenzhen MSU-BIT University, Shenzhen, Guangdong 518172, China
| | - Olga S Sokolova
- Faculty of Biology, Shenzhen MSU-BIT University, Shenzhen, Guangdong 518172, China.
| |
Collapse
|
5
|
Schaffer LV, Hu M, Qian G, Moon KM, Pal A, Soni N, Latham AP, Pontano Vaites L, Tsai D, Mattson NM, Licon K, Bachelder R, Cesnik A, Gaur I, Le T, Leineweber W, Palar A, Pulido E, Qin Y, Zhao X, Churas C, Lenkiewicz J, Chen J, Ono K, Pratt D, Zage P, Echeverria I, Sali A, Harper JW, Gygi SP, Foster LJ, Huttlin EL, Lundberg E, Ideker T. Multimodal cell maps as a foundation for structural and functional genomics. Nature 2025:10.1038/s41586-025-08878-3. [PMID: 40205054 DOI: 10.1038/s41586-025-08878-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Accepted: 03/10/2025] [Indexed: 04/11/2025]
Abstract
Human cells consist of a complex hierarchy of components, many of which remain unexplored1,2. Here we construct a global map of human subcellular architecture through joint measurement of biophysical interactions and immunofluorescence images for over 5,100 proteins in U2OS osteosarcoma cells. Self-supervised multimodal data integration resolves 275 molecular assemblies spanning the range of 10-8 to 10-5 m, which we validate systematically using whole-cell size-exclusion chromatography and annotate using large language models3. We explore key applications in structural biology, yielding structures for 111 heterodimeric complexes and an expanded Rag-Ragulator assembly. The map assigns unexpected functions to 975 proteins, including roles for C18orf21 in RNA processing and DPP9 in interferon signalling, and identifies assemblies with multiple localizations or cell type specificity. It decodes paediatric cancer genomes4, identifying 21 recurrently mutated assemblies and implicating 102 validated new cancer proteins. The associated Cell Visualization Portal and Mapping Toolkit provide a reference platform for structural and functional cell biology.
Collapse
Affiliation(s)
- Leah V Schaffer
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Mengzhou Hu
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Gege Qian
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA
| | - Kyung-Mee Moon
- Department of Biochemistry & Molecular Biology, Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Abantika Pal
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Neelesh Soni
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Andrew P Latham
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | | | - Dorothy Tsai
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Nicole M Mattson
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Katherine Licon
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Robin Bachelder
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Anthony Cesnik
- Department of Bioengineering, Stanford University, Palo Alto, CA, USA
| | - Ishan Gaur
- Department of Bioengineering, Stanford University, Palo Alto, CA, USA
| | - Trang Le
- Department of Bioengineering, Stanford University, Palo Alto, CA, USA
| | | | - Aji Palar
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Ernst Pulido
- Department of Bioengineering, Stanford University, Palo Alto, CA, USA
| | - Yue Qin
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
- Broad Institute of MIT and Harvard, Boston, MA, USA
| | - Xiaoyu Zhao
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Christopher Churas
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Joanna Lenkiewicz
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Jing Chen
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Keiichiro Ono
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Dexter Pratt
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Peter Zage
- Department of Pediatrics, Division of Hematology-Oncology, University of California San Diego, La Jolla, CA, USA
| | - Ignacia Echeverria
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, CA, USA
- Quantitative Biosciences Institute, University of California San Francisco, San Francisco, CA, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Quantitative Biosciences Institute, University of California San Francisco, San Francisco, CA, USA
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, USA
| | - J Wade Harper
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
| | - Steven P Gygi
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
| | - Leonard J Foster
- Department of Biochemistry & Molecular Biology, Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Edward L Huttlin
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA.
| | - Emma Lundberg
- Department of Bioengineering, Stanford University, Palo Alto, CA, USA.
- Department of Pathology, Stanford University, Palo Alto, CA, USA.
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, Stockholm, Sweden.
- Chan Zuckerberg Biohub, San Francisco, CA, USA.
| | - Trey Ideker
- Department of Medicine, University of California San Diego, La Jolla, CA, USA.
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA.
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
6
|
Kant S, Nithin C, Mukherjee S, Maity A, Bahadur RP. Protein-RNA Docking Benchmark v3.0 Integrated With Binding Affinity. Proteins 2025. [PMID: 40202108 DOI: 10.1002/prot.26825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2024] [Revised: 03/19/2025] [Accepted: 03/26/2025] [Indexed: 04/10/2025]
Abstract
We introduce an updated non-redundant protein-RNA docking benchmark version 3.0 (PRDBv3.0) containing 197 test cases curated from 288 unique protein-RNA complexes available in the Protein Data Bank until July 2024. Among these, 27 are unbound-unbound (UU) type where both the binding partners are available in their unbound states, 160 are unbound-bound (UB) type where only the protein is available in unbound state and remaining 10 are bound-unbound (BU) type where only the RNA is available in unbound state. The benchmark is categorized into three classes based on the conformational flexibility of the protein interface: 117 rigid-body (R) complexes with minimal structural changes, 41 semi-flexible (S) complexes showing moderate conformational changes and 29 full-flexible (F) complexes with significant conformational changes. The current benchmark represents a 62% increase in the number of test cases compared to its previous version. Binding affinity (Kd) values for a subset of 105 protein-RNA complexes from PRDBv3.0 are catalogued along with additional experimental details to develop a comprehensive protein-RNA affinity benchmark. Moreover, a total of 255 unique RNA-binding domains, present in RNA-binding proteins, are also catalogued in this updated benchmark. PRDBv3.0 will facilitate the evaluation of both rigid-body and flexible docking methods as well as the methods that aim to predict binding affinity. The updated benchmark is freely available at http://www.csb.iitkgp.ac.in/applications/PRDBv3/PRDBv3.php.
Collapse
Affiliation(s)
- Shri Kant
- Computational Structural Biology Laboratory, Department of Bioscience and Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India
| | - Chandran Nithin
- Biological and Chemical Research Centre, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| | - Sunandan Mukherjee
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Atanu Maity
- Bioinformatics Center, Department of Bioscience and Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Laboratory, Department of Bioscience and Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India
- Bioinformatics Center, Department of Bioscience and Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India
| |
Collapse
|
7
|
Mohammadi A, Deroo S, Leitner A, Stengel F, Krammer EM, Aebersold R, Prévost M, Raussens V. Characterization of the N- and C-terminal domain interface of the three main apoE isoforms: A combined quantitative cross-linking mass spectrometry and molecular modeling study. Biochim Biophys Acta Gen Subj 2025; 1869:130768. [PMID: 39880049 DOI: 10.1016/j.bbagen.2025.130768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2024] [Revised: 01/06/2025] [Accepted: 01/19/2025] [Indexed: 01/31/2025]
Abstract
Apolipoprotein E (apoE) polymorphism is associated with different pathologies such as atherosclerosis and Alzheimer's disease. Knowledge of the three-dimensional structure of apoE and isoform-specific structural differences are prerequisites for the rational design of small molecule structure modulators that correct the detrimental effects of pathological isoforms. In this study, cross-linking mass spectrometry (XL-MS) targeting Asp, Glu and Lys residues was used to explore the intramolecular interactions in the E2, E3 and E4 isoforms of apoE. The resulting quantitative XL-MS data combined with molecular modeling revealed isoform-specific characteristics of the N- and C-terminal domain interfaces as well as the isoform-dependent dynamic equilibrium of these interfaces. Finally, the data identified a network of salt bridges formed by R61-R112-E109 residues in the N-terminal helical bundle as a modulator of the interaction with the C-terminal domain making this network a potential drug target.
Collapse
Affiliation(s)
- Azadeh Mohammadi
- Center for Structural Biology and Bioinformatics, Université Libre de Bruxelles (ULB), Brussels, Belgium; Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; Currently at Puratos NV, Industrialaan 25, 1702 Groot-Bijgaarden, Belgium.
| | - Stéphanie Deroo
- Center for Structural Biology and Bioinformatics, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Alexander Leitner
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Florian Stengel
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; University of Konstanz, Department of Biology, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Eva-Maria Krammer
- Center for Structural Biology and Bioinformatics, Université Libre de Bruxelles (ULB), Brussels, Belgium; Unité de Glycobiologie Structurale et Fonctionnelle (UGSF), University Lille, CNRS, UMR, 8576, Lille, France
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; Faculty of Science, University of Zurich, Zurich, Switzerland
| | - Martine Prévost
- Center for Structural Biology and Bioinformatics, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Vincent Raussens
- Center for Structural Biology and Bioinformatics, Université Libre de Bruxelles (ULB), Brussels, Belgium
| |
Collapse
|
8
|
Latham AP, Rožič M, Webb BM, Sali A. Tutorial on integrative spatiotemporal modeling by integrative modeling platform. Protein Sci 2025; 34:e70107. [PMID: 40130765 PMCID: PMC11934212 DOI: 10.1002/pro.70107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2024] [Revised: 02/26/2025] [Accepted: 03/09/2025] [Indexed: 03/26/2025]
Abstract
Cells function through dynamic interactions between macromolecules. Detailed characterization of the dynamics of large biomolecular systems is often not feasible by individual biophysical methods. In such cases, it may be possible to compute useful models by integrating multiple sources of information. We have previously developed an integrative method to model dynamic processes by computing biomolecular heterogeneity at fixed time points, then generating static integrative structural modes for each of these heterogeneity models, and finally connecting these static models to produce a scored trajectory model that depicts the process. Here, we demonstrate how to compute, score, and assess these integrative spatiotemporal models using our open-source Integrative Modeling Platform (IMP) program (https://integrativemodeling.org/).
Collapse
Affiliation(s)
- Andrew P. Latham
- Quantitative Biosciences InstituteUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
- Department of Bioengineering and Therapeutic SciencesUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
- Department of Pharmaceutical ChemistryUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
| | - Miha Rožič
- Quantitative Biosciences InstituteUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
- Department of Bioengineering and Therapeutic SciencesUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
- Department of Pharmaceutical ChemistryUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
| | - Benjamin M. Webb
- Quantitative Biosciences InstituteUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
- Department of Bioengineering and Therapeutic SciencesUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
- Department of Pharmaceutical ChemistryUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
| | - Andrej Sali
- Quantitative Biosciences InstituteUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
- Department of Bioengineering and Therapeutic SciencesUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
- Department of Pharmaceutical ChemistryUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
| |
Collapse
|
9
|
Latham AP, Zhang W, Tempkin JOB, Otsuka S, Ellenberg J, Sali A. Integrative spatiotemporal modeling of biomolecular processes: Application to the assembly of the nuclear pore complex. Proc Natl Acad Sci U S A 2025; 122:e2415674122. [PMID: 40085653 PMCID: PMC11929490 DOI: 10.1073/pnas.2415674122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Accepted: 02/06/2025] [Indexed: 03/16/2025] Open
Abstract
Dynamic processes involving biomolecules are essential for the function of the cell. Here, we introduce an integrative method for computing models of these processes based on multiple heterogeneous sources of information, including time-resolved experimental data and physical models of dynamic processes. First, for each time point, a set of coarse models of compositional and structural heterogeneity is computed (heterogeneity models). Second, for each heterogeneity model, a set of static integrative structure models is computed (a snapshot model). Finally, these snapshot models are selected and connected into a series of trajectories that optimize the likelihood of both the snapshot models and transitions between them (a trajectory model). The method is demonstrated by application to the assembly process of the human nuclear pore complex in the context of the reforming nuclear envelope during mitotic cell division, based on live-cell correlated electron tomography, bulk fluorescence correlation spectroscopy-calibrated quantitative live imaging, and a structural model of the fully assembled nuclear pore complex. Modeling of the assembly process improves the model precision over static integrative structure modeling alone. The method is applicable to a wide range of time-dependent systems in cell biology and is available to the broader scientific community through an implementation in the open source Integrative Modeling Platform (IMP) software.
Collapse
Affiliation(s)
- Andrew P. Latham
- Department of Bioengineering and Therapeutic Sciences, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
- Department of Pharmaceutical Chemistry, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
| | - Wanlu Zhang
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory, Heidelberg69117, Germany
| | - Jeremy O. B. Tempkin
- Department of Bioengineering and Therapeutic Sciences, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
- Department of Pharmaceutical Chemistry, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
| | - Shotaro Otsuka
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory, Heidelberg69117, Germany
| | - Jan Ellenberg
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory, Heidelberg69117, Germany
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
- Department of Pharmaceutical Chemistry, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
| |
Collapse
|
10
|
Wang Y, Cheng J. Reconstructing 3D chromosome structures from single-cell Hi-C data with SO(3)-equivariant graph neural networks. NAR Genom Bioinform 2025; 7:lqaf027. [PMID: 40124711 PMCID: PMC11928942 DOI: 10.1093/nargab/lqaf027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2024] [Revised: 02/23/2025] [Accepted: 03/05/2025] [Indexed: 03/25/2025] Open
Abstract
The spatial conformation of chromosomes and genomes of single cells is relevant to cellular function and useful for elucidating the mechanism underlying gene expression and genome methylation. The chromosomal contacts (i.e. chromosomal regions in spatial proximity) entailing the three-dimensional (3D) structure of the genome of a single cell can be obtained by single-cell chromosome conformation capture techniques, such as single-cell Hi-C (ScHi-C). However, due to the sparsity of chromosomal contacts in ScHi-C data, it is still challenging for traditional 3D conformation optimization methods to reconstruct the 3D chromosome structures from ScHi-C data. Here, we present a machine learning-based method based on a novel SO(3)-equivariant graph neural network (HiCEGNN) to reconstruct 3D structures of chromosomes of single cells from ScHi-C data. HiCEGNN consistently outperforms both the traditional optimization methods and the only other deep learning method across diverse cells, different structural resolutions, and different noise levels of the data. Moreover, HiCEGNN is robust against the noise in the ScHi-C data.
Collapse
Affiliation(s)
- Yanli Wang
- Department of Electrical Engineering and Computer Science, NextGen Precision Health Institute, University of Missouri, Columbia, MO 65211, United States
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, NextGen Precision Health Institute, University of Missouri, Columbia, MO 65211, United States
| |
Collapse
|
11
|
Raveh B, Eliasian R, Rashkovits S, Russel D, Hayama R, Sparks S, Singh D, Lim R, Villa E, Rout MP, Cowburn D, Sali A. Integrative mapping reveals molecular features underlying the mechanism of nucleocytoplasmic transport. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2023.12.31.573409. [PMID: 38260487 PMCID: PMC10802240 DOI: 10.1101/2023.12.31.573409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Nuclear Pore Complexes (NPCs) enable rapid, selective, and robust nucleocytoplasmic transport. To explain how transport emerges from the system components and their interactions, we used experimental data and theoretical information to construct an integrative Brownian dynamics model of transport through an NPC, coupled to a kinetic model of transport in the cell. The model recapitulates key aspects of transport for a wide range of molecular cargos, including pre-ribosomes and viral capsids. It quantifies how flexible phenylalanine-glycine (FG) repeat proteins raise an entropy barrier to passive diffusion and how this barrier is selectively lowered in facilitated diffusion by the many transient interactions of nuclear transport receptors with the FG repeats. Selective transport is enhanced by "fuzzy" multivalent interactions, redundant FG repeats, coupling to the energy-dependent RanGTP concentration gradient, and exponential dependence of transport kinetics on the transport barrier. Our model will facilitate rational modulation of the NPC and its artificial mimics.
Collapse
|
12
|
Peulen TO, Hemmen K, Greife A, Webb BM, Felekyan S, Sali A, Seidel CAM, Sanabria H, Heinze KG. tttrlib: modular software for integrating fluorescence spectroscopy, imaging, and molecular modeling. Bioinformatics 2025; 41:btaf025. [PMID: 39836627 PMCID: PMC11796090 DOI: 10.1093/bioinformatics/btaf025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 01/04/2025] [Accepted: 01/17/2025] [Indexed: 01/23/2025] Open
Abstract
SUMMARY We introduce software for reading, writing and processing fluorescence single-molecule and image spectroscopy data and developing analysis pipelines to unify various spectroscopic analysis tools. Our software can be used for processing multiple experiment types, e.g. for time-resolved single-molecule spectroscopy, laser scanning microscopy, fluorescence correlation spectroscopy and image correlation spectroscopy. The software is file format agnostic and processes multiple time-resolved data formats and outputs. Our software eliminates the need for data conversion and mitigates data archiving issues. AVAILABILITY AND IMPLEMENTATION tttrlib is available via pip (https://pypi.org/project/tttrlib/) and bioconda while the open-source code is available via GitHub (https://github.com/fluorescence-tools/tttrlib). Presented examples and additional documentation demonstrating how to implement in vitro and live-cell image spectroscopy analysis are available at https://docs.peulen.xyz/tttrlib and https://zenodo.org/records/14002224.
Collapse
Affiliation(s)
- Thomas-Otavio Peulen
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and Quantitative Biosciences Institute, University of California, San Francisco, CA, 94143, United States
- Rudolf Virchow Center for Integrative and Translational Bioimaging, Julius-Maximilians-University Würzburg (JMU), Würzburg, 97080, Germany
| | - Katherina Hemmen
- Rudolf Virchow Center for Integrative and Translational Bioimaging, Julius-Maximilians-University Würzburg (JMU), Würzburg, 97080, Germany
| | - Annemarie Greife
- Chair of Molecular Physical Chemistry, Heinrich-Heine University, Düsseldorf, 40225, Germany
| | - Benjamin M Webb
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and Quantitative Biosciences Institute, University of California, San Francisco, CA, 94143, United States
| | - Suren Felekyan
- Chair of Molecular Physical Chemistry, Heinrich-Heine University, Düsseldorf, 40225, Germany
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and Quantitative Biosciences Institute, University of California, San Francisco, CA, 94143, United States
| | - Claus A M Seidel
- Chair of Molecular Physical Chemistry, Heinrich-Heine University, Düsseldorf, 40225, Germany
| | - Hugo Sanabria
- Department of Physics & Astronomy, Clemson University, Clemson, SC, 29634, United States
| | - Katrin G Heinze
- Rudolf Virchow Center for Integrative and Translational Bioimaging, Julius-Maximilians-University Würzburg (JMU), Würzburg, 97080, Germany
| |
Collapse
|
13
|
Harju J, Broedersz CP. Physical models of bacterial chromosomes. Mol Microbiol 2025; 123:143-153. [PMID: 38578226 PMCID: PMC11841833 DOI: 10.1111/mmi.15257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 03/12/2024] [Accepted: 03/18/2024] [Indexed: 04/06/2024]
Abstract
The interplay between bacterial chromosome organization and functions such as transcription and replication can be studied in increasing detail using novel experimental techniques. Interpreting the resulting quantitative data, however, can be theoretically challenging. In this minireview, we discuss how connecting experimental observations to biophysical theory and modeling can give rise to new insights on bacterial chromosome organization. We consider three flavors of models of increasing complexity: simple polymer models that explore how physical constraints, such as confinement or plectoneme branching, can affect bacterial chromosome organization; bottom-up mechanistic models that connect these constraints to their underlying causes, for instance, chromosome compaction to macromolecular crowding, or supercoiling to transcription; and finally, data-driven methods for inferring interpretable and quantitative models directly from complex experimental data. Using recent examples, we discuss how biophysical models can both deepen our understanding of how bacterial chromosomes are structured and give rise to novel predictions about bacterial chromosome organization.
Collapse
Affiliation(s)
- Janni Harju
- Department of Physics and AstronomyVrije Universiteit AmsterdamAmsterdamThe Netherlands
| | - Chase P. Broedersz
- Department of Physics and AstronomyVrije Universiteit AmsterdamAmsterdamThe Netherlands
- Department of Physics, Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScienceLudwig‐Maximilian‐University MunichMunichGermany
| |
Collapse
|
14
|
Vallat B, Webb BM, Zalevsky A, Tangmunarunkit H, Sekharan MR, Voinea S, Shafaeibejestan A, Sagendorf J, Hoch JC, Kurisu G, Morris KL, Velankar S, Kesselman C, Burley SK, Berman HM, Sali A. PDB-IHM: A System for Deposition, Curation, Validation, and Dissemination of Integrative Structures. J Mol Biol 2025:168963. [PMID: 40110969 DOI: 10.1016/j.jmb.2025.168963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2024] [Revised: 01/21/2025] [Accepted: 01/22/2025] [Indexed: 03/22/2025]
Abstract
Structures of many large biomolecular assemblies are now being determined using integrative approaches. In these approaches, information derived from multiple experimental and computational methods is combined to compute three-dimensional structures of multi-protein complexes and other macromolecular machines. A standalone prototype data resource for integrative structures called PDB-Dev was built, based on recommendations of the Integrative and Hybrid Methods (IHM) Task Force of the Worldwide Protein Data Bank (wwPDB). This effort included developing data standards and software tools for collecting, curating, validating, visualizing, archiving, and disseminating integrative structures that span diverse spatiotemporal scales and conformational states. Mechanisms have been created to validate integrative structures based on the experimental data underpinning them. Building upon this foundational framework, PDB-Dev has been further expanded to handle large dynamic macromolecular systems and integrative structures that combine, for example, experimental restraints with atomic coordinates computed by machine learning algorithms. Data standards and supporting tools have also been extended to capture information about biomolecular dynamics, such as conformational transitions and related kinetic data derived from biophysical methods. Recently, PDB-Dev was unified with the PDB archive and rebranded as PDB-IHM (pdb-ihm.org), further promoting FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data stewardship for integrative structural biology.
Collapse
Affiliation(s)
- Brinda Vallat
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Rutgers Cancer Institute, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA.
| | - Benjamin M Webb
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Bioengineering and Therapeutic Sciences, Quantitative Biosciences Institute (QBI), and Department of Pharmaceutical Chemistry, University of California, San Francisco., San Francisco, CA 94157, USA
| | - Arthur Zalevsky
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Bioengineering and Therapeutic Sciences, Quantitative Biosciences Institute (QBI), and Department of Pharmaceutical Chemistry, University of California, San Francisco., San Francisco, CA 94157, USA
| | - Hongsuda Tangmunarunkit
- Information Sciences Institute, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Monica R Sekharan
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Serban Voinea
- Information Sciences Institute, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Aref Shafaeibejestan
- Information Sciences Institute, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Jared Sagendorf
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Bioengineering and Therapeutic Sciences, Quantitative Biosciences Institute (QBI), and Department of Pharmaceutical Chemistry, University of California, San Francisco., San Francisco, CA 94157, USA
| | - Jeffrey C Hoch
- Biological Magnetic Resonance Data Bank, Department of Molecular Biology and Biophysics, University of Connecticut, Farmington, CT 06030-3305, USA
| | - Genji Kurisu
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan
| | - Kyle L Morris
- Electron Microscopy Data Bank, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Carl Kesselman
- Information Sciences Institute, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Rutgers Cancer Institute, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA; Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer Center, University of California, La Jolla, CA 92093, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Rutgers Artificial Intelligence and Data Science (RAD) Collaboratory, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles CA 90089, USA
| | - Andrej Sali
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Bioengineering and Therapeutic Sciences, Quantitative Biosciences Institute (QBI), and Department of Pharmaceutical Chemistry, University of California, San Francisco., San Francisco, CA 94157, USA
| |
Collapse
|
15
|
Majila K, Ullanat V, Viswanath S. A deep learning method for predicting interactions for intrinsically disordered regions of proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.12.19.629373. [PMID: 39763873 PMCID: PMC11702703 DOI: 10.1101/2024.12.19.629373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/14/2025]
Abstract
Intrinsically disordered proteins or regions (IDPs/IDRs) adopt diverse binding modes with different partners, ranging from ordered to multivalent to fuzzy conformations in the bound state. Characterizing IDR interfaces is challenging experimentally and computationally. Alphafold-multimer and Alphafold3, the state-of-the-art structure prediction methods, are less accurate at predicting IDR binding sites at their benchmarked confidence cutoffs. Their performance improves upon lowering the confidence cutoffs. Here, we developed Disobind, a deep-learning method that predicts inter-protein contact maps and interface residues for an IDR and a partner protein, given their sequences. It outperforms AlphaFold-multimer and AlphaFold3 at multiple confidence cutoffs. Combining the Disobind and AlphaFold-multimer predictions further improves the performance. In contrast to most current methods, Disobind considers the context of the binding partner and does not depend on structures and multiple sequence alignments. Its predictions can be used to localize IDRs in integrative structures of large assemblies and characterize and modulate IDR-mediated interactions.
Collapse
Affiliation(s)
- Kartik Majila
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India 560065
| | - Varun Ullanat
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India 560065
| | - Shruthi Viswanath
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India 560065
| |
Collapse
|
16
|
Rathore S, Gahlot D, Castin J, Pandey A, Arvindekar S, Viswanath S, Thukral L. Multiscale simulations reveal architecture of NOTCH protein and ligand specific features. Biophys J 2025; 124:393-407. [PMID: 39674890 PMCID: PMC11788485 DOI: 10.1016/j.bpj.2024.12.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 07/15/2024] [Accepted: 12/10/2024] [Indexed: 12/17/2024] Open
Abstract
NOTCH, a single-pass transmembrane protein, plays a crucial role in cell fate determination through cell-to-cell communication. It interacts with two canonical ligands, Delta-like (DLL) and Jagged (JAG), located on neighboring cells to regulate diverse cellular processes. Despite extensive studies on the functional roles of NOTCH and its ligands in cellular growth, the structural details of full-length NOTCH and its ligands remain poorly understood. In this study, we employed fragment-based modeling and multiscale simulations to study the full-length structure of the human NOTCH ectodomain, comprising 1756 amino acids. We performed coarse-grained dynamics simulations of NOTCH in both glycosylated and nonglycosylated forms to investigate the role of glycosylation in modulating its conformational dynamics. In apo form, coarse-grained simulations revealed that glycosylated NOTCH protein can transition from an elongated structure of ∼86 nm from the membrane surface to a semicompact state (∼23.81 ± 9.98 nm), which aligns with cryo-EM data. To transition from the apo form to ligand-bound forms of NOTCH, we followed an atomistic and integrative modeling approach to model the interactions between NOTCH-DLL4 and NOTCH-JAG1. Atomistic simulations of the smaller bound fragment EGF8-13 patch revealed conformational plasticity critical for NOTCH binding, while integrative modeling of full-length complexes suggested a larger binding surface than reported previously. Simulations of pathogenic mutations revealed that E360K and R448Q disrupted the NOTCH-ligand interaction surfaces, causing dissociation. In contrast, C1133Y in the Abruptex domain compromised protein stability by disrupting the domain's interaction with the ligand-binding domain in the apo form of NOTCH-ECD. These findings provide a detailed molecular understanding of NOTCH and its ligands, offering insights that could enable the development of novel therapeutic approaches to selectively target pathogenic NOTCH signaling.
Collapse
Affiliation(s)
- Surabhi Rathore
- CSIR-Institute of Genomics and Integrative Biology, New Delhi, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Deepanshi Gahlot
- CSIR-Institute of Genomics and Integrative Biology, New Delhi, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Jesu Castin
- CSIR-Institute of Genomics and Integrative Biology, New Delhi, India
| | - Arastu Pandey
- National Center for Biological Sciences (NCBS), Tata Institute of Fundamental Research (TIFR), GKVK, Bangalore, India
| | - Shreyas Arvindekar
- National Center for Biological Sciences (NCBS), Tata Institute of Fundamental Research (TIFR), GKVK, Bangalore, India
| | - Shruthi Viswanath
- National Center for Biological Sciences (NCBS), Tata Institute of Fundamental Research (TIFR), GKVK, Bangalore, India
| | - Lipi Thukral
- CSIR-Institute of Genomics and Integrative Biology, New Delhi, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India.
| |
Collapse
|
17
|
Zhang Y, Jindal M, Viswanath S, Sitharam M. A new discrete-geometry approach for integrative docking of proteins using chemical crosslinks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.10.24.619977. [PMID: 39553940 PMCID: PMC11565733 DOI: 10.1101/2024.10.24.619977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/19/2024]
Abstract
The structures of protein complexes allow us to understand and modulate the biological functions of the proteins. Integrative docking is a computational method to obtain the structures of a protein complex, given the atomic structures of the constituent proteins along with other experimental data on the complex, such as chemical crosslinks or SAXS profiles. Here, we develop a new discrete geometry-based method, wall-EASAL, for integrative rigid docking of protein pairs given the structures of the constituent proteins and chemical crosslinks. The method is an adaptation of EASAL (Efficient Atlasing and Search of Assembly Landscapes), a state-of-the-art discrete geometry method for efficient and exhaustive sampling of macromolecular configurations under pairwise inter-molecular distance constraints. We provide a mathematical proof that the method finds a structure satisfying the crosslink constraints under a natural condition satisfied by energy landscapes. We compare wall-EASAL with IMP (Integrative Modeling Platform), a commonly used integrative modeling method, on a benchmark, varying the numbers, types, and sources of input crosslinks, and sources of monomer structures. The wall-EASAL method performs better than IMP in terms of the average satisfaction of the configurations to the input crosslinks and the average similarity of the configurations to their corresponding native structures. The ensembles from IMP exhibit greater variability in these two measures. Further, wall-EASAL is more efficient than IMP. Although the current study uses crosslinks, the method is general and any source of distance constraints can be used for integrative docking with wall-EASAL. However, the current implementation only supports binary rigid protein docking, i.e., assumes that the monomer structures are known and remain rigid. Additionally, the current implementation is deterministic, i.e., it does not account for uncertainties in the crosslinking data beyond using distance bounds. Neither of these appears to be a theoretical or algorithmic limitation of the EASAL methodology. Structures from wall-EASAL can be incorporated in methods for modeling large macromolecular assemblies, for example by suggesting rigid bodies or restraints for use in these methods. This will facilitate the characterization of assemblies and cellular neighborhoods at increased efficiency, accuracy, and precision. The wall-EASAL method is available at https://bitbucket.org/geoplexity/easal-dev/src/Crosslink and the benchmark is available at https://github.com/isblab/Integrative_docking_benchmark.
Collapse
Affiliation(s)
- Yichi Zhang
- CISE Department, University of Florida, Gainesville, Florida 32611-6120, United States
| | - Muskaan Jindal
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru 560065, India
| | - Shruthi Viswanath
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru 560065, India
| | - Meera Sitharam
- CISE Department, University of Florida, Gainesville, Florida 32611-6120, United States
| |
Collapse
|
18
|
Shah SZ, Perry TN, Graziadei A, Cecatiello V, Kaliyappan T, Misiaszek AD, Müller CW, Ramsay EP, Vannini A. Structural insights into distinct mechanisms of RNA polymerase II and III recruitment to snRNA promoters. Nat Commun 2025; 16:141. [PMID: 39747245 PMCID: PMC11696126 DOI: 10.1038/s41467-024-55553-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Accepted: 12/17/2024] [Indexed: 01/04/2025] Open
Abstract
RNA polymerase III (Pol III) transcribes short, essential RNAs, including the U6 small nuclear RNA (snRNA). At U6 snRNA genes, Pol III is recruited by the snRNA Activating Protein Complex (SNAPc) and a Brf2-containing TFIIIB complex, forming a pre-initiation complex (PIC). Uniquely, SNAPc also recruits Pol II at the remaining splicesosomal snRNA genes (U1, 2, 4 and 5). The mechanism of SNAPc cross-polymerase engagement and the role of the SNAPC2 and SNAPC5 subunits remain poorly defined. Here, we present cryo-EM structures of the full-length SNAPc-containing Pol III PIC assembled on the U6 snRNA promoter in the open and melting states at 3.2-4.2 Å resolution. The structural comparison revealed differences with the Saccharomyces cerevisiae Pol III PIC and the basis of selective SNAPc engagement within Pol III and Pol II PICs. Additionally, crosslinking mass spectrometry localizes SNAPC2 and SNAPC5 near the promoter DNA, expanding upon existing descriptions of snRNA Pol III PIC structure.
Collapse
Affiliation(s)
| | | | | | | | | | - Agata D Misiaszek
- Structural and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Christoph W Müller
- Structural and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | | | | |
Collapse
|
19
|
Wu T, Cheng AY, Zhang Y, Xu J, Wu J, Wen L, Li X, Liu B, Dou X, Wang P, Zhang L, Fei J, Li J, Ouyang Z, He C. KARR-seq reveals cellular higher-order RNA structures and RNA-RNA interactions. Nat Biotechnol 2024; 42:1909-1920. [PMID: 38238480 PMCID: PMC11255127 DOI: 10.1038/s41587-023-02109-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 12/15/2023] [Indexed: 02/12/2024]
Abstract
RNA fate and function are affected by their structures and interactomes. However, how RNA and RNA-binding proteins (RBPs) assemble into higher-order structures and how RNA molecules may interact with each other to facilitate functions remain largely unknown. Here we present KARR-seq, which uses N3-kethoxal labeling and multifunctional chemical crosslinkers to covalently trap and determine RNA-RNA interactions and higher-order RNA structures inside cells, independent of local protein binding to RNA. KARR-seq depicts higher-order RNA structure and detects widespread intermolecular RNA-RNA interactions with high sensitivity and accuracy. Using KARR-seq, we show that translation represses mRNA compaction under native and stress conditions. We determined the higher-order RNA structures of respiratory syncytial virus (RSV) and vesicular stomatitis virus (VSV) and identified RNA-RNA interactions between the viruses and the host RNAs that potentially regulate viral replication.
Collapse
Affiliation(s)
- Tong Wu
- Department of Chemistry, University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Anthony Youzhi Cheng
- Department of Genetics and Genome Sciences and Institute for Systems Genomics, University of Connecticut, Farmington, CT, USA
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA, USA
- Genome Institute of Singapore, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Yuexiu Zhang
- Department of Veterinary Biosciences, College of Veterinary Medicine, The Ohio State University, Columbus, OH, USA
| | - Jiayu Xu
- Department of Veterinary Biosciences, College of Veterinary Medicine, The Ohio State University, Columbus, OH, USA
| | - Jinjun Wu
- Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, University of Chicago, Chicago, IL, USA
| | - Li Wen
- Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, University of Chicago, Chicago, IL, USA
| | - Xiao Li
- Department of Veterinary Biosciences, College of Veterinary Medicine, The Ohio State University, Columbus, OH, USA
| | - Bei Liu
- Department of Chemistry, University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Xiaoyang Dou
- Department of Chemistry, University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Pingluan Wang
- Department of Chemistry, University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Linda Zhang
- Department of Chemistry, University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Jingyi Fei
- Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, University of Chicago, Chicago, IL, USA
| | - Jianrong Li
- Department of Veterinary Biosciences, College of Veterinary Medicine, The Ohio State University, Columbus, OH, USA
| | - Zhengqing Ouyang
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA, USA.
| | - Chuan He
- Department of Chemistry, University of Chicago, Chicago, IL, USA.
- Howard Hughes Medical Institute, Chicago, IL, USA.
- Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, University of Chicago, Chicago, IL, USA.
| |
Collapse
|
20
|
Pasani S, Menon KS, Viswanath S. The molecular architecture of the desmosomal outer dense plaque by integrative structural modeling. Protein Sci 2024; 33:e5217. [PMID: 39548826 PMCID: PMC11568391 DOI: 10.1002/pro.5217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 10/22/2024] [Accepted: 10/23/2024] [Indexed: 11/18/2024]
Abstract
Desmosomes mediate cell-cell adhesion and are prevalent in tissues under mechanical stress. However, their detailed structural characterization is not available. Here, we characterized the molecular architecture of the desmosomal outer dense plaque (ODP) using Bayesian integrative structural modeling via the Integrative Modeling Platform. Starting principally from the structural interpretation of a cryo-electron tomography (cryo-ET) map of the ODP, we integrated information from x-ray crystallography, an immuno-electron microscopy study, biochemical assays, in silico predictions of transmembrane and disordered regions, homology modeling, and stereochemistry information. The integrative structure was validated by information from imaging, tomography, and biochemical studies that were not used in modeling. The ODP resembles a densely packed cylinder with a plakophilin (PKP) layer and a plakoglobin (PG) layer; the desmosomal cadherins and PKP span these two layers. Our integrative approach allowed us to localize disordered regions, such as the N-terminus of PKP and the C-terminus of PG. We refined previous protein-protein interactions between desmosomal proteins and provided possible structural hypotheses for defective cell-cell adhesion in several diseases by mapping disease-related mutations on the structure. Finally, we point to features of the structure that could confer resilience to mechanical stress. Our model provides a basis for generating experimentally verifiable hypotheses on the structure and function of desmosomal proteins in normal and disease states.
Collapse
Affiliation(s)
- Satwik Pasani
- National Center for Biological SciencesTata Institute of Fundamental ResearchBengaluruIndia
| | - Kavya S. Menon
- National Center for Biological SciencesTata Institute of Fundamental ResearchBengaluruIndia
| | - Shruthi Viswanath
- National Center for Biological SciencesTata Institute of Fundamental ResearchBengaluruIndia
| |
Collapse
|
21
|
Pellequer JL. Perspectives Toward an Integrative Structural Biology Pipeline With Atomic Force Microscopy Topographic Images. J Mol Recognit 2024; 37:e3102. [PMID: 39329418 DOI: 10.1002/jmr.3102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 08/21/2024] [Accepted: 09/03/2024] [Indexed: 09/28/2024]
Abstract
After the recent double revolutions in structural biology, which include the use of direct detectors for cryo-electron microscopy resulting in a significant improvement in the expected resolution of large macromolecule structures, and the advent of AlphaFold which allows for near-accurate prediction of any protein structures, the field of structural biology is now pursuing more ambitious targets, including several MDa assemblies. But complex target systems cannot be tackled using a single biophysical technique. The field of integrative structural biology has emerged as a global solution. The aim is to integrate data from multiple complementary techniques to produce a final three-dimensional model that cannot be obtained from any single technique. The absence of atomic force microscopy data from integrative structural biology platforms is not necessarily due to its nm resolution, as opposed to Å resolution for x-ray crystallography, nuclear magnetic resonance, or electron microscopy. Rather a significant issue was that the AFM topographic data lacked interpretability. Fortunately, with the introduction of the AFM-Assembly pipeline and other similar tools, it is now possible to integrate AFM topographic data into integrative modeling platforms. The advantages of single molecule techniques, such as AFM, include the ability to confirm experimentally any assembled molecular models or to produce alternative conformations that mimic the inherent flexibility of large proteins or complexes. The review begins with a brief overview of the historical developments of AFM data in structural biology, followed by an examination of the strengths and limitations of AFM imaging, which have hindered its integration into modern modeling platforms. This review discusses the correction and improvement of AFM topographic images, as well as the principles behind the AFM-Assembly pipeline. It also presents and discusses a series of challenges that need to be addressed in order to improve the incorporation of AFM data into integrative modeling platform.
Collapse
Affiliation(s)
- Jean-Luc Pellequer
- Univ. Grenoble Alpes, CEA, CNRS, Institut de Biologie Structurale (IBS), Grenoble, France
| |
Collapse
|
22
|
Singh D, Soni N, Hutchings J, Echeverria I, Shaikh F, Duquette M, Suslov S, Li Z, van Eeuwen T, Molloy K, Shi Y, Wang J, Guo Q, Chait BT, Fernandez-Martinez J, Rout MP, Sali A, Villa E. The molecular architecture of the nuclear basket. Cell 2024; 187:5267-5281.e13. [PMID: 39127037 DOI: 10.1016/j.cell.2024.07.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 05/24/2024] [Accepted: 07/12/2024] [Indexed: 08/12/2024]
Abstract
The nuclear pore complex (NPC) is the sole mediator of nucleocytoplasmic transport. Despite great advances in understanding its conserved core architecture, the peripheral regions can exhibit considerable variation within and between species. One such structure is the cage-like nuclear basket. Despite its crucial roles in mRNA surveillance and chromatin organization, an architectural understanding has remained elusive. Using in-cell cryo-electron tomography and subtomogram analysis, we explored the NPC's structural variations and the nuclear basket across fungi (yeast; S. cerevisiae), mammals (mouse; M. musculus), and protozoa (T. gondii). Using integrative structural modeling, we computed a model of the basket in yeast and mammals that revealed how a hub of nucleoporins (Nups) in the nuclear ring binds to basket-forming Mlp/Tpr proteins: the coiled-coil domains of Mlp/Tpr form the struts of the basket, while their unstructured termini constitute the basket distal densities, which potentially serve as a docking site for mRNA preprocessing before nucleocytoplasmic transport.
Collapse
Affiliation(s)
- Digvijay Singh
- School of Biological Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Neelesh Soni
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Joshua Hutchings
- School of Biological Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Ignacia Echeverria
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Farhaz Shaikh
- School of Biological Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Madeleine Duquette
- School of Biological Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Sergey Suslov
- School of Biological Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Zhixun Li
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, School of Life Sciences, Peking University, Beijing 100871, P.R. China
| | - Trevor van Eeuwen
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY 10065, USA
| | - Kelly Molloy
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY 10065, USA
| | - Yi Shi
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY 10065, USA
| | - Junjie Wang
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY 10065, USA
| | - Qiang Guo
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, School of Life Sciences, Peking University, Beijing 100871, P.R. China
| | - Brian T Chait
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY 10065, USA
| | - Javier Fernandez-Martinez
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY 10065, USA; Ikerbasque, Basque Foundation for Science, 48013 Bilbao, Spain; Instituto Biofisika (UPV/EHU, CSIC), University of the Basque Country, 48940 Leioa, Spain.
| | - Michael P Rout
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY 10065, USA.
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA; Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA.
| | - Elizabeth Villa
- School of Biological Sciences, University of California, San Diego, La Jolla, CA 92093, USA; Howard Hughes Medical Institute, University of California, San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
23
|
Vallat B, Webb BM, Westbrook JD, Goddard TD, Hanke CA, Graziadei A, Peisach E, Zalevsky A, Sagendorf J, Tangmunarunkit H, Voinea S, Sekharan M, Yu J, Bonvin AAMJJ, DiMaio F, Hummer G, Meiler J, Tajkhorshid E, Ferrin TE, Lawson CL, Leitner A, Rappsilber J, Seidel CAM, Jeffries CM, Burley SK, Hoch JC, Kurisu G, Morris K, Patwardhan A, Velankar S, Schwede T, Trewhella J, Kesselman C, Berman HM, Sali A. IHMCIF: An Extension of the PDBx/mmCIF Data Standard for Integrative Structure Determination Methods. J Mol Biol 2024; 436:168546. [PMID: 38508301 PMCID: PMC11377171 DOI: 10.1016/j.jmb.2024.168546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 03/11/2024] [Accepted: 03/14/2024] [Indexed: 03/22/2024]
Abstract
IHMCIF (github.com/ihmwg/IHMCIF) is a data information framework that supports archiving and disseminating macromolecular structures determined by integrative or hybrid modeling (IHM), and making them Findable, Accessible, Interoperable, and Reusable (FAIR). IHMCIF is an extension of the Protein Data Bank Exchange/macromolecular Crystallographic Information Framework (PDBx/mmCIF) that serves as the framework for the Protein Data Bank (PDB) to archive experimentally determined atomic structures of biological macromolecules and their complexes with one another and small molecule ligands (e.g., enzyme cofactors and drugs). IHMCIF serves as the foundational data standard for the PDB-Dev prototype system, developed for archiving and disseminating integrative structures. It utilizes a flexible data representation to describe integrative structures that span multiple spatiotemporal scales and structural states with definitions for restraints from a variety of experimental methods contributing to integrative structural biology. The IHMCIF extension was created with the benefit of considerable community input and recommendations gathered by the Worldwide Protein Data Bank (wwPDB) Task Force for Integrative or Hybrid Methods (wwpdb.org/task/hybrid). Herein, we describe the development of IHMCIF to support evolving methodologies and ongoing advancements in integrative structural biology. Ultimately, IHMCIF will facilitate the unification of PDB-Dev data and tools with the PDB archive so that integrative structures can be archived and disseminated through PDB.
Collapse
Affiliation(s)
- Brinda Vallat
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA.
| | - Benjamin M Webb
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, the Quantitative Biosciences Institute (QBI), and the Research Collaboratory for Structural Bioinformatics Protein Data Bank, University of California, San Francisco, San Francisco, CA 94157, USA
| | - John D Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA
| | - Thomas D Goddard
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158, USA
| | - Christian A Hanke
- Molecular Physical Chemistry, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Andrea Graziadei
- Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, 10623 Berlin, Germany; Human Technopole, 20157 Milan, Italy
| | - Ezra Peisach
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Arthur Zalevsky
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, the Quantitative Biosciences Institute (QBI), and the Research Collaboratory for Structural Bioinformatics Protein Data Bank, University of California, San Francisco, San Francisco, CA 94157, USA
| | - Jared Sagendorf
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, the Quantitative Biosciences Institute (QBI), and the Research Collaboratory for Structural Bioinformatics Protein Data Bank, University of California, San Francisco, San Francisco, CA 94157, USA
| | - Hongsuda Tangmunarunkit
- Information Sciences Institute, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Serban Voinea
- Information Sciences Institute, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Monica Sekharan
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Jian Yu
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan
| | - Alexander A M J J Bonvin
- Bijvoet Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH Utrecht, the Netherlands
| | - Frank DiMaio
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Gerhard Hummer
- Department of Theoretical Biophysics, Max Planck Institute of Biophysics, 60438 Frankfurt am Main, Germany; Institute for Biophysics, Goethe University Frankfurt, 60438 Frankfurt am Main, Germany
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, 465 21st Avenue South, Nashville, TN 37221, USA; Institute for Drug Discovery, Leipzig University Medical School, 04103 Leipzig, Germany
| | - Emad Tajkhorshid
- NIH Resource for Macromolecular Modeling and Visualization, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, and Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Thomas E Ferrin
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158, USA
| | - Catherine L Lawson
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Alexander Leitner
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093 Zurich, Switzerland
| | - Juri Rappsilber
- Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, 10623 Berlin, Germany; Wellcome Centre for Cell Biology, University of Edinburgh, Max Born Crescent, Edinburgh EH9 3BF, UK
| | - Claus A M Seidel
- Molecular Physical Chemistry, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Cy M Jeffries
- European Molecular Biology Laboratory (EMBL), Hamburg Unit, c/o Deutsches Elektronen-Synchrotron (DESY), Notkestrasse 85, 22607 Hamburg, Germany
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA; Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer Center, University of California, La Jolla, CA 92093, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Jeffrey C Hoch
- Biological Magnetic Resonance Data Bank, Department of Molecular Biology and Biophysics, University of Connecticut, Farmington, CT 06030-3305, USA
| | - Genji Kurisu
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan
| | - Kyle Morris
- Electron Microscopy Data Bank, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Ardan Patwardhan
- Electron Microscopy Data Bank, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Torsten Schwede
- Biozentrum, University of Basel, Basel, Switzerland; Computational Structural Biology & SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Jill Trewhella
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW 2006, Australia; Department of Chemistry, University of Utah, Salt Lake City, UT 84112, USA
| | - Carl Kesselman
- Information Sciences Institute, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles CA 90089, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, the Quantitative Biosciences Institute (QBI), and the Research Collaboratory for Structural Bioinformatics Protein Data Bank, University of California, San Francisco, San Francisco, CA 94157, USA
| |
Collapse
|
24
|
Muscat S, Martino G, Manigrasso J, Marcia M, De Vivo M. On the Power and Challenges of Atomistic Molecular Dynamics to Investigate RNA Molecules. J Chem Theory Comput 2024. [PMID: 39150960 DOI: 10.1021/acs.jctc.4c00773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2024]
Abstract
RNA molecules play a vital role in biological processes within the cell, with significant implications for science and medicine. Notably, the biological functions exerted by specific RNA molecules are often linked to the RNA conformational ensemble. However, the experimental characterization of such three-dimensional RNA structures is challenged by the structural heterogeneity of RNA and by its multiple dynamic interactions with binding partners such as small molecules, proteins, and metal ions. Consequently, our current understanding of the structure-function relationship of RNA molecules is still limited. In this context, we highlight molecular dynamics (MD) simulations as a powerful tool to complement experimental efforts on RNAs. Despite the recognized limitations of current force fields for RNA MD simulations, examining the dynamics of selected RNAs has provided valuable functional insights into their structures.
Collapse
Affiliation(s)
- Stefano Muscat
- Laboratory of Molecular Modelling and Drug Discovery, Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genoa, Italy
| | - Gianfranco Martino
- Laboratory of Molecular Modelling and Drug Discovery, Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genoa, Italy
| | - Jacopo Manigrasso
- Medicinal Chemistry, Research and Early Development, Cardiovascular, Renal and Metabolism (CVRM), BioPharmaceuticals R&D, AstraZeneca, 431 50 Mölndal, Sweden
| | - Marco Marcia
- European Molecular Biology Laboratory Grenoble, 71 Avenue des Martyrs, 38042 Grenoble, France
- Department of Cell and Molecular Biology, Uppsala University, Husargatan 3, 751 23 Uppsala, Sweden
- Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genoa, Italy
| | - Marco De Vivo
- Laboratory of Molecular Modelling and Drug Discovery, Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genoa, Italy
| |
Collapse
|
25
|
Liu X, Zhang Y, Wen Z, Hao Y, Banks CA, Cesare J, Bhattacharya S, Arvindekar S, Lange JJ, Xie Y, Garcia BA, Slaughter BD, Unruh JR, Viswanath S, Florens L, Workman JL, Washburn MP. An integrated structural model of the DNA damage-responsive H3K4me3 binding WDR76:SPIN1 complex with the nucleosome. Proc Natl Acad Sci U S A 2024; 121:e2318601121. [PMID: 39116123 PMCID: PMC11331135 DOI: 10.1073/pnas.2318601121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 06/21/2024] [Indexed: 08/10/2024] Open
Abstract
Serial capture affinity purification (SCAP) is a powerful method to isolate a specific protein complex. When combined with cross-linking mass spectrometry and computational approaches, one can build an integrated structural model of the isolated complex. Here, we applied SCAP to dissect a subpopulation of WDR76 in complex with SPIN1, a histone reader that recognizes trimethylated histone H3 lysine4 (H3K4me3). In contrast to a previous SCAP analysis of the SPIN1:SPINDOC complex, histones and the H3K4me3 mark were enriched with the WDR76:SPIN1 complex. Next, interaction network analysis of copurifying proteins and microscopy analysis revealed a potential role of the WDR76:SPIN1 complex in the DNA damage response. Since we detected 149 pairs of cross-links between WDR76, SPIN1, and histones, we then built an integrated structural model of the complex where SPIN1 recognized the H3K4me3 epigenetic mark while interacting with WDR76. Finally, we used the powerful Bayesian Integrative Modeling approach as implemented in the Integrative Modeling Platform to build a model of WDR76 and SPIN1 bound to the nucleosome.
Collapse
Affiliation(s)
- Xingyu Liu
- Stowers Institute for Medical Research, Kansas City, MO64110
| | - Ying Zhang
- Stowers Institute for Medical Research, Kansas City, MO64110
| | - Zhihui Wen
- Stowers Institute for Medical Research, Kansas City, MO64110
| | - Yan Hao
- Stowers Institute for Medical Research, Kansas City, MO64110
| | | | - Joseph Cesare
- Stowers Institute for Medical Research, Kansas City, MO64110
- Medical Scientist Training Program, Department of Cancer Biology, University of Kansas Medical Center, Kansas City, KS66150
| | | | - Shreyas Arvindekar
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore560065, India
| | | | - Yixuan Xie
- Department of Biochemistry and Molecular Biophysics, Washington University St. Louis School of Medicine, St. Louis, MO63110
| | - Benjamin A. Garcia
- Department of Biochemistry and Molecular Biophysics, Washington University St. Louis School of Medicine, St. Louis, MO63110
| | | | - Jay R. Unruh
- Stowers Institute for Medical Research, Kansas City, MO64110
| | - Shruthi Viswanath
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore560065, India
| | | | | | | |
Collapse
|
26
|
Gao J, Tong M, Lee C, Gaertig J, Legal T, Bui KH. DomainFit: Identification of protein domains in cryo-EM maps at intermediate resolution using AlphaFold2-predicted models. Structure 2024; 32:1248-1259.e5. [PMID: 38754431 PMCID: PMC11316655 DOI: 10.1016/j.str.2024.04.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 03/18/2024] [Accepted: 04/19/2024] [Indexed: 05/18/2024]
Abstract
Cryoelectron microscopy (cryo-EM) has revolutionized the structural determination of macromolecular complexes. With the paradigm shift to structure determination of highly complex endogenous macromolecular complexes ex vivo and in situ structural biology, there are an increasing number of structures of native complexes. These complexes often contain unidentified proteins, related to different cellular states or processes. Identifying proteins at resolutions lower than 4 Å remains challenging because side chains cannot be visualized reliably. Here, we present DomainFit, a program for semi-automated domain-level protein identification from cryo-EM maps, particularly at resolutions lower than 4 Å. By fitting domains from AlphaFold2-predicted models into cryo-EM maps, the program performs statistical analyses and attempts to identify the domains and protein candidates forming the density. Using DomainFit, we identified two microtubule inner proteins, one of which contains a CCDC81 domain and is exclusively localized in the proximal region of the doublet microtubule in Tetrahymena thermophila.
Collapse
Affiliation(s)
- Jerry Gao
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Montréal, QC H3A 0C7, Canada; Centre de recherche en biologie structurale, McGill University, Montréal, QC H3G 0B1, Canada
| | - Maxwell Tong
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Montréal, QC H3A 0C7, Canada; Centre de recherche en biologie structurale, McGill University, Montréal, QC H3G 0B1, Canada
| | - Chinkyu Lee
- Department of Cellular Biology, University of Georgia, Athens 30602-2607, GA, USA
| | - Jacek Gaertig
- Department of Cellular Biology, University of Georgia, Athens 30602-2607, GA, USA
| | - Thibault Legal
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Montréal, QC H3A 0C7, Canada; Centre de recherche en biologie structurale, McGill University, Montréal, QC H3G 0B1, Canada.
| | - Khanh Huy Bui
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Montréal, QC H3A 0C7, Canada; Centre de recherche en biologie structurale, McGill University, Montréal, QC H3G 0B1, Canada.
| |
Collapse
|
27
|
Frasnetti E, Magni A, Castelli M, Serapian SA, Moroni E, Colombo G. Structures, dynamics, complexes, and functions: From classic computation to artificial intelligence. Curr Opin Struct Biol 2024; 87:102835. [PMID: 38744148 DOI: 10.1016/j.sbi.2024.102835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 04/14/2024] [Accepted: 04/22/2024] [Indexed: 05/16/2024]
Abstract
Computational approaches can provide highly detailed insight into the molecular recognition processes that underlie drug binding, the assembly of protein complexes, and the regulation of biological functional processes. Classical simulation methods can bridge a wide range of length- and time-scales typically involved in such processes. Lately, automated learning and artificial intelligence methods have shown the potential to expand the reach of physics-based approaches, ushering in the possibility to model and even design complex protein architectures. The synergy between atomistic simulations and AI methods is an emerging frontier with a huge potential for advances in structural biology. Herein, we explore various examples and frameworks for these approaches, providing select instances and applications that illustrate their impact on fundamental biomolecular problems.
Collapse
Affiliation(s)
- Elena Frasnetti
- Department of Chemistry, University of Pavia, via Taramelli 12, 27100 Pavia, Italy
| | - Andrea Magni
- Department of Chemistry, University of Pavia, via Taramelli 12, 27100 Pavia, Italy
| | - Matteo Castelli
- Department of Chemistry, University of Pavia, via Taramelli 12, 27100 Pavia, Italy
| | - Stefano A Serapian
- Department of Chemistry, University of Pavia, via Taramelli 12, 27100 Pavia, Italy
| | | | - Giorgio Colombo
- Department of Chemistry, University of Pavia, via Taramelli 12, 27100 Pavia, Italy.
| |
Collapse
|
28
|
Shor B, Schneidman-Duhovny D. Integrative modeling meets deep learning: Recent advances in modeling protein assemblies. Curr Opin Struct Biol 2024; 87:102841. [PMID: 38795564 DOI: 10.1016/j.sbi.2024.102841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 04/24/2024] [Accepted: 04/27/2024] [Indexed: 05/28/2024]
Abstract
Recent progress in protein structure prediction based on deep learning revolutionized the field of Structural Biology. Beyond single proteins, it also enabled high-throughput prediction of structures of protein-protein interactions. Despite the success in predicting complex structures, large macromolecular assemblies still require specialized approaches. Here we describe recent advances in modeling macromolecular assemblies using integrative and hierarchical approaches. We highlight applications that predict protein-protein interactions and challenges in modeling complexes based on the interaction networks, including the prediction of complex stoichiometry and heterogeneity.
Collapse
Affiliation(s)
- Ben Shor
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel. https://twitter.com/ben_shor
| | - Dina Schneidman-Duhovny
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel.
| |
Collapse
|
29
|
Chen G, Zhang Z. IDRWalker: A Random Walk Based Tool for Generating Intrinsically Disordered Regions in Large Protein Complexes. ACS OMEGA 2024; 9:32059-32065. [PMID: 39072126 PMCID: PMC11270708 DOI: 10.1021/acsomega.4c04161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 06/16/2024] [Accepted: 06/27/2024] [Indexed: 07/30/2024]
Abstract
Intrinsically disordered regions (IDRs), which may be functionally important, are common in proteins. However, the structures of IDRs are often missing due to their highly dynamic nature. In the study of IDRs, integrative modeling combining computational simulations and experimental data is a common approach, for which initial structures of the IDRs need to be built. However, applying this method to large protein complexes is challenging because existing structure generation tools are sometimes unsuitable for IDRs in large systems. To facilitate convenient and rapid structure generation of IDRs in large protein complexes, we developed a computational tool named IDRWalker based on self-avoiding random walks. Three protein complexes were used to illustrate the efficiency of the tool, and it was found that IDRs in more than 800 chains of the nuclear pore complex could be generated in minutes. These structures of large protein complexes with added IDRs can be further used to run computational simulations for integrative modeling.
Collapse
Affiliation(s)
- Guanglin Chen
- Department
of Physics, University of Science and Technology
of China, Hefei, Anhui 230026, PR China
| | - Zhiyong Zhang
- Department
of Physics, University of Science and Technology
of China, Hefei, Anhui 230026, PR China
- MOE
Key Laboratory for Cellular Dynamics, University of Science and Technology
of China, Hefei, Anhui 230026, PR
China
| |
Collapse
|
30
|
Clark T, Mohan J, Schaffer L, Obernier K, Al Manir S, Churas CP, Dailamy A, Doctor Y, Forget A, Hansen JN, Hu M, Lenkiewicz J, Levinson MA, Marquez C, Nourreddine S, Niestroy J, Pratt D, Qian G, Thaker S, Bélisle-Pipon JC, Brandt C, Chen J, Ding Y, Fodeh S, Krogan N, Lundberg E, Mali P, Payne-Foster P, Ratcliffe S, Ravitsky V, Sali A, Schulz W, Ideker T. Cell Maps for Artificial Intelligence: AI-Ready Maps of Human Cell Architecture from Disease-Relevant Cell Lines. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.21.589311. [PMID: 38826258 PMCID: PMC11142054 DOI: 10.1101/2024.05.21.589311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]
Abstract
This article describes the Cell Maps for Artificial Intelligence (CM4AI) project and its goals, methods, standards, current datasets, software tools , status, and future directions. CM4AI is the Functional Genomics Data Generation Project in the U.S. National Institute of Health's (NIH) Bridge2AI program. Its overarching mission is to produce ethical, AI-ready datasets of cell architecture, inferred from multimodal data collected for human cell lines, to enable transformative biomedical AI research.
Collapse
|
31
|
Jagielnicki M, Kucharska I, Bennett BC, Harris AL, Yeager M. Connexin Gap Junction Channels and Hemichannels: Insights from High-Resolution Structures. BIOLOGY 2024; 13:298. [PMID: 38785780 PMCID: PMC11117596 DOI: 10.3390/biology13050298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 03/29/2024] [Accepted: 04/08/2024] [Indexed: 05/25/2024]
Abstract
Connexins (Cxs) are a family of integral membrane proteins, which function as both hexameric hemichannels (HCs) and dodecameric gap junction channels (GJCs), behaving as conduits for the electrical and molecular communication between cells and between cells and the extracellular environment, respectively. Their proper functioning is crucial for many processes, including development, physiology, and response to disease and trauma. Abnormal GJC and HC communication can lead to numerous pathological states including inflammation, skin diseases, deafness, nervous system disorders, and cardiac arrhythmias. Over the last 15 years, high-resolution X-ray and electron cryomicroscopy (cryoEM) structures for seven Cx isoforms have revealed conservation in the four-helix transmembrane (TM) bundle of each subunit; an αβ fold in the disulfide-bonded extracellular loops and inter-subunit hydrogen bonding across the extracellular gap that mediates end-to-end docking to form a tight seal between hexamers in the GJC. Tissue injury is associated with cellular Ca2+ overload. Surprisingly, the binding of 12 Ca2+ ions in the Cx26 GJC results in a novel electrostatic gating mechanism that blocks cation permeation. In contrast, acidic pH during tissue injury elicits association of the N-terminal (NT) domains that sterically blocks the pore in a "ball-and-chain" fashion. The NT domains under physiologic conditions display multiple conformational states, stabilized by protein-protein and protein-lipid interactions, which may relate to gating mechanisms. The cryoEM maps also revealed putative lipid densities within the pore, intercalated among transmembrane α-helices and between protomers, the functions of which are unknown. For the future, time-resolved cryoEM of isolated Cx channels as well as cryotomography of GJCs and HCs in cells and tissues will yield a deeper insight into the mechanisms for channel regulation. The cytoplasmic loop (CL) and C-terminal (CT) domains are divergent in sequence and length, are likely involved in channel regulation, but are not visualized in the high-resolution X-ray and cryoEM maps presumably due to conformational flexibility. We expect that the integrated use of synergistic physicochemical, spectroscopic, biophysical, and computational methods will reveal conformational dynamics relevant to functional states. We anticipate that such a wealth of results under different pathologic conditions will accelerate drug discovery related to Cx channel modulation.
Collapse
Affiliation(s)
- Maciej Jagielnicki
- The Phillip and Patricia Frost Institute for Chemistry and Molecular Science, Department of Chemistry, University of Miami, 1201 Memorial Drive, Miami, FL 33146, USA; (M.J.); (I.K.)
| | - Iga Kucharska
- The Phillip and Patricia Frost Institute for Chemistry and Molecular Science, Department of Chemistry, University of Miami, 1201 Memorial Drive, Miami, FL 33146, USA; (M.J.); (I.K.)
| | - Brad C. Bennett
- Department of Biological and Environmental Sciences, Howard College of Arts and Sciences, Samford University, Birmingham, AL 35229, USA;
| | - Andrew L. Harris
- Rutgers New Jersey Medical School, Department of Pharmacology, Physiology and Neuroscience, Newark, NJ 07103, USA;
| | - Mark Yeager
- The Phillip and Patricia Frost Institute for Chemistry and Molecular Science, Department of Chemistry, University of Miami, 1201 Memorial Drive, Miami, FL 33146, USA; (M.J.); (I.K.)
- The Phillip and Patricia Frost Institute for Chemistry and Molecular Science, Department of Biochemistry and Molecular Biology, University of Miami, Miami, FL 33146, USA
| |
Collapse
|
32
|
Singh D, Soni N, Hutchings J, Echeverria I, Shaikh F, Duquette M, Suslov S, Li Z, van Eeuwen T, Molloy K, Shi Y, Wang J, Guo Q, Chait BT, Fernandez-Martinez J, Rout MP, Sali A, Villa E. The Molecular Architecture of the Nuclear Basket. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.27.587068. [PMID: 38586009 PMCID: PMC10996695 DOI: 10.1101/2024.03.27.587068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Abstract
The nuclear pore complex (NPC) is the sole mediator of nucleocytoplasmic transport. Despite great advances in understanding its conserved core architecture, the peripheral regions can exhibit considerable variation within and between species. One such structure is the cage-like nuclear basket. Despite its crucial roles in mRNA surveillance and chromatin organization, an architectural understanding has remained elusive. Using in-cell cryo-electron tomography and subtomogram analysis, we explored the NPC's structural variations and the nuclear basket across fungi (yeast; S. cerevisiae), mammals (mouse; M. musculus), and protozoa (T. gondii). Using integrative structural modeling, we computed a model of the basket in yeast and mammals that revealed how a hub of Nups in the nuclear ring binds to basket-forming Mlp/Tpr proteins: the coiled-coil domains of Mlp/Tpr form the struts of the basket, while their unstructured termini constitute the basket distal densities, which potentially serve as a docking site for mRNA preprocessing before nucleocytoplasmic transport.
Collapse
Affiliation(s)
- Digvijay Singh
- School of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Neelesh Soni
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA
| | - Joshua Hutchings
- School of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Ignacia Echeverria
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Farhaz Shaikh
- School of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Madeleine Duquette
- School of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Sergey Suslov
- School of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Zhixun Li
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, School of Life Sciences, Peking University, Beijing 100871, P. R. China
| | - Trevor van Eeuwen
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY 10065, USA
| | - Kelly Molloy
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY 10065, USA
| | - Yi Shi
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY 10065, USA
| | - Junjie Wang
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY 10065, USA
| | - Qiang Guo
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, School of Life Sciences, Peking University, Beijing 100871, P. R. China
| | - Brian T Chait
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY 10065, USA
| | - Javier Fernandez-Martinez
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY 10065, USA
- Ikerbasque, Basque Foundation for Science, 48013 Bilbao, Spain
- Instituto Biofisika (UPV/EHU, CSIC), University of the Basque Country, 48940 Leioa, Spain
| | - Michael P Rout
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY 10065, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA
- Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Elizabeth Villa
- School of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
- Howard Hughes Medical Institute, University of California San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
33
|
Pasani S, Menon KS, Viswanath S. The molecular architecture of the desmosomal outer dense plaque by integrative structural modeling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.06.13.544884. [PMID: 37398295 PMCID: PMC10312763 DOI: 10.1101/2023.06.13.544884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Desmosomes mediate cell-cell adhesion and are prevalent in tissues under mechanical stress. However, their detailed structural characterization is not available. Here, we characterized the molecular architecture of the desmosomal outer dense plaque (ODP) using Bayesian integrative structural modeling via the Integrative Modeling Platform. Starting principally from the structural interpretation of an electron cryo-tomogram, we integrated information from X-ray crystallography, an immuno-electron microscopy study, biochemical assays, in-silico predictions of transmembrane and disordered regions, homology modeling, and stereochemistry information. The integrative structure was validated by information from imaging, tomography, and biochemical studies that were not used in modeling. The ODP resembles a densely packed cylinder with a PKP layer and a PG layer; the desmosomal cadherins and PKP span these two layers. Our integrative approach allowed us to localize disordered regions, such as N-PKP and PG-C. We refined previous protein-protein interactions between desmosomal proteins and provided possible structural hypotheses for defective cell-cell adhesion in several diseases by mapping disease-related mutations on the structure. Finally, we point to features of the structure that could confer resilience to mechanical stress. Our model provides a basis for generating experimentally verifiable hypotheses on the structure and function of desmosomal proteins in normal and disease states.
Collapse
Affiliation(s)
- Satwik Pasani
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru 560065, India
| | - Kavya S Menon
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru 560065, India
| | - Shruthi Viswanath
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru 560065, India
| |
Collapse
|
34
|
Arvindekar S, Pathak AS, Majila K, Viswanath S. Optimizing representations for integrative structural modeling using Bayesian model selection. Bioinformatics 2024; 40:btae106. [PMID: 38391029 PMCID: PMC10924281 DOI: 10.1093/bioinformatics/btae106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 02/03/2024] [Accepted: 02/21/2024] [Indexed: 02/24/2024] Open
Abstract
MOTIVATION Integrative structural modeling combines data from experiments, physical principles, statistics of previous structures, and prior models to obtain structures of macromolecular assemblies that are challenging to characterize experimentally. The choice of model representation is a key decision in integrative modeling, as it dictates the accuracy of scoring, efficiency of sampling, and resolution of analysis. But currently, the choice is usually made ad hoc, manually. RESULTS Here, we report NestOR (Nested Sampling for Optimizing Representation), a fully automated, statistically rigorous method based on Bayesian model selection to identify the optimal coarse-grained representation for a given integrative modeling setup. Given an integrative modeling setup, it determines the optimal representations from given candidate representations based on their model evidence and sampling efficiency. The performance of NestOR was evaluated on a benchmark of four macromolecular assemblies. AVAILABILITY AND IMPLEMENTATION NestOR is implemented in the Integrative Modeling Platform (https://integrativemodeling.org) and is available at https://github.com/isblab/nestor. Data for the benchmark is at https://www.doi.org/10.5281/zenodo.10360718.
Collapse
Affiliation(s)
- Shreyas Arvindekar
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore 560065, India
| | - Aditi S Pathak
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore 560065, India
| | - Kartik Majila
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore 560065, India
| | - Shruthi Viswanath
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore 560065, India
| |
Collapse
|
35
|
Di Ianni A, Di Ianni A, Cowan K, Barbero LM, Sirtori FR. Leveraging Cross-Linking Mass Spectrometry for Modeling Antibody-Antigen Complexes. J Proteome Res 2024; 23:1049-1061. [PMID: 38372774 DOI: 10.1021/acs.jproteome.3c00816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Elucidating antibody-antigen complexes at the atomic level is of utmost interest for understanding immune responses and designing better therapies. Cross-linking mass spectrometry (XL-MS) has emerged as a powerful tool for mapping protein-protein interactions, suggesting valuable structural insights. However, the use of XL-MS studies to enable epitope/paratope mapping of antibody-antigen complexes is still limited up to now. XL-MS data can be used to drive integrative modeling of antibody-antigen complexes, where cross-links information serves as distance restraints for the precise determination of binding interfaces. In this approach, XL-MS data are employed to identify connections between binding interfaces of the antibody and the antigen, thus informing molecular modeling. Current literature provides minimal input about the impact of XL-MS data on the integrative modeling of antibody-antigen complexes. Here, we applied XL-MS to retrieve information about binding interfaces of three antibody-antigen complexes. We leveraged XL-MS data to perform integrative modeling using HADDOCK (active-passive residues and distance restraints strategies) and AlphaLink2. We then compared these three approaches with initial predictions of investigated antibody-antigen complexes by AlphaFold Multimer. This work emphasizes the importance of cross-linking data in resolving conformational dynamics of antibody-antigen complexes, ultimately enhancing the design of better protein therapeutics and vaccines.
Collapse
Affiliation(s)
- Andrea Di Ianni
- NBE-DMPK Innovative BioAnalytics, Merck Serono RBM S.p.A., an Affiliate of Merck KGaA, Darmstadt, Germany, Via Ribes 1, Colleretto Giacosa (TO) 10010, Italy
- University of Turin, Molecular Biotechnology Center, Department of Molecular Biotechnology and Health Sciences, University of Turin, Turin 10126, Italy
| | - Alessio Di Ianni
- Martin Luther University Halle-Wittenberg, Department of Pharmaceutical Chemistry and Bioanalytics, Center for Structural Mass Spectrometry, Institute of Pharmacy, Kurt-Mothes-Str. 3, Halle/Saale D-06120, Germany
| | - Kyra Cowan
- New Biological Entities, Drug Metabolism and Pharmacokinetics (NBE-DMPK), Research and Development, Merck KGaA, Frankfurterstrasse 250, Darmstadt 64293, Germany
| | - Luca M Barbero
- NBE-DMPK Innovative BioAnalytics, Merck Serono RBM S.p.A., an Affiliate of Merck KGaA, Darmstadt, Germany, Via Ribes 1, Colleretto Giacosa (TO) 10010, Italy
| | - Federico Riccardi Sirtori
- NBE-DMPK Innovative BioAnalytics, Merck Serono RBM S.p.A., an Affiliate of Merck KGaA, Darmstadt, Germany, Via Ribes 1, Colleretto Giacosa (TO) 10010, Italy
| |
Collapse
|
36
|
Shor B, Schneidman-Duhovny D. CombFold: predicting structures of large protein assemblies using a combinatorial assembly algorithm and AlphaFold2. Nat Methods 2024; 21:477-487. [PMID: 38326495 PMCID: PMC10927564 DOI: 10.1038/s41592-024-02174-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 01/09/2024] [Indexed: 02/09/2024]
Abstract
Deep learning models, such as AlphaFold2 and RosettaFold, enable high-accuracy protein structure prediction. However, large protein complexes are still challenging to predict due to their size and the complexity of interactions between multiple subunits. Here we present CombFold, a combinatorial and hierarchical assembly algorithm for predicting structures of large protein complexes utilizing pairwise interactions between subunits predicted by AlphaFold2. CombFold accurately predicted (TM-score >0.7) 72% of the complexes among the top-10 predictions in two datasets of 60 large, asymmetric assemblies. Moreover, the structural coverage of predicted complexes was 20% higher compared to corresponding Protein Data Bank entries. We applied the method on complexes from Complex Portal with known stoichiometry but without known structure and obtained high-confidence predictions. CombFold supports the integration of distance restraints based on crosslinking mass spectrometry and fast enumeration of possible complex stoichiometries. CombFold's high accuracy makes it a promising tool for expanding structural coverage beyond monomeric proteins.
Collapse
Affiliation(s)
- Ben Shor
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Dina Schneidman-Duhovny
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel.
| |
Collapse
|
37
|
Beck M, Covino R, Hänelt I, Müller-McNicoll M. Understanding the cell: Future views of structural biology. Cell 2024; 187:545-562. [PMID: 38306981 DOI: 10.1016/j.cell.2023.12.017] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 12/05/2023] [Accepted: 12/11/2023] [Indexed: 02/04/2024]
Abstract
Determining the structure and mechanisms of all individual functional modules of cells at high molecular detail has often been seen as equal to understanding how cells work. Recent technical advances have led to a flush of high-resolution structures of various macromolecular machines, but despite this wealth of detailed information, our understanding of cellular function remains incomplete. Here, we discuss present-day limitations of structural biology and highlight novel technologies that may enable us to analyze molecular functions directly inside cells. We predict that the progression toward structural cell biology will involve a shift toward conceptualizing a 4D virtual reality of cells using digital twins. These will capture cellular segments in a highly enriched molecular detail, include dynamic changes, and facilitate simulations of molecular processes, leading to novel and experimentally testable predictions. Transferring biological questions into algorithms that learn from the existing wealth of data and explore novel solutions may ultimately unveil how cells work.
Collapse
Affiliation(s)
- Martin Beck
- Max Planck Institute of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany; Goethe University Frankfurt, Frankfurt, Germany.
| | - Roberto Covino
- Frankfurt Institute for Advanced Studies, Ruth-Moufang-Straße 1, 60438 Frankfurt am Main, Germany.
| | - Inga Hänelt
- Goethe University Frankfurt, Frankfurt, Germany.
| | | |
Collapse
|
38
|
Hoff SE, Zinke M, Izadi-Pruneyre N, Bonomi M. Bonds and bytes: The odyssey of structural biology. Curr Opin Struct Biol 2024; 84:102746. [PMID: 38101027 DOI: 10.1016/j.sbi.2023.102746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 11/20/2023] [Accepted: 11/24/2023] [Indexed: 12/17/2023]
Abstract
Characterizing structural and dynamic properties of proteins and large macromolecular assemblies is crucial to understand the molecular mechanisms underlying biological functions. In the field of structural biology, no single method comprehensively reveals the behavior of biological systems across various spatiotemporal scales. Instead, we have a versatile toolkit of techniques, each contributing a piece to the overall puzzle. Integrative structural biology combines different techniques to create accurate and precise multi-scale models that expand our understanding of complex biological systems. This review outlines recent advancements in computational and experimental methods in structural biology, with special focus on recent Artificial Intelligence techniques, emphasizes integrative approaches that combine different types of data for precise spatiotemporal modeling, and provides an outlook into future directions of this field.
Collapse
Affiliation(s)
- S E Hoff
- Institut Pasteur, Université Paris Cité, CNRS UMR 3528, Structural Bioinformatics Unit, Paris, France
| | - M Zinke
- Institut Pasteur, Université Paris Cité, CNRS UMR 3528, Bacterial Transmembrane Systems Unit, Paris, France. https://twitter.com/ZinkeMaximilian
| | - N Izadi-Pruneyre
- Institut Pasteur, Université Paris Cité, CNRS UMR 3528, Bacterial Transmembrane Systems Unit, Paris, France.
| | - M Bonomi
- Institut Pasteur, Université Paris Cité, CNRS UMR 3528, Structural Bioinformatics Unit, Paris, France.
| |
Collapse
|
39
|
Liu T, Qiu QT, Hua KJ, Ma BG. Chromosome structure modeling tools and their evaluation in bacteria. Brief Bioinform 2024; 25:bbae044. [PMID: 38385874 PMCID: PMC10883143 DOI: 10.1093/bib/bbae044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 12/31/2023] [Accepted: 01/22/2024] [Indexed: 02/23/2024] Open
Abstract
The three-dimensional (3D) structure of bacterial chromosomes is crucial for understanding chromosome function. With the growing availability of high-throughput chromosome conformation capture (3C/Hi-C) data, the 3D structure reconstruction algorithms have become powerful tools to study bacterial chromosome structure and function. It is highly desired to have a recommendation on the chromosome structure reconstruction tools to facilitate the prokaryotic 3D genomics. In this work, we review existing chromosome 3D structure reconstruction algorithms and classify them based on their underlying computational models into two categories: constraint-based modeling and thermodynamics-based modeling. We briefly compare these algorithms utilizing 3C/Hi-C datasets and fluorescence microscopy data obtained from Escherichia coli and Caulobacter crescentus, as well as simulated datasets. We discuss current challenges in the 3D reconstruction algorithms for bacterial chromosomes, primarily focusing on software usability. Finally, we briefly prospect future research directions for bacterial chromosome structure reconstruction algorithms.
Collapse
Affiliation(s)
- Tong Liu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Qin-Tian Qiu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Kang-Jian Hua
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Bin-Guang Ma
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
40
|
Goodsell DS, Autin L. Integrative modeling of JCVI-Syn3A nucleoids with a modular approach. Curr Res Struct Biol 2023; 7:100121. [PMID: 38221989 PMCID: PMC10784680 DOI: 10.1016/j.crstbi.2023.100121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 12/15/2023] [Accepted: 12/18/2023] [Indexed: 01/16/2024] Open
Abstract
A lattice-based method is presented for creating 3D models of entire bacterial nucleoids integrating ultrastructural information cryoelectron tomography, genomic and proteomic data, and experimental atomic structures of biomolecules and assemblies. The method is used to generate models of the minimal genome bacterium JCVI-Syn3A, producing a series of models that test hypotheses about transcription, condensation, and overall distribution of the genome. Lattice models are also used to generate atomic models of an entire JCVI-Syn3A cell.
Collapse
Affiliation(s)
- David S. Goodsell
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
- Research Collaboratory for Structural Bioinformatics Protein Data and Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA
| | - Ludovic Autin
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
| |
Collapse
|
41
|
Arvindekar S, Pathak AS, Majila K, Viswanath S. Optimizing representations for integrative structural modeling using Bayesian model selection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.12.571227. [PMID: 38168172 PMCID: PMC10760022 DOI: 10.1101/2023.12.12.571227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Motivation Integrative structural modeling combines data from experiments, physical principles, statistics of previous structures, and prior models to obtain structures of macromolecular assemblies that are challenging to characterize experimentally. The choice of model representation is a key decision in integrative modeling, as it dictates the accuracy of scoring, efficiency of sampling, and resolution of analysis. But currently, the choice is usually made ad hoc, manually. Results Here, we report NestOR (Nested Sampling for Optimizing Representation), a fully automated, statistically rigorous method based on Bayesian model selection to identify the optimal coarse-grained representation for a given integrative modeling setup. Given an integrative modeling setup, it determines the optimal representations from given candidate representations based on their model evidence and sampling efficiency. The performance of NestOR was evaluated on a benchmark of four macromolecular assemblies. Availability NestOR is implemented in the Integrative Modeling Platform (https://integrativemodeling.org) and is available at https://github.com/isblab/nestor.
Collapse
Affiliation(s)
- Shreyas Arvindekar
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India 560065
| | - Aditi S. Pathak
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India 560065
| | - Kartik Majila
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India 560065
| | - Shruthi Viswanath
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India 560065
| |
Collapse
|
42
|
Akey CW, Echeverria I, Ouch C, Nudelman I, Shi Y, Wang J, Chait BT, Sali A, Fernandez-Martinez J, Rout MP. Implications of a multiscale structure of the yeast nuclear pore complex. Mol Cell 2023; 83:3283-3302.e5. [PMID: 37738963 PMCID: PMC10630966 DOI: 10.1016/j.molcel.2023.08.025] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 06/23/2023] [Accepted: 08/24/2023] [Indexed: 09/24/2023]
Abstract
Nuclear pore complexes (NPCs) direct the nucleocytoplasmic transport of macromolecules. Here, we provide a composite multiscale structure of the yeast NPC, based on improved 3D density maps from cryogenic electron microscopy and AlphaFold2 models. Key features of the inner and outer rings were integrated into a comprehensive model. We resolved flexible connectors that tie together the core scaffold, along with equatorial transmembrane complexes and a lumenal ring that anchor this channel within the pore membrane. The organization of the nuclear double outer ring reveals an architecture that may be shared with ancestral NPCs. Additional connections between the core scaffold and the central transporter suggest that under certain conditions, a degree of local organization is present at the periphery of the transport machinery. These connectors may couple conformational changes in the scaffold to the central transporter to modulate transport. Collectively, this analysis provides insights into assembly, transport, and NPC evolution.
Collapse
Affiliation(s)
- Christopher W Akey
- Department of Pharmacology, Physiology and Biophysics, Boston University, Chobanian and Avedisian School of Medicine, 700 Albany Street, Boston, MA 02118, USA.
| | - Ignacia Echeverria
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA; Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Christna Ouch
- Department of Pharmacology, Physiology and Biophysics, Boston University, Chobanian and Avedisian School of Medicine, 700 Albany Street, Boston, MA 02118, USA; Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, 364 Plantation St., Worcester, MA 01605, USA
| | - Ilona Nudelman
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY 10065, USA
| | - Yi Shi
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY, USA
| | - Junjie Wang
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY, USA
| | - Brian T Chait
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA; Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Javier Fernandez-Martinez
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY 10065, USA; Ikerbasque, Basque Foundation for Science, 48013 Bilbao, Spain; Instituto Biofisika (UPV/EHU, CSIC), University of the Basque Country, 48940 Leioa, Spain
| | - Michael P Rout
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY 10065, USA.
| |
Collapse
|
43
|
Ghanaeian A, Majhi S, McCafferty CL, Nami B, Black CS, Yang SK, Legal T, Papoulas O, Janowska M, Valente-Paterno M, Marcotte EM, Wloga D, Bui KH. Integrated modeling of the Nexin-dynein regulatory complex reveals its regulatory mechanism. Nat Commun 2023; 14:5741. [PMID: 37714832 PMCID: PMC10504270 DOI: 10.1038/s41467-023-41480-7] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 09/05/2023] [Indexed: 09/17/2023] Open
Abstract
Cilia are hairlike protrusions that project from the surface of eukaryotic cells and play key roles in cell signaling and motility. Ciliary motility is regulated by the conserved nexin-dynein regulatory complex (N-DRC), which links adjacent doublet microtubules and regulates and coordinates the activity of outer doublet complexes. Despite its critical role in cilia motility, the assembly and molecular basis of the regulatory mechanism are poorly understood. Here, using cryo-electron microscopy in conjunction with biochemical cross-linking and integrative modeling, we localize 12 DRC subunits in the N-DRC structure of Tetrahymena thermophila. We also find that the CCDC96/113 complex is in close contact with the DRC9/10 in the linker region. In addition, we reveal that the N-DRC is associated with a network of coiled-coil proteins that most likely mediates N-DRC regulatory activity.
Collapse
Affiliation(s)
- Avrin Ghanaeian
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| | - Sumita Majhi
- Laboratory of Cytoskeleton and Cilia Biology, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093, Warsaw, Poland
| | - Caitlyn L McCafferty
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, University of Texas, Austin, TX, USA
| | - Babak Nami
- Genetics and Genome Biology Program, Hospital for Sick Children, Toronto, Canada
| | - Corbin S Black
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| | - Shun Kai Yang
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| | - Thibault Legal
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| | - Ophelia Papoulas
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, University of Texas, Austin, TX, USA
| | - Martyna Janowska
- Laboratory of Cytoskeleton and Cilia Biology, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093, Warsaw, Poland
- Laboratory of Immunology, Mossakowski Institute of Experimental and Clinical Medicine, Polish Academy of Science, Pawinskiego 5, 02-106, Warsaw, Poland
| | - Melissa Valente-Paterno
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| | - Edward M Marcotte
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, University of Texas, Austin, TX, USA
| | - Dorota Wloga
- Laboratory of Cytoskeleton and Cilia Biology, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093, Warsaw, Poland.
| | - Khanh Huy Bui
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada.
| |
Collapse
|
44
|
Cohen S, Schneidman-Duhovny D. A deep learning model for predicting optimal distance range in crosslinking mass spectrometry data. Proteomics 2023; 23:e2200341. [PMID: 37070547 DOI: 10.1002/pmic.202200341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 04/02/2023] [Accepted: 04/03/2023] [Indexed: 04/19/2023]
Abstract
Macromolecular assemblies play an important role in all cellular processes. While there has recently been significant progress in protein structure prediction based on deep learning, large protein complexes cannot be predicted with these approaches. The integrative structure modeling approach characterizes multi-subunit complexes by computational integration of data from fast and accessible experimental techniques. Crosslinking mass spectrometry is one such technique that provides spatial information about the proximity of crosslinked residues. One of the challenges in interpreting crosslinking datasets is designing a scoring function that, given a structure, can quantify how well it fits the data. Most approaches set an upper bound on the distance between Cα atoms of crosslinked residues and calculate a fraction of satisfied crosslinks. However, the distance spanned by the crosslinker greatly depends on the neighborhood of the crosslinked residues. Here, we design a deep learning model for predicting the optimal distance range for a crosslinked residue pair based on the structures of their neighborhoods. We find that our model can predict the distance range with the area under the receiver-operator curve of 0.86 and 0.7 for intra- and inter-protein crosslinks, respectively. Our deep scoring function can be used in a range of structure modeling applications.
Collapse
Affiliation(s)
- Shon Cohen
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Dina Schneidman-Duhovny
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|
45
|
Habeck M. Bayesian methods in integrative structure modeling. Biol Chem 2023; 404:741-754. [PMID: 37505205 DOI: 10.1515/hsz-2023-0145] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 07/07/2023] [Indexed: 07/29/2023]
Abstract
There is a growing interest in characterizing the structure and dynamics of large biomolecular assemblies and their interactions within the cellular environment. A diverse array of experimental techniques allows us to study biomolecular systems on a variety of length and time scales. These techniques range from imaging with light, X-rays or electrons, to spectroscopic methods, cross-linking mass spectrometry and functional genomics approaches, and are complemented by AI-assisted protein structure prediction methods. A challenge is to integrate all of these data into a model of the system and its functional dynamics. This review focuses on Bayesian approaches to integrative structure modeling. We sketch the principles of Bayesian inference, highlight recent applications to integrative modeling and conclude with a discussion of current challenges and future perspectives.
Collapse
Affiliation(s)
- Michael Habeck
- Microscopic Image Analysis Group, Jena University Hospital, D-07743 Jena, Germany
- Max Planck Institute for Multidisciplinary Sciences, d-37077 Göttingen, Germany
| |
Collapse
|
46
|
Singh R, Sledzieski S, Bryson B, Cowen L, Berger B. Contrastive learning in protein language space predicts interactions between drugs and protein targets. Proc Natl Acad Sci U S A 2023; 120:e2220778120. [PMID: 37289807 PMCID: PMC10268324 DOI: 10.1073/pnas.2220778120] [Citation(s) in RCA: 59] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 04/10/2023] [Indexed: 06/10/2023] Open
Abstract
Sequence-based prediction of drug-target interactions has the potential to accelerate drug discovery by complementing experimental screens. Such computational prediction needs to be generalizable and scalable while remaining sensitive to subtle variations in the inputs. However, current computational techniques fail to simultaneously meet these goals, often sacrificing performance of one to achieve the others. We develop a deep learning model, ConPLex, successfully leveraging the advances in pretrained protein language models ("PLex") and employing a protein-anchored contrastive coembedding ("Con") to outperform state-of-the-art approaches. ConPLex achieves high accuracy, broad adaptivity to unseen data, and specificity against decoy compounds. It makes predictions of binding based on the distance between learned representations, enabling predictions at the scale of massive compound libraries and the human proteome. Experimental testing of 19 kinase-drug interaction predictions validated 12 interactions, including four with subnanomolar affinity, plus a strongly binding EPHB1 inhibitor (KD = 1.3 nM). Furthermore, ConPLex embeddings are interpretable, which enables us to visualize the drug-target embedding space and use embeddings to characterize the function of human cell-surface proteins. We anticipate that ConPLex will facilitate efficient drug discovery by making highly sensitive in silico drug screening feasible at the genome scale. ConPLex is available open source at https://ConPLex.csail.mit.edu.
Collapse
Affiliation(s)
- Rohit Singh
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA02139
| | - Samuel Sledzieski
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA02139
| | - Bryan Bryson
- Ragon Institute of MGH, MIT and Harvard, Cambridge, MA02139
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA02139
| | - Lenore Cowen
- Department of Computer Science, Tufts University, Medford, MA02155
| | - Bonnie Berger
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA02139
- Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA02139
| |
Collapse
|
47
|
Ellison MA, Namjilsuren S, Shirra M, Blacksmith M, Schusteff R, Kerr E, Fang F, Xiang Y, Shi Y, Arndt K. Spt6 directly interacts with Cdc73 and is required for Paf1 complex occupancy at active genes in Saccharomyces cerevisiae. Nucleic Acids Res 2023; 51:4814-4830. [PMID: 36928138 PMCID: PMC10250246 DOI: 10.1093/nar/gkad180] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 02/21/2023] [Accepted: 02/28/2023] [Indexed: 03/18/2023] Open
Abstract
The Paf1 complex (Paf1C) is a conserved transcription elongation factor that regulates transcription elongation efficiency, facilitates co-transcriptional histone modifications, and impacts molecular processes linked to RNA synthesis, such as polyA site selection. Coupling of the activities of Paf1C to transcription elongation requires its association with RNA polymerase II (Pol II). Mutational studies in yeast identified Paf1C subunits Cdc73 and Rtf1 as important mediators of Paf1C association with Pol II on active genes. While the interaction between Rtf1 and the general elongation factor Spt5 is relatively well-understood, the interactions involving Cdc73 have not been fully elucidated. Using a site-specific protein cross-linking strategy in yeast cells, we identified direct interactions between Cdc73 and two components of the Pol II elongation complex, the elongation factor Spt6 and the largest subunit of Pol II. Both of these interactions require the tandem SH2 domain of Spt6. We also show that Cdc73 and Spt6 can interact in vitro and that rapid depletion of Spt6 dissociates Paf1 from chromatin, altering patterns of Paf1C-dependent histone modifications genome-wide. These results reveal interactions between Cdc73 and the Pol II elongation complex and identify Spt6 as a key factor contributing to the occupancy of Paf1C at active genes in Saccharomyces cerevisiae.
Collapse
Affiliation(s)
- Mitchell A Ellison
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | | | - Margaret K Shirra
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Matthew S Blacksmith
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Rachel A Schusteff
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Eleanor M Kerr
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Fei Fang
- Department of Cell Biology, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Yufei Xiang
- Department of Cell Biology, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Yi Shi
- Department of Cell Biology, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Karen M Arndt
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| |
Collapse
|
48
|
Ghanaeian A, Majhi S, McCaffrey CL, Nami B, Black CS, Yang SK, Legal T, Papoulas O, Janowska M, Valente-Paterno M, Marcotte EM, Wloga D, Bui KH. Integrated modeling of the Nexin-dynein regulatory complex reveals its regulatory mechanism. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.31.543107. [PMID: 37398254 PMCID: PMC10312493 DOI: 10.1101/2023.05.31.543107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Cilia are hairlike protrusions that project from the surface of eukaryotic cells and play key roles in cell signaling and motility. Ciliary motility is regulated by the conserved nexin-dynein regulatory complex (N-DRC), which links adjacent doublet microtubules and regulates and coordinates the activity of outer doublet complexes. Despite its critical role in cilia motility, the assembly and molecular basis of the regulatory mechanism are poorly understood. Here, utilizing cryo-electron microscopy in conjunction with biochemical cross-linking and integrative modeling, we localized 12 DRC subunits in the N-DRC structure of Tetrahymena thermophila . We also found that the CCDC96/113 complex is in close contact with the N-DRC. In addition, we revealed that the N-DRC is associated with a network of coiled-coil proteins that most likely mediates N-DRC regulatory activity.
Collapse
Affiliation(s)
- Avrin Ghanaeian
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| | - Sumita Majhi
- Laboratory of Cytoskeleton and Cilia Biology, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093, Warsaw, Poland
| | - Caitie L McCaffrey
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, University of Texas, Austin, United States
| | - Babak Nami
- Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Canada
| | - Corbin S Black
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| | - Shun Kai Yang
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| | - Thibault Legal
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| | - Ophelia Papoulas
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, University of Texas, Austin, United States
| | - Martyna Janowska
- Laboratory of Cytoskeleton and Cilia Biology, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093, Warsaw, Poland
- current address: Laboratory of Immunology, Mossakowski Institute of Experimental and Clinical Medicine, Polish Academy of Science, Pawinskiego 5, 02-106 Warsaw, Poland
| | - Melissa Valente-Paterno
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| | - Edward M Marcotte
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, University of Texas, Austin, United States
| | - Dorota Wloga
- Laboratory of Cytoskeleton and Cilia Biology, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093, Warsaw, Poland
| | - Khanh Huy Bui
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
| |
Collapse
|
49
|
Vermeire PJ, Lilina AV, Hashim HM, Dlabolová L, Fiala J, Beelen S, Kukačka Z, Harvey JN, Novák P, Strelkov SV. Molecular structure of soluble vimentin tetramers. Sci Rep 2023; 13:8841. [PMID: 37258554 PMCID: PMC10232555 DOI: 10.1038/s41598-023-34814-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 05/08/2023] [Indexed: 06/02/2023] Open
Abstract
Intermediate filaments (IFs) are essential constituents of the metazoan cytoskeleton. A vast family of cytoplasmic IF proteins are capable of self-assembly from soluble tetrameric species into typical 10-12 nm wide filaments. The primary structure of these proteins includes the signature central 'rod' domain of ~ 300 residues which forms a dimeric α-helical coiled coil composed of three segments (coil1A, coil1B and coil2) interconnected by non-helical, flexible linkers (L1 and L12). The rod is flanked by flexible terminal head and tail domains. At present, the molecular architecture of mature IFs is only poorly known, limiting our capacity to rationalize the effect of numerous disease-related mutations found in IF proteins. Here we addressed the molecular structure of soluble vimentin tetramers which are formed by two antiparallel, staggered dimers with coil1B domains aligned (A11 tetramers). By examining a series of progressive truncations, we show that the presence of the coil1A domain is essential for the tetramer formation. In addition, we employed a novel chemical cross-linking pipeline including isotope labelling to identify intra- and interdimeric cross-links within the tetramer. We conclude that the tetramer is synergistically stabilized by the interactions of the aligned coil1B domains, the interactions between coil1A and the N-terminal portion of coil2, and the electrostatic attraction between the oppositely charged head and rod domains. Our cross-linking data indicate that, starting with a straight A11 tetramer, flexibility of linkers L1 and L12 enables 'backfolding' of both the coil1A and coil2 domains onto the tetrameric core formed by the coil1B domains. Through additional small-angle X-ray scattering experiments we show that the elongated A11 tetramers dominate in low ionic strength solutions, while there is also a significant structural flexibility especially in the terminal domains.
Collapse
Affiliation(s)
| | | | - Hani M Hashim
- Laboratory for Biocrystallography, KU Leuven, 3000, Leuven, Belgium
- Department of Chemistry, KU Leuven, 3000, Leuven, Belgium
| | - Lada Dlabolová
- Department of Biochemistry, Charles University, 12800, Prague, Czech Republic
- Institute of Microbiology of the Czech Academy of Sciences, 14220, Prague, Czech Republic
| | - Jan Fiala
- Department of Biochemistry, Charles University, 12800, Prague, Czech Republic
- Institute of Microbiology of the Czech Academy of Sciences, 14220, Prague, Czech Republic
| | - Steven Beelen
- Laboratory for Biocrystallography, KU Leuven, 3000, Leuven, Belgium
| | - Zdeněk Kukačka
- Department of Biochemistry, Charles University, 12800, Prague, Czech Republic
- Institute of Microbiology of the Czech Academy of Sciences, 14220, Prague, Czech Republic
| | | | - Petr Novák
- Department of Biochemistry, Charles University, 12800, Prague, Czech Republic
- Institute of Microbiology of the Czech Academy of Sciences, 14220, Prague, Czech Republic
| | - Sergei V Strelkov
- Laboratory for Biocrystallography, KU Leuven, 3000, Leuven, Belgium.
| |
Collapse
|
50
|
Shor B, Schneidman-Duhovny D. Predicting structures of large protein assemblies using combinatorial assembly algorithm and AlphaFold2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.16.541003. [PMID: 37293053 PMCID: PMC10245790 DOI: 10.1101/2023.05.16.541003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Deep learning models, such as AlphaFold2 and RosettaFold, enable high-accuracy protein structure prediction. However, large protein complexes are still challenging to predict due to their size and the complexity of interactions between multiple subunits. Here we present CombFold, a combinatorial and hierarchical assembly algorithm for predicting structures of large protein complexes utilizing pairwise interactions between subunits predicted by AlphaFold2. CombFold accurately predicted (TM-score > 0.7) 72% of the complexes among the Top-10 predictions in two datasets of 60 large, asymmetric assemblies. Moreover, the structural coverage of predicted complexes was 20% higher compared to corresponding PDB entries. We applied the method on complexes from Complex Portal with known stoichiometry but without known structure and obtained high-confidence predictions. CombFold supports the integration of distance restraints based on crosslinking mass spectrometry and fast enumeration of possible complex stoichiometries. CombFold's high accuracy makes it a promising tool for expanding structural coverage beyond monomeric proteins.
Collapse
Affiliation(s)
- Ben Shor
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Dina Schneidman-Duhovny
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|