1
|
Posani E, Janoš P, Haack D, Toor N, Bonomi M, Magistrato A, Bussi G. Ensemble refinement of mismodeled cryo-EM RNA structures using all-atom simulations. Nat Commun 2025; 16:4549. [PMID: 40379699 PMCID: PMC12084557 DOI: 10.1038/s41467-025-59769-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2024] [Accepted: 05/02/2025] [Indexed: 05/19/2025] Open
Abstract
The advent of single-particle cryogenic electron microscopy (cryo-EM) has enabled near-atomic resolution imaging of large macromolecules, enhancing functional insights. However, current cryo-EM refinement tools condense all single-particle images into a single structure, which can misrepresent highly flexible molecules like RNAs. Here, we combine molecular dynamics simulations with cryo-EM density maps to better account for the structural dynamics of a complex and biologically relevant RNA macromolecule. Namely, using metainference, a Bayesian method, we reconstruct an ensemble of structures of the group II intron ribozyme, which better matches experimental data, and we reveal inaccuracies of single-structure approaches in modeling flexible regions. An analysis of all RNA-containing structures deposited in the Protein Data Bank reveals that this issue affects most cryo-EM structures in the 2.5-4 Å range. Thus, RNA structures determined by cryo-EM require careful handling, and our method may be broadly applicable to other RNA systems.
Collapse
Affiliation(s)
- Elisa Posani
- Scuola Internazionale Superiore di Studi Avanzati (SISSA), Trieste, Italy
| | | | - Daniel Haack
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, USA
| | - Navtej Toor
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, USA
| | - Massimiliano Bonomi
- Institut Pasteur, Université Paris Cité, CNRS UMR 3528, Computational Structural Biology Unit, Paris, France
| | | | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati (SISSA), Trieste, Italy.
| |
Collapse
|
2
|
Mallet V, Rapisarda C, Minoux H, Ovsjanikov M. Finding antibodies in cryo-EM maps with CrAI. Bioinformatics 2025; 41:btaf157. [PMID: 40203077 PMCID: PMC12077295 DOI: 10.1093/bioinformatics/btaf157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 03/31/2025] [Accepted: 04/07/2025] [Indexed: 04/11/2025] Open
Abstract
MOTIVATION Therapeutic antibodies have emerged as a prominent class of new drugs due to their high specificity and their ability to bind to several protein targets. Once an initial antibody has been identified, its design and characteristics are refined using structural information, when it is available. Cryo-EM is currently the most effective method to obtain 3D structures. It relies on well-established methods to process raw data into a 3D map, which may, however, be noisy and contain artifacts. To fully interpret these maps the number, position, and structure of antibodies and other proteins present must be determined. Unfortunately, existing automated methods addressing this step have limited accuracy, require additional inputs and high-resolution maps, and exhibit long running times. RESULTS We propose the first fully automatic and efficient method dedicated to finding antibodies in cryo-EM maps: CrAI. This machine learning approach leverages the conserved structure of antibodies and a dedicated novel database that we built to solve this problem. Running a prediction takes only a few seconds, instead of hours, and requires nothing but the cryo-EM map, seamlessly integrating within automated analysis pipelines. Our method can find the location and pose of both Fabs and VHHs at resolutions up to 10 Å and is significantly more reliable than existing approaches. AVAILABILITY AND IMPLEMENTATION We make our method available both in open source github.com/Sanofi-Public/crai and as a ChimeraX bundle (crai).
Collapse
Affiliation(s)
- Vincent Mallet
- LIX, Ecole Polytechnique, IPP Paris, Palaiseau, 91120, France
| | - Chiara Rapisarda
- Integrated Drug Discovery, Structural Biology and Biophysics, Sanofi, Vitry-sur-Seine, 94400, France
| | - Hervé Minoux
- Integrated Drug Discovery, Structural Biology and Biophysics, Sanofi, Vitry-sur-Seine, 94400, France
| | - Maks Ovsjanikov
- LIX, Ecole Polytechnique, IPP Paris, Palaiseau, 91120, France
| |
Collapse
|
3
|
Stevens A, Kashyap S, Crofut EH, Wang SE, Muratore KA, Johnson PJ, Zhou ZH. Structures of Native Doublet Microtubules from Trichomonas vaginalis Reveal Parasite-Specific Proteins. Nat Commun 2025; 16:3996. [PMID: 40301421 PMCID: PMC12041511 DOI: 10.1038/s41467-025-59369-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Accepted: 04/21/2025] [Indexed: 05/01/2025] Open
Abstract
Doublet microtubules (DMTs) are flagellar components required for the protist Trichomonas vaginalis (Tv) to swim through the human genitourinary tract to cause trichomoniasis, the most common non-viral sexually transmitted disease. Lack of structures of Tv's DMT (Tv-DMT) has prevented structure-guided drug design to manage Tv infection. Here, we determine the 16 nm, 32 nm, 48 nm and 96 nm-repeat structures of native Tv-DMT at resolution ranging from 3.4 to 4.4 Å by cryogenic electron microscopy (cryoEM) and built an atomic model for the entire Tv-DMT. These structures show that Tv-DMT is composed of 30 different proteins, including the α- and β-tubulin, 19 microtubule inner proteins (MIPs) and 9 microtubule outer proteins. While the A-tubule of Tv-DMT is simplistic compared to DMTs of other organisms, the B-tubule of Tv-DMT features parasite-specific proteins, such as TvFAP40 and TvFAP35. Notably, TvFAP40 and TvFAP35 form filaments near the inner and outer junctions, respectively, and interface with stabilizing MIPs. This atomic model of the Tv-DMT highlights diversity of eukaryotic motility machineries and provides a structural framework to inform rational design of therapeutics against trichomoniasis.
Collapse
Affiliation(s)
- Alexander Stevens
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
- California NanoSystems Institute, UCLA, Los Angeles, CA, USA
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, CA, USA
| | - Saarang Kashyap
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
- California NanoSystems Institute, UCLA, Los Angeles, CA, USA
| | - Ethan H Crofut
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
- California NanoSystems Institute, UCLA, Los Angeles, CA, USA
| | - Shuqi E Wang
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Katherine A Muratore
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Patricia J Johnson
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA, USA.
| | - Z Hong Zhou
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA, USA.
- California NanoSystems Institute, UCLA, Los Angeles, CA, USA.
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, CA, USA.
| |
Collapse
|
4
|
Cai Y, Zhang Z, Xu X, Xu L, Chen Y, Zhang G, Zhou X. Fitting Atomic Structures into Cryo-EM Maps by Coupling Deep Learning-Enhanced Map Processing with Global-Local Optimization. J Chem Inf Model 2025; 65:3800-3811. [PMID: 40152222 DOI: 10.1021/acs.jcim.5c00004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2025]
Abstract
With the breakthroughs in protein structure prediction technology, constructing atomic structures from cryo-electron microscopy (cryo-EM) density maps through structural fitting has become increasingly critical. However, the accuracy of the constructed models heavily relies on the precision of the structure-to-map fitting. In this study, we introduce DEMO-EMfit, a progressive method that integrates deep learning-based backbone map extraction with a global-local structural pose search to fit atomic structures into density maps. DEMO-EMfit was extensively evaluated on a benchmark data set comprising both cryo-electron tomography (cryo-ET) and cryo-EM maps of protein and nucleic acid complexes. The results demonstrate that DEMO-EMfit outperforms state-of-the-art approaches, offering an efficient and accurate tool for fitting atomic structures into density maps.
Collapse
Affiliation(s)
- Yaxian Cai
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Ziying Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Xiangyu Xu
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Liang Xu
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Yu Chen
- School of Computer Science and Technology, Soochow University, Suzhou 215006, China
| | - Guijun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Xiaogen Zhou
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| |
Collapse
|
5
|
Tlučková K, Kaczmarek B, Salmazo A, Bernecky C. Mechanism of mammalian transcriptional repression by noncoding RNA. Nat Struct Mol Biol 2025; 32:607-612. [PMID: 39762629 PMCID: PMC11996674 DOI: 10.1038/s41594-024-01448-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 11/08/2024] [Indexed: 04/16/2025]
Abstract
Transcription by RNA polymerase II (Pol II) can be repressed by noncoding RNA, including the human RNA Alu. However, the mechanism by which endogenous RNAs repress transcription remains unclear. Here we present cryogenic-electron microscopy structures of Pol II bound to Alu RNA, which reveal that Alu RNA mimics how DNA and RNA bind to Pol II during transcription elongation. Further, we show how distinct domains of the general transcription factor TFIIF control repressive activity. Together, we reveal how a noncoding RNA can regulate mammalian gene expression.
Collapse
Affiliation(s)
- Katarína Tlučková
- Institute of Science and Technology Austria (ISTA), Klosterneuburg, Austria
| | - Beata Kaczmarek
- Institute of Science and Technology Austria (ISTA), Klosterneuburg, Austria
| | - Anita Salmazo
- Institute of Science and Technology Austria (ISTA), Klosterneuburg, Austria
| | - Carrie Bernecky
- Institute of Science and Technology Austria (ISTA), Klosterneuburg, Austria.
| |
Collapse
|
6
|
Giri N, Chen X, Wang L, Cheng J. A Labeled Dataset for AI-based Cryo-EM Map Enhancement. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.16.643562. [PMID: 40166245 PMCID: PMC11957003 DOI: 10.1101/2025.03.16.643562] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Cryo-electron microscopy (cryo-EM) has transformed structural biology by enabling near-atomic resolution imaging of macromolecular complexes. However, cryo-EM density maps suffer from intrinsic noise arising from structural sources, shot noise, and digital recording, which complicates accurate atomic structure building. While various methods for denoising cryo-EM density maps exist, there is a lack of standardized datasets for benchmarking artificial intelligence (AI) approaches. Here, we present an open-source dataset for cryo-EM density map denoising comprising 650 high-resolution (1-4 Å) experimental maps paired with three types of generated label maps: regression maps capturing idealized density distributions, binary classification maps distinguishing structural elements from background, and atom-type classification maps. Each map is standardized to 1 Å voxel size and validated through Fourier Shell Correlation analysis, demonstrating substantial resolution improvements in label maps compared to experimental maps. This resource bridges the gap between structural biology and artificial intelligence communities, enabling researchers to develop and benchmark innovative methods for enhancing cryo-EM density maps.
Collapse
Affiliation(s)
- Nabin Giri
- University of Missouri, Electrical Engineering and Computer Science, Columbia, 65211, USA
- NextGen Precision Health Institute, Columbia, 65211, USA
| | - Xiao Chen
- Computer Science Department, Hamilton College, Clinton, NY, 13323, USA
| | - Liguo Wang
- Laboratory for Biological Structure, Brookhaven National Laboratory, Upton, NY, 11973, USA
| | - Jianlin Cheng
- University of Missouri, Electrical Engineering and Computer Science, Columbia, 65211, USA
- NextGen Precision Health Institute, Columbia, 65211, USA
| |
Collapse
|
7
|
Liu X, Clemens DL, Lee BY, Aguirre R, Horwitz MA, Zhou ZH. Structure, identification and characterization of the RibD-enolase complex in Francisella. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.02.641097. [PMID: 40093042 PMCID: PMC11908141 DOI: 10.1101/2025.03.02.641097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 03/19/2025]
Abstract
Francisella tularensis is a highly infectious bacterium, a Tier 1-select agent, and the causative agent of tularemia, a potentially fatal zoonotic disease. In this study originally aiming to identify anti-tularemia drug targets, we serendipitously determined the atomic structures and identified their components of the native RibD-enolase protein complex in Francisella novicida; and subsequently systematically characterized the catalytic functions of the RibD-enolase complex. Originally discovered as individually protein in Escherichia coli and yeast, respectively, RibD and enolase are two essential enzymes involved in distinct metabolic pathways, both of which could serve as potential therapeutic targets for tularemia treatment and prevention. Our biochemical validation using pull-down assays confirmed the formation of this complex in vivo, revealing that all eluted RibD is bound to enolase, while the majority of enolase remained uncomplexed. Structural analysis reveals unique features of the Francisella complex, including key RibD-enolase interactions that mediate complex assembly and β-strand swapping between RibD subunits. Furthermore, molecular dynamics simulations of the ligand-bound RibD-enolase complex highlight localized conformational changes within the substrate-binding sites and suggest a gating mechanism between RibD's substrate and cofactor-binding sites to ensure efficient uptake and turnover. Despite the physical association between RibD and enolase, enzymatic assays indicated their catalytic activities are independent of each other, thus the complex may have alternative functional roles that warrant further exploration. Our study provides the first structural and biochemical characterization of the RibD-enolase complex, establishing a foundation for further investigations into its functional significance in Francisella and potential antibacterial development.
Collapse
Affiliation(s)
- Xiaoyu Liu
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA
- The California NanoSystems Institute (CNSI), UCLA, Los Angeles, CA 90095, USA
| | | | - Bai-Yu Lee
- Department of Medicine, UCLA, Los Angeles, CA 90095, USA
| | - Roman Aguirre
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA
- The California NanoSystems Institute (CNSI), UCLA, Los Angeles, CA 90095, USA
- Department of Chemistry and Biochemistry, University of California (UCLA), Los Angeles, CA 90095, USA
| | - Marcus A. Horwitz
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA
- Department of Medicine, UCLA, Los Angeles, CA 90095, USA
| | - Z. Hong Zhou
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA
- The California NanoSystems Institute (CNSI), UCLA, Los Angeles, CA 90095, USA
- Department of Chemistry and Biochemistry, University of California (UCLA), Los Angeles, CA 90095, USA
| |
Collapse
|
8
|
Li S, Terashi G, Zhang Z, Kihara D. Advancing structure modeling from cryo-EM maps with deep learning. Biochem Soc Trans 2025; 53:BST20240784. [PMID: 39927816 DOI: 10.1042/bst20240784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2024] [Revised: 01/16/2025] [Accepted: 01/21/2025] [Indexed: 02/11/2025]
Abstract
Cryo-electron microscopy (cryo-EM) has revolutionized structural biology by enabling the determination of biomolecular structures that are challenging to resolve using conventional methods. Interpreting a cryo-EM map requires accurate modeling of the structures of underlying biomolecules. Here, we concisely discuss the evolution and current state of automatic structure modeling from cryo-EM density maps. We classify modeling methods into two categories: de novo modeling methods from high-resolution maps (better than 5 Å) and methods that model by fitting individual structures of component proteins to maps at lower resolution (worse than 5 Å). Special attention is given to the role of deep learning in the modeling process, highlighting how AI-driven approaches are transformative in cryo-EM structure modeling. We conclude by discussing future directions in the field.
Collapse
Affiliation(s)
- Shu Li
- Department of Computer Science, Purdue University, West Lafayette, IN, U.S.A
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, U.S.A
| | - Zicong Zhang
- Department of Computer Science, Purdue University, West Lafayette, IN, U.S.A
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, U.S.A
- Department of Biological Sciences, Purdue University, West Lafayette, IN, U.S.A
| |
Collapse
|
9
|
Rosignoli S, Pacelli M, Manganiello F, Paiardini A. An outlook on structural biology after AlphaFold: tools, limits and perspectives. FEBS Open Bio 2025; 15:202-222. [PMID: 39313455 PMCID: PMC11788754 DOI: 10.1002/2211-5463.13902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 08/19/2024] [Accepted: 09/13/2024] [Indexed: 09/25/2024] Open
Abstract
AlphaFold and similar groundbreaking, AI-based tools, have revolutionized the field of structural bioinformatics, with their remarkable accuracy in ab-initio protein structure prediction. This success has catalyzed the development of new software and pipelines aimed at incorporating AlphaFold's predictions, often focusing on addressing the algorithm's remaining challenges. Here, we present the current landscape of structural bioinformatics shaped by AlphaFold, and discuss how the field is dynamically responding to this revolution, with new software, methods, and pipelines. While the excitement around AI-based tools led to their widespread application, it is essential to acknowledge that their practical success hinges on their integration into established protocols within structural bioinformatics, often neglected in the context of AI-driven advancements. Indeed, user-driven intervention is still as pivotal in the structure prediction process as in complementing state-of-the-art algorithms with functional and biological knowledge.
Collapse
Affiliation(s)
- Serena Rosignoli
- Department of Biochemical sciences “A. Rossi Fanelli”Sapienza Università di RomaItaly
| | - Maddalena Pacelli
- Department of Biochemical sciences “A. Rossi Fanelli”Sapienza Università di RomaItaly
| | - Francesca Manganiello
- Department of Biochemical sciences “A. Rossi Fanelli”Sapienza Università di RomaItaly
| | - Alessandro Paiardini
- Department of Biochemical sciences “A. Rossi Fanelli”Sapienza Università di RomaItaly
| |
Collapse
|
10
|
Farheen F, Terashi G, Zhu H, Kihara D. AI-based methods for biomolecular structure modeling for Cryo-EM. Curr Opin Struct Biol 2025; 90:102989. [PMID: 39864242 PMCID: PMC11793015 DOI: 10.1016/j.sbi.2025.102989] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Revised: 12/29/2024] [Accepted: 01/04/2025] [Indexed: 01/28/2025]
Abstract
Cryo-electron microscopy (Cryo-EM) has revolutionized structural biology by enabling the determination of macromolecular structures that were challenging to study with conventional methods. Processing cryo-EM data involves several computational steps to derive three-dimensional structures from raw projections. Recent advancements in artificial intelligence (AI) including deep learning have significantly improved the performance of these processes. In this review, we discuss state-of-the-art AI-based techniques used in key steps of cryo-EM data processing, including macromolecular structure modeling and heterogeneity analysis.
Collapse
Affiliation(s)
- Farhanaz Farheen
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Han Zhu
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, USA; Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
11
|
Zhu G, Xi F, Zeng W, Zhao Y, Cao W, Liu C, Yang F, Ru Y, Xiao S, Zhang S, Liu H, Tian H, Yang F, Lu B, Sun S, Song H, Sun B, Zhao X, Tang L, Li K, He J, Guo J, Zhu Y, Zhu Z, Sun F, Zheng H. Structural basis of RNA polymerase complexes in African swine fever virus. Nat Commun 2025; 16:501. [PMID: 39779680 PMCID: PMC11711665 DOI: 10.1038/s41467-024-55683-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Accepted: 12/19/2024] [Indexed: 01/11/2025] Open
Abstract
African swine fever virus is highly contagious and causes a fatal infectious disease in pigs, resulting in a significant global impact on pork supply. The African swine fever virus RNA polymerase serves as a crucial multifunctional protein complex responsible for genome transcription and regulation. Therefore, it is essential to investigate its structural and functional characteristics for the prevention and control of African swine fever. Here, we determine the structures of endogenous African swine fever virus RNA polymerase in both nucleic acid-free and elongation states. The African swine fever virus RNA polymerase shares similarities with the core of typical RNA polymerases, but possesses a distinct subunit M1249L. Notably, the dynamic binding mode of M1249L with RNA polymerase, along with the C-terminal tail insertion of M1249L in the active center of DNA-RNA scaffold binding, suggests the potential of M1249L to regulate RNA polymerase activity within cells. These results are important for understanding the transcription cycle of the African swine fever virus and for developing antiviral strategies.
Collapse
Grants
- the Fundamental Research Funds for the Central Universities (awarded to H.-X. Zheng and F. Yang), the Open Competition Program of Top Ten Critical Priorities of Agricultural Science and Technology Innovation for the 14th Five-Year Plan of Guangdong Province (2024KJ14, awarded to H.-X. Zheng), the China Agriculture Research System of Ministry of Finance and Ministry of Agriculture and Rural Affairs (CARS-35, awarded to H.-X. Zheng), the Project of National Center of Technology Innovation for Pigs (NCTIP-XD/C03, awarded to H.-X. Zheng), the Major Science and Technology Project of Gansu Province (22ZD6NA001 and 22ZD6NA012, awarded to H.-X. Zheng), and the Innovation Program of Chinese Academy of Agricultural Sciences (CAAS-CSLPDCP-202302 and CAAS-ASTIP-2024-LVRI, awarded to H.-X. Zheng)
- the Joint Research Foundation of Gansu Province (24JRRA813, awarded to G.-L. Zhu)
- the National Key R&D Program of China (2021YFD1800100, awarded to Z.-X. Zhu),the Innovation Group of Gansu Province (23JRRA1515, awarded to J.-J. He; 23JRRA546, awarded to Z.-X. Zhu)
Collapse
Affiliation(s)
- Guoliang Zhu
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Fei Xi
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- Department of Preventive Veterinary Medicine, College of Veterinary Medicine, Northeast Agricultural University, Harbin, Heilongjiang, China
| | - Wuxia Zeng
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Yifei Zhao
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Weijun Cao
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Chen Liu
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Fan Yang
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Yi Ru
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Shuqi Xiao
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Shilei Zhang
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Huanan Liu
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Hong Tian
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Fayu Yang
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Biao Lu
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Shukai Sun
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Haiyang Song
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Bozhang Sun
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Xiaoyi Zhao
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Lijie Tang
- Department of Preventive Veterinary Medicine, College of Veterinary Medicine, Northeast Agricultural University, Harbin, Heilongjiang, China
| | - Kangli Li
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Jijun He
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Jianhong Guo
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Yun Zhu
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China.
- School of Life Sciences, University of Chinese Academy of Sciences, Beijing, China.
| | - Zixiang Zhu
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China.
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China.
| | - Fei Sun
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China.
- School of Life Sciences, University of Chinese Academy of Sciences, Beijing, China.
| | - Haixue Zheng
- State Key Laboratory for Animal Disease Control and Prevention, College of Veterinary Medicine, Lanzhou University, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China.
- African Swine Fever Regional Laboratory of China, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China.
- Gansu Province Research Center for Basic Disciplines of Pathogen Biology, Lanzhou, China.
| |
Collapse
|
12
|
Dubianok Y, Kumar A, Rak A. Structural Biology for Target Identification and Validation. Methods Mol Biol 2025; 2905:17-49. [PMID: 40163296 DOI: 10.1007/978-1-0716-4418-8_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Structural biology is catalyzing a paradigm shift in drug discovery towards rational approaches in target identification and validation. Leveraging structural insights obtained through cryo-EM or X-ray crystallography not only enhances the efficiency of drug discovery projects in terms of time and cost, but also significantly improves the likelihood of achieving market approval.Initiating a successful project necessitates more than just a robust package for target credentialing; it demands a comprehensive strategy for the identification and optimization of potential drugs. The critical evaluation of target druggability is markedly enhanced when supported by experimentally derived structural information. This nuanced approach ensures a more thorough understanding of the technical feasibility of drug development from the project's inception.
Collapse
Affiliation(s)
- Yuliya Dubianok
- Sanofi R&D, Bio Structure and Biophysics at Integrated Drug Discovery, Vitry-sur-Seine, France
| | - Anand Kumar
- Sanofi R&D, Bio Structure and Biophysics at Integrated Drug Discovery, Vitry-sur-Seine, France
| | - Alexey Rak
- Sanofi R&D, Bio Structure and Biophysics at Integrated Drug Discovery, Vitry-sur-Seine, France.
| |
Collapse
|
13
|
Punuru P, Jain A, Kihara D. Secondary Structure Detection and Structure Modeling for Cryo-EM. Methods Mol Biol 2025; 2870:341-355. [PMID: 39543043 DOI: 10.1007/978-1-0716-4213-9_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2024]
Abstract
Rapid advancements in cryogenic electron microscopy (cryo-EM) have revolutionized the field of structural biology by enabling the determination of complex macromolecular structures at unprecedented resolutions. When cryo-EM density maps have a resolution around 3 Å, the atomic structure can be modeled manually. However, as the resolution decreases, analyzing these density maps becomes increasingly challenging. For modeling structures in lower resolution maps, deep learning can be used to identify structural features in the maps to assist in structure modeling.Here, we present a suite of deep learning-based tools developed by our lab that enable structural biologists to work with cryo-EM maps of a wide range of resolutions. For cryo-EM maps at near-atomic resolution (5 Å or better), DeepMainmast automatically models all-atom structures by tracing the main chain from local map features of amino acids and atoms detected by deep learning; DAQ score quantifies map-model fit and indicates potential misassignments in protein models. In intermediate resolution maps (5-10 Å), Emap2sec and Emap2sec+ can accurately detect protein secondary structures and nucleic acids. These tools and more are available at our web server: https://em.kiharalab.org/ .
Collapse
Affiliation(s)
- Pranav Punuru
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Anika Jain
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
- Department of Computer Science, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
14
|
Li T, He J, Cao H, Zhang Y, Chen J, Xiao Y, Huang SY. All-atom RNA structure determination from cryo-EM maps. Nat Biotechnol 2025; 43:97-105. [PMID: 38396075 DOI: 10.1038/s41587-024-02149-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Accepted: 01/24/2024] [Indexed: 02/25/2024]
Abstract
Many methods exist for determining protein structures from cryogenic electron microscopy maps, but this remains challenging for RNA structures. Here we developed EMRNA, a method for accurate, automated determination of full-length all-atom RNA structures from cryogenic electron microscopy maps. EMRNA integrates deep learning-based detection of nucleotides, three-dimensional backbone tracing and scoring with consideration of sequence and secondary structure information, and full-atom construction of the RNA structure. We validated EMRNA on 140 diverse RNA maps ranging from 37 to 423 nt at 2.0-6.0 Å resolutions, and compared EMRNA with auto-DRRAFTER, phenix.map_to_model and CryoREAD on a set of 71 cases. EMRNA achieves a median accuracy of 2.36 Å root mean square deviation and 0.86 TM-score for full-length RNA structures, compared with 6.66 Å and 0.58 for auto-DRRAFTER. EMRNA also obtains a high residue coverage and sequence match of 93.30% and 95.30% in the built models, compared with 58.20% and 42.20% for phenix.map_to_model and 56.45% and 52.3% for CryoREAD. EMRNA is fast and can build an RNA structure of 100 nt within 3 min.
Collapse
Affiliation(s)
- Tao Li
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Jiahua He
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Hong Cao
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Yi Zhang
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Ji Chen
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Yi Xiao
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China.
| | - Sheng-You Huang
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China.
| |
Collapse
|
15
|
Luo D, Alsuwaykit Z, Khan D, Strnad O, Isenberg T, Viola I. DiffFit: Visually-Guided Differentiable Fitting of Molecule Structures to a Cryo-EM Map. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2025; 31:558-568. [PMID: 39255135 DOI: 10.1109/tvcg.2024.3456404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
We introduce DiffFit, a differentiable algorithm for fitting protein atomistic structures into an experimental reconstructed Cryo-Electron Microscopy (cryo-EM) volume map. In structural biology, this process is necessary to semi-automatically composite large mesoscale models of complex protein assemblies and complete cellular structures that are based on measured cryo-EM data. The current approaches require manual fitting in three dimensions to start, resulting in approximately aligned structures followed by an automated fine-tuning of the alignment. The DiffFit approach enables domain scientists to fit new structures automatically and visualize the results for inspection and interactive revision. The fitting begins with differentiable three-dimensional (3D) rigid transformations of the protein atom coordinates followed by sampling the density values at the atom coordinates from the target cryo-EM volume. To ensure a meaningful correlation between the sampled densities and the protein structure, we proposed a novel loss function based on a multi-resolution volume-array approach and the exploitation of the negative space. This loss function serves as a critical metric for assessing the fitting quality, ensuring the fitting accuracy and an improved visualization of the results. We assessed the placement quality of DiffFit with several large, realistic datasets and found it to be superior to that of previous methods. We further evaluated our method in two use cases: automating the integration of known composite structures into larger protein complexes and facilitating the fitting of predicted protein domains into volume densities to aid researchers in identifying unknown proteins. We implemented our algorithm as an open-source plugin (github.com/nanovis/DiffFit) in ChimeraX, a leading visualization software in the field. All supplemental materials are available at osf. io/5tx4q.
Collapse
|
16
|
Wang X, Zhu H, Terashi G, Taluja M, Kihara D. DiffModeler: large macromolecular structure modeling for cryo-EM maps using a diffusion model. Nat Methods 2024; 21:2307-2317. [PMID: 39433880 DOI: 10.1038/s41592-024-02479-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Accepted: 09/19/2024] [Indexed: 10/23/2024]
Abstract
Cryogenic electron microscopy (cryo-EM) has now been widely used for determining multichain protein complexes. However, modeling a large complex structure, such as those with more than ten chains, is challenging, particularly when the map resolution decreases. Here we present DiffModeler, a fully automated method for modeling large protein complex structures. DiffModeler employs a diffusion model for backbone tracing and integrates AlphaFold2-predicted single-chain structures for structure fitting. DiffModeler showed an average template modeling score of 0.88 and 0.91 for two datasets of cryo-EM maps of 0-5 Å resolution and 0.92 for intermediate resolution maps (5-10 Å), substantially outperforming existing methodologies. Further benchmarking at low resolutions (10-20 Å) confirms its versatility, demonstrating plausible performance.
Collapse
Affiliation(s)
- Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Han Zhu
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Manav Taluja
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, India
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, USA.
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
17
|
Lander GC. Single particle cryo-EM map and model validation: It's not crystal clear. Curr Opin Struct Biol 2024; 89:102918. [PMID: 39293191 DOI: 10.1016/j.sbi.2024.102918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 08/23/2024] [Accepted: 08/25/2024] [Indexed: 09/20/2024]
Abstract
The application of single particle cryogenic electron microscopy (cryo-EM) to structure determination continues to have a transformative impact on our understanding on biological systems. While there has been a great deal of algorithmic development focused on improving attainable resolutions and streamlining atomic model building, there has not been commensurate development of validation metrics to ensure the accuracy of our cryo-EM maps and models. This review emphasizes the persistent issues that currently complicate single particle cryo-EM structure validation, and highlights the metrics that are gaining broad acceptance by the community. This article aims to underscore the need for further development of validation criteria and the potential role of machine learning methodologies in confidently assessing the quality of cryo-EM structures.
Collapse
Affiliation(s)
- Gabriel C Lander
- Dept. of Integrative Structural and Computational Biology, Scripps Research, La Jolla, CA 92037, USA.
| |
Collapse
|
18
|
Muenks A, Farrell DP, Zhou G, DiMaio F. Automated identification of small molecules in cryo-electron microscopy data with density- and energy-guided evaluation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.20.623795. [PMID: 39605546 PMCID: PMC11601544 DOI: 10.1101/2024.11.20.623795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
Methodological improvements in cryo-electron microscopy (cryoEM) have made it a useful tool in ligand-bound structure determination for biology and drug design. However, determining the conformation and identity of bound ligands is still challenging at the resolutions typical for cryoEM. Automated methods can aid in ligand conformational modeling, but current ligand identification tools - developed for X-ray crystallography data - perform poorly at resolutions common for cryoEM. Here, we present EMERALD-ID, a method capable of docking and evaluating small molecule conformations for ligand identification. EMERALD-ID identifies 43% of common ligands exactly and identifies closely related ligands in 66% of cases. We then use this tool to discover possible ligand identification errors, as well as previously unidentified ligands. Furthermore, we show EMERALD-ID is capable of identifying ligands from custom ligand libraries of various small molecule types, including human metabolites and drug fragments. Our method provides a valuable addition to cryoEM modeling tools to improve small molecule model accuracy and quality.
Collapse
Affiliation(s)
- Andrew Muenks
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Daniel P. Farrell
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Guangfeng Zhou
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lead contact
| |
Collapse
|
19
|
Wang N, Xing J, Su X, Pan J, Chen H, Shi L, Si L, Yang W, Li M. Architecture of the ATP-driven motor for protein import into chloroplasts. MOLECULAR PLANT 2024; 17:1702-1718. [PMID: 39327731 DOI: 10.1016/j.molp.2024.09.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Revised: 09/21/2024] [Accepted: 09/23/2024] [Indexed: 09/28/2024]
Abstract
Thousands of nuclear-encoded proteins are transported into chloroplasts through the TOC-TIC translocon that spans the chloroplast envelope membranes. A motor complex pulls the translocated proteins out of the TOC-TIC complex into the chloroplast stroma by hydrolyzing ATP. The Orf2971-FtsHi complex has been suggested to serve as the ATP-hydrolyzing motor in Chlamydomonas reinhardtii, but little is known about its architecture and assembly. Here, we report the 3.2-Å resolution structure of the Chlamydomonas Orf2971-FtsHi complex. The 20-subunit complex spans the chloroplast inner envelope, with two bulky modules protruding into the intermembrane space and stromal matrix. Six subunits form a hetero-hexamer that potentially provides the pulling force through ATP hydrolysis. The remaining subunits, including potential enzymes/chaperones, likely facilitate the complex assembly and regulate its proper function. Taken together, our results provide the structural foundation for a mechanistic understanding of chloroplast protein translocation.
Collapse
Affiliation(s)
- Ning Wang
- Key Laboratory of Biomacromolecules (CAS), National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Jiale Xing
- Photosynthesis Research Center, Key Laboratory of Photobiology, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Xiaodong Su
- Key Laboratory of Biomacromolecules (CAS), National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing, China
| | - Junting Pan
- Photosynthesis Research Center, Key Laboratory of Photobiology, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; University of Chinese Academy of Sciences, Beijing, China; China National Botanical Garden, Beijing 100093, China; Institute of Botany, Chinese Academy of Sciences, Beijing, China
| | - Hui Chen
- Key Laboratory of Epigenetic Regulation and Intervention, Chinese Academy of Sciences Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai 200031, China
| | - Lifang Shi
- Key Laboratory of Biomacromolecules (CAS), National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Long Si
- Key Laboratory of Biomacromolecules (CAS), National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing, China
| | - Wenqiang Yang
- Photosynthesis Research Center, Key Laboratory of Photobiology, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; University of Chinese Academy of Sciences, Beijing, China; China National Botanical Garden, Beijing 100093, China; Institute of Botany, Chinese Academy of Sciences, Beijing, China.
| | - Mei Li
- Key Laboratory of Biomacromolecules (CAS), National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
20
|
Li T, Cao H, He J, Huang SY. Automated detection and de novo structure modeling of nucleic acids from cryo-EM maps. Nat Commun 2024; 15:9367. [PMID: 39477926 PMCID: PMC11525807 DOI: 10.1038/s41467-024-53721-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Accepted: 10/18/2024] [Indexed: 11/02/2024] Open
Abstract
Cryo-electron microscopy (cryo-EM) is one of the most powerful experimental methods for macromolecular structure determination. However, accurate DNA/RNA structure modeling from cryo-EM maps is still challenging especially for protein-DNA/RNA or multi-chain DNA/RNA complexes. Here we propose a deep learning-based method for accurate de novo structure determination of DNA/RNA from cryo-EM maps at <5 Å resolutions, which is referred to as EM2NA. EM2NA is extensively evaluated on a diverse test set of 50 experimental maps at 2.0-5.0 Å resolutions, and compared with state-of-the-art methods including CryoREAD, ModelAngelo, and phenix.map_to_model. On average, EM2NA achieves a residue coverage of 83.15%, C4' RMSD of 1.06 Å, and sequence recall of 46.86%, which outperforms the existing methods. Moreover, EM2NA is applied to build the DNA/RNA structures with 10 to 5347 nt from an EMDB-wide data set of 263 unmodeled raw maps, demonstrating its ability in the blind model building of DNA/RNA from cryo-EM maps. EM2NA is fast and can normally build a DNA/RNA structure of <500 nt within 10 minutes.
Collapse
Affiliation(s)
- Tao Li
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Hong Cao
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Jiahua He
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Sheng-You Huang
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China.
| |
Collapse
|
21
|
Chen S, Zhang S, Fang X, Lin L, Zhao H, Yang Y. Protein complex structure modeling by cross-modal alignment between cryo-EM maps and protein sequences. Nat Commun 2024; 15:8808. [PMID: 39394203 PMCID: PMC11470027 DOI: 10.1038/s41467-024-53116-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 10/02/2024] [Indexed: 10/13/2024] Open
Abstract
Cryo-electron microscopy (cryo-EM) technique is widely used for protein structure determination. Current automatic cryo-EM protein complex modeling methods mostly rely on prior chain separation. However, chain separation without sequence guidance often suffers from errors caused by cross-chain interaction or noise densities, which would accumulate and mislead the subsequent steps. Here, we present EModelX, a fully automated cryo-EM protein complex structure modeling method, which achieves sequence-guiding modeling through cross-modal alignments between cryo-EM maps and protein sequences. EModelX first employs multi-task deep learning to predict Cα atoms, backbone atoms, and amino acid types from cryo-EM maps, which is subsequently used to sample Cα traces with amino acid profiles. The profiles are then aligned with protein sequences to obtain initial structural models, which yielded an average RMSD of 1.17 Å in our test set, approaching atomic-level precision in recovering PDB-deposited structures. After filling unmodeled gaps through sequence-guiding Cα threading, the final models achieved an average TM-score of 0.808, outperforming the state-of-the-art method. The further combination with AlphaFold can improve the average TM-score to 0.911. Analyzes conducted by comparing some EModelX-built models and PDB structures highlight its potential to improve PDB structures. EModelX is accessible at https://bio-web1.nscc-gz.cn/app/EModelX .
Collapse
Affiliation(s)
- Sheng Chen
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Sen Zhang
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Xiaoyu Fang
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Liang Lin
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Huiying Zhao
- Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China
| | - Yuedong Yang
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
22
|
Dialpuri J, Agirre J, Cowtan K, Bond P. NucleoFind: a deep-learning network for interpreting nucleic acid electron density. Nucleic Acids Res 2024; 52:e84. [PMID: 39162213 PMCID: PMC11417358 DOI: 10.1093/nar/gkae715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 07/31/2024] [Accepted: 08/06/2024] [Indexed: 08/21/2024] Open
Abstract
Nucleic acid electron density interpretation after phasing by molecular replacement or other methods remains a difficult problem for computer programs to deal with. Programs tend to rely on time-consuming and computationally exhaustive searches to recognise characteristic features. We present NucleoFind, a deep-learning-based approach to interpreting and segmenting electron density. Using an electron density map from X-ray crystallography obtained after molecular replacement, the positions of the phosphate group, sugar ring and nitrogenous base group can be predicted with high accuracy. On average, 78% of phosphate atoms, 85% of sugar atoms and 83% of base atoms are positioned in predicted density after giving NucleoFind maps produced following successful molecular replacement. NucleoFind can use the wealth of context these predicted maps provide to build more accurate and complete nucleic acid models automatically.
Collapse
Affiliation(s)
- Jordan S Dialpuri
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Kathryn D Cowtan
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Paul S Bond
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| |
Collapse
|
23
|
Sharma KD, Doktorova M, Waxham MN, Heberle FA. Cryo-EM images of phase-separated lipid bilayer vesicles analyzed with a machine-learning approach. Biophys J 2024; 123:2877-2891. [PMID: 38689500 PMCID: PMC11393711 DOI: 10.1016/j.bpj.2024.04.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 03/08/2024] [Accepted: 04/26/2024] [Indexed: 05/02/2024] Open
Abstract
Lateral lipid heterogeneity (i.e., raft formation) in biomembranes plays a functional role in living cells. Three-component mixtures of low- and high-melting lipids plus cholesterol offer a simplified experimental model for raft domains in which a liquid-disordered (Ld) phase coexists with a liquid-ordered (Lo) phase. Using such models, we recently showed that cryogenic electron microscopy (cryo-EM) can detect phase separation in lipid vesicles based on differences in bilayer thickness. However, the considerable noise within cryo-EM data poses a significant challenge for accurately determining the membrane phase state at high spatial resolution. To this end, we have developed an image-processing pipeline that utilizes machine learning (ML) to predict the bilayer phase in projection images of lipid vesicles. Importantly, the ML method exploits differences in both the thickness and molecular density of Lo compared to Ld, which leads to improved phase identification. To assess accuracy, we used artificial images of phase-separated lipid vesicles generated from all-atom molecular dynamics simulations of Lo and Ld phases. Synthetic ground-truth data sets mimicking a series of compositions along a tieline of Ld + Lo coexistence were created and then analyzed with various ML models. For all tieline compositions, we find that the ML approach can correctly identify the bilayer phase with >90% accuracy, thus providing a means to isolate the intensity profiles of coexisting Ld and Lo phases, as well as accurately determine domain-size distributions, number of domains, and phase-area fractions. The method described here provides a framework for characterizing nanoscopic lateral heterogeneities in membranes and paves the way for a more detailed understanding of raft properties in biological contexts.
Collapse
Affiliation(s)
- Karan D Sharma
- Department of Chemistry, University of Tennessee, Knoxville, Tennessee
| | - Milka Doktorova
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia
| | - M Neal Waxham
- Department of Neurobiology and Anatomy, University of Texas Health Science Center, Houston, Texas
| | | |
Collapse
|
24
|
Stevens A, Kashyap S, Crofut EH, Wang SE, Muratore KA, Johnson PJ, Zhou ZH. Structures of Native Doublet Microtubules from Trichomonas vaginalis Reveal Parasite-Specific Proteins as Potential Drug Targets. RESEARCH SQUARE 2024:rs.3.rs-4632384. [PMID: 39281863 PMCID: PMC11398567 DOI: 10.21203/rs.3.rs-4632384/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/18/2024]
Abstract
Doublet microtubules (DMTs) are flagellar components required for the protist Trichomonas vaginalis (Tv) to swim through the human genitourinary tract to cause trichomoniasis, the most common non-viral sexually transmitted disease. Lack of DMT structures has prevented structure-guided drug design to manage Tv infection. Here, we determined the cryo-EM structure of native Tv-DMTs, identifying 29 unique proteins, including 18 microtubule inner proteins and 9 microtubule outer proteins. While the A-tubule is simplistic compared to DMTs of other organisms, the B-tubule features specialized, parasite-specific proteins, such as TvFAP40 and TvFAP35 that form filaments near the inner and outer junctions, respectively, to stabilize DMTs and enable Tv locomotion. Notably, a small molecule, assigned as IP6, is coordinated within a pocket of TvFAP40 and has characteristics of a drug molecule. This first atomic model of the Tv-DMT highlights the diversity of eukaryotic motility machinery and provides a structural framework to inform rational design of therapeutics.
Collapse
Affiliation(s)
- Alexander Stevens
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles, Los Angeles, CA 90095, USA
- California NanoSystems Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Saarang Kashyap
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles, Los Angeles, CA 90095, USA
- California NanoSystems Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Ethan H. Crofut
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles, Los Angeles, CA 90095, USA
- California NanoSystems Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Shuqi E. Wang
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Katherine A. Muratore
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Patricia J. Johnson
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Z. Hong Zhou
- Department of Microbiology, Immunology & Molecular Genetics, University of California, Los Angeles, Los Angeles, CA 90095, USA
- California NanoSystems Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA 90095, USA
| |
Collapse
|
25
|
Gao J, Tong M, Lee C, Gaertig J, Legal T, Bui KH. DomainFit: Identification of protein domains in cryo-EM maps at intermediate resolution using AlphaFold2-predicted models. Structure 2024; 32:1248-1259.e5. [PMID: 38754431 PMCID: PMC11316655 DOI: 10.1016/j.str.2024.04.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 03/18/2024] [Accepted: 04/19/2024] [Indexed: 05/18/2024]
Abstract
Cryoelectron microscopy (cryo-EM) has revolutionized the structural determination of macromolecular complexes. With the paradigm shift to structure determination of highly complex endogenous macromolecular complexes ex vivo and in situ structural biology, there are an increasing number of structures of native complexes. These complexes often contain unidentified proteins, related to different cellular states or processes. Identifying proteins at resolutions lower than 4 Å remains challenging because side chains cannot be visualized reliably. Here, we present DomainFit, a program for semi-automated domain-level protein identification from cryo-EM maps, particularly at resolutions lower than 4 Å. By fitting domains from AlphaFold2-predicted models into cryo-EM maps, the program performs statistical analyses and attempts to identify the domains and protein candidates forming the density. Using DomainFit, we identified two microtubule inner proteins, one of which contains a CCDC81 domain and is exclusively localized in the proximal region of the doublet microtubule in Tetrahymena thermophila.
Collapse
Affiliation(s)
- Jerry Gao
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Montréal, QC H3A 0C7, Canada; Centre de recherche en biologie structurale, McGill University, Montréal, QC H3G 0B1, Canada
| | - Maxwell Tong
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Montréal, QC H3A 0C7, Canada; Centre de recherche en biologie structurale, McGill University, Montréal, QC H3G 0B1, Canada
| | - Chinkyu Lee
- Department of Cellular Biology, University of Georgia, Athens 30602-2607, GA, USA
| | - Jacek Gaertig
- Department of Cellular Biology, University of Georgia, Athens 30602-2607, GA, USA
| | - Thibault Legal
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Montréal, QC H3A 0C7, Canada; Centre de recherche en biologie structurale, McGill University, Montréal, QC H3G 0B1, Canada.
| | - Khanh Huy Bui
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Montréal, QC H3A 0C7, Canada; Centre de recherche en biologie structurale, McGill University, Montréal, QC H3G 0B1, Canada.
| |
Collapse
|
26
|
Kyrilis FL, Low JKK, Mackay JP, Kastritis PL. Structural biology in cellulo: Minding the gap between conceptualization and realization. Curr Opin Struct Biol 2024; 87:102843. [PMID: 38788606 DOI: 10.1016/j.sbi.2024.102843] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 04/30/2024] [Accepted: 05/01/2024] [Indexed: 05/26/2024]
Abstract
Recent technological advances have deepened our perception of cellular structure. However, most structural data doesn't originate from intact cells, limiting our understanding of cellular processes. Here, we discuss current and future developments that will bring us towards a structural picture of the cell. Electron cryotomography is the standard bearer, with its ability to provide in cellulo snapshots. Single-particle electron microscopy (of purified biomolecules and of complex mixtures) and covalent crosslinking combined with mass spectrometry also have significant roles to play, as do artificial intelligence algorithms in their many guises. To integrate these multiple approaches, data curation and standardisation will be critical - as is the need to expand efforts beyond our current protein-centric view to the other (macro)molecules that sustain life.
Collapse
Affiliation(s)
- Fotis L Kyrilis
- Institute of Chemical Biology, National Hellenic Research Foundation, Athens, Greece. https://twitter.com/Fotansky_16
| | - Jason K K Low
- School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia
| | - Joel P Mackay
- School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia.
| | - Panagiotis L Kastritis
- Institute of Chemical Biology, National Hellenic Research Foundation, Athens, Greece; Interdisciplinary Research Center HALOmem, Charles Tanford Protein Center, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3a, Halle/Saale, Germany; Institute of Biochemistry and Biotechnology, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3, Halle/Saale, Germany; Biozentrum, Martin Luther University Halle-Wittenberg, Weinbergweg 22, Halle/Saale, Germany.
| |
Collapse
|
27
|
Nestl BM, Nebel BA, Resch V, Schürmann M, Tischler D. The Development and Opportunities of Predictive Biotechnology. Chembiochem 2024; 25:e202300863. [PMID: 38713151 DOI: 10.1002/cbic.202300863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 04/05/2024] [Indexed: 05/08/2024]
Abstract
Recent advances in bioeconomy allow a holistic view of existing and new process chains and enable novel production routines continuously advanced by academia and industry. All this progress benefits from a growing number of prediction tools that have found their way into the field. For example, automated genome annotations, tools for building model structures of proteins, and structural protein prediction methods such as AlphaFold2TM or RoseTTAFold have gained popularity in recent years. Recently, it has become apparent that more and more AI-based tools are being developed and used for biocatalysis and biotechnology. This is an excellent opportunity for academia and industry to accelerate advancements in the field further. Biotechnology, as a rapidly growing interdisciplinary field, stands to benefit greatly from these developments.
Collapse
Affiliation(s)
- Bettina M Nestl
- Joint working group on biotransformations of the Association for General and Applied Microbiology VAAM, the Society for Chemical Engineering, Biotechnology DECHEMA, Theodor-Heuss-Allee 25, 60486, Frankfurt, Germany
- Innophore GmbH, Am Eisernen Tor 3, 8010, Graz, Austria
| | - Bernd A Nebel
- Innophore GmbH, Am Eisernen Tor 3, 8010, Graz, Austria
| | - Verena Resch
- Innophore GmbH, Am Eisernen Tor 3, 8010, Graz, Austria
| | - Martin Schürmann
- Joint working group on biotransformations of the Association for General and Applied Microbiology VAAM, the Society for Chemical Engineering, Biotechnology DECHEMA, Theodor-Heuss-Allee 25, 60486, Frankfurt, Germany
- InnoSyn B. V., Urmonderbaan 22, 6167 RD, Geleen, The Netherlands
- SynSilico B. V., Urmonderbaan 22, 6167 RD, Geleen, The Netherlands
| | - Dirk Tischler
- Joint working group on biotransformations of the Association for General and Applied Microbiology VAAM, the Society for Chemical Engineering, Biotechnology DECHEMA, Theodor-Heuss-Allee 25, 60486, Frankfurt, Germany
- Microbial Biotechnology, Ruhr University Bochum, Universitätsstrasse 150, 44780, Bochum, Germany
| |
Collapse
|
28
|
Song X, Bao L, Feng C, Huang Q, Zhang F, Gao X, Han R. Accurate Prediction of Protein Structural Flexibility by Deep Learning Integrating Intricate Atomic Structures and Cryo-EM Density Information. Nat Commun 2024; 15:5538. [PMID: 38956032 PMCID: PMC11219796 DOI: 10.1038/s41467-024-49858-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Accepted: 06/20/2024] [Indexed: 07/04/2024] Open
Abstract
The dynamics of proteins are crucial for understanding their mechanisms. However, computationally predicting protein dynamic information has proven challenging. Here, we propose a neural network model, RMSF-net, which outperforms previous methods and produces the best results in a large-scale protein dynamics dataset; this model can accurately infer the dynamic information of a protein in only a few seconds. By learning effectively from experimental protein structure data and cryo-electron microscopy (cryo-EM) data integration, our approach is able to accurately identify the interactive bidirectional constraints and supervision between cryo-EM maps and PDB models in maximizing the dynamic prediction efficacy. Rigorous 5-fold cross-validation on the dataset demonstrates that RMSF-net achieves test correlation coefficients of 0.746 ± 0.127 at the voxel level and 0.765 ± 0.109 at the residue level, showcasing its ability to deliver dynamic predictions closely approximating molecular dynamics simulations. Additionally, it offers real-time dynamic inference with minimal storage overhead on the order of megabytes. RMSF-net is a freely accessible tool and is anticipated to play an essential role in the study of protein dynamics.
Collapse
Affiliation(s)
- Xintao Song
- Research Center for Mathematics and Interdisciplinary Sciences (Ministry of Education Frontiers Science Center for Nonlinear Expectations), Shandong University, Qingdao, China
- BioMap Research, Menlo Park, CA, USA
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| | - Lei Bao
- School of Public Health, Hubei University of Medicine, Shiyan, China
| | - Chenjie Feng
- College of Medical Information and Engineering, Ningxia Medical University, Yinchuan, China
| | - Qiang Huang
- Research Center for Mathematics and Interdisciplinary Sciences (Ministry of Education Frontiers Science Center for Nonlinear Expectations), Shandong University, Qingdao, China
| | - Fa Zhang
- School of Medical Technology, Beijing Institute of Technology, Beijing, China.
| | - Xin Gao
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia.
| | - Renmin Han
- Research Center for Mathematics and Interdisciplinary Sciences (Ministry of Education Frontiers Science Center for Nonlinear Expectations), Shandong University, Qingdao, China.
- BioMap Research, Menlo Park, CA, USA.
| |
Collapse
|
29
|
Lawson CL, Kryshtafovych A, Pintilie GD, Burley SK, Černý J, Chen VB, Emsley P, Gobbi A, Joachimiak A, Noreng S, Prisant MG, Read RJ, Richardson JS, Rohou AL, Schneider B, Sellers BD, Shao C, Sourial E, Williams CI, Williams CJ, Yang Y, Abbaraju V, Afonine PV, Baker ML, Bond PS, Blundell TL, Burnley T, Campbell A, Cao R, Cheng J, Chojnowski G, Cowtan KD, DiMaio F, Esmaeeli R, Giri N, Grubmüller H, Hoh SW, Hou J, Hryc CF, Hunte C, Igaev M, Joseph AP, Kao WC, Kihara D, Kumar D, Lang L, Lin S, Maddhuri Venkata Subramaniya SR, Mittal S, Mondal A, Moriarty NW, Muenks A, Murshudov GN, Nicholls RA, Olek M, Palmer CM, Perez A, Pohjolainen E, Pothula KR, Rowley CN, Sarkar D, Schäfer LU, Schlicksup CJ, Schröder GF, Shekhar M, Si D, Singharoy A, Sobolev OV, Terashi G, Vaiana AC, Vedithi SC, Verburgt J, Wang X, Warshamanage R, Winn MD, Weyand S, Yamashita K, Zhao M, Schmid MF, Berman HM, Chiu W. Outcomes of the EMDataResource cryo-EM Ligand Modeling Challenge. Nat Methods 2024; 21:1340-1348. [PMID: 38918604 PMCID: PMC11526832 DOI: 10.1038/s41592-024-02321-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2024] [Accepted: 05/24/2024] [Indexed: 06/27/2024]
Abstract
The EMDataResource Ligand Model Challenge aimed to assess the reliability and reproducibility of modeling ligands bound to protein and protein-nucleic acid complexes in cryogenic electron microscopy (cryo-EM) maps determined at near-atomic (1.9-2.5 Å) resolution. Three published maps were selected as targets: Escherichia coli beta-galactosidase with inhibitor, SARS-CoV-2 virus RNA-dependent RNA polymerase with covalently bound nucleotide analog and SARS-CoV-2 virus ion channel ORF3a with bound lipid. Sixty-one models were submitted from 17 independent research groups, each with supporting workflow details. The quality of submitted ligand models and surrounding atoms were analyzed by visual inspection and quantification of local map quality, model-to-map fit, geometry, energetics and contact scores. A composite rather than a single score was needed to assess macromolecule+ligand model quality. These observations lead us to recommend best practices for assessing cryo-EM structures of liganded macromolecules reported at near-atomic resolution.
Collapse
Affiliation(s)
- Catherine L Lawson
- RCSB Protein Data Bank and Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA.
| | | | - Grigore D Pintilie
- Departments of Bioengineering and of Microbiology and Immunology, Stanford University, Stanford, CA, USA
| | - Stephen K Burley
- RCSB Protein Data Bank and Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Rutgers Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
- RCSB Protein Data Bank and San Diego Supercomputer Center, University of California San Diego, La Jolla, CA, USA
| | - Jiří Černý
- Institute of Biotechnology, Czech Academy of Sciences, Vestec, Czech Republic
| | - Vincent B Chen
- Department of Biochemistry, Duke University, Durham, NC, USA
| | - Paul Emsley
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Alberto Gobbi
- Discovery Chemistry, Genentech Inc., San Francisco, CA, USA
- , Berlin, Germany
| | - Andrzej Joachimiak
- Structural Biology Center, X-ray Science Division, Argonne National Laboratory, Argonne, IL, USA
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, USA
| | - Sigrid Noreng
- Structural Biology, Genentech Inc., South San Francisco, CA, USA
- Protein Science, Septerna, South San Francisco, CA, USA
| | | | - Randy J Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | | | - Alexis L Rohou
- Structural Biology, Genentech Inc., South San Francisco, CA, USA
| | - Bohdan Schneider
- Institute of Biotechnology, Czech Academy of Sciences, Vestec, Czech Republic
| | - Benjamin D Sellers
- Discovery Chemistry, Genentech Inc., San Francisco, CA, USA
- Computational Chemistry, Vilya, South San Francisco, CA, USA
| | - Chenghua Shao
- RCSB Protein Data Bank and Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | | | | | | | - Ying Yang
- Structural Biology, Genentech Inc., South San Francisco, CA, USA
| | - Venkat Abbaraju
- RCSB Protein Data Bank and Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Pavel V Afonine
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Matthew L Baker
- Department of Biochemistry and Molecular Biology, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Paul S Bond
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Tom L Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Tom Burnley
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Arthur Campbell
- Center for Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Renzhi Cao
- Department of Computer Science, Pacific Lutheran University, Tacoma, WA, USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | | | - K D Cowtan
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Frank DiMaio
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Reza Esmaeeli
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Nabin Giri
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | - Helmut Grubmüller
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Soon Wen Hoh
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Jie Hou
- Department of Computer Science, Saint Louis University, St. Louis, MO, USA
| | - Corey F Hryc
- Department of Biochemistry and Molecular Biology, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Carola Hunte
- Institute of Biochemistry and Molecular Biology, ZBMZ, Faculty of Medicine and CIBSS-Centre for Integrative Biological Signalling Studies, University of Freiburg, Freiburg, Germany
| | - Maxim Igaev
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Agnel P Joseph
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Wei-Chun Kao
- Institute of Biochemistry and Molecular Biology, ZBMZ, Faculty of Medicine and CIBSS-Centre for Integrative Biological Signalling Studies, University of Freiburg, Freiburg, Germany
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Dilip Kumar
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX, USA
- Trivedi School of Biosciences, Ashoka University, Sonipat, India
| | - Lijun Lang
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
- The Chinese University of Hong Kong, Hong Kong, China
| | - Sean Lin
- Division of Computing & Software Systems, University of Washington, Bothell, WA, USA
| | | | - Sumit Mittal
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
- School of Advanced Sciences and Languages, VIT Bhopal University, Bhopal, India
| | - Arup Mondal
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
- National Renewable Energy Laboratory (NREL), Golden, CO, USA
| | - Nigel W Moriarty
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Andrew Muenks
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA, USA
| | | | - Robert A Nicholls
- MRC Laboratory of Molecular Biology, Cambridge, UK
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Mateusz Olek
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
- Electron Bio-Imaging Centre, Diamond Light Source, Harwell Science and Innovation Campus, Didcot, UK
| | - Colin M Palmer
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Alberto Perez
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Emmi Pohjolainen
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Karunakar R Pothula
- Institute of Biological Information Processing (IBI-7, Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
| | | | - Daipayan Sarkar
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
- MSU-DOE Plant Research Laboratory, East Lansing, MI, USA
- School of Molecular Sciences, Arizona State University, Tempe, AZ, USA
| | - Luisa U Schäfer
- Institute of Biological Information Processing (IBI-7, Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
| | - Christopher J Schlicksup
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Gunnar F Schröder
- Institute of Biological Information Processing (IBI-7, Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
- Physics Department, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Mrinal Shekhar
- Center for Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Dong Si
- Division of Computing & Software Systems, University of Washington, Bothell, WA, USA
| | | | - Oleg V Sobolev
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Andrea C Vaiana
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
- Nature's Toolbox (NTx), Rio Rancho, NM, USA
| | | | - Jacob Verburgt
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | | | - Martyn D Winn
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Simone Weyand
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | | | - Minglei Zhao
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, USA
| | - Michael F Schmid
- Division of Cryo-EM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
| | - Helen M Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Wah Chiu
- Departments of Bioengineering and of Microbiology and Immunology, Stanford University, Stanford, CA, USA.
- Division of Cryo-EM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Menlo Park, CA, USA.
| |
Collapse
|
30
|
Giri N, Cheng J. De novo atomic protein structure modeling for cryoEM density maps using 3D transformer and HMM. Nat Commun 2024; 15:5511. [PMID: 38951555 PMCID: PMC11217428 DOI: 10.1038/s41467-024-49647-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 06/13/2024] [Indexed: 07/03/2024] Open
Abstract
Accurately building 3D atomic structures from cryo-EM density maps is a crucial step in cryo-EM-based protein structure determination. Converting density maps into 3D atomic structures for proteins lacking accurate homologous or predicted structures as templates remains a significant challenge. Here, we introduce Cryo2Struct, a fully automated de novo cryo-EM structure modeling method. Cryo2Struct utilizes a 3D transformer to identify atoms and amino acid types in cryo-EM density maps, followed by an innovative Hidden Markov Model (HMM) to connect predicted atoms and build protein backbone structures. Cryo2Struct produces substantially more accurate and complete protein structural models than the widely used ab initio method Phenix. Additionally, its performance in building atomic structural models is robust against changes in the resolution of density maps and the size of protein structures.
Collapse
Affiliation(s)
- Nabin Giri
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
- Roy Blunt NextGen Precision Health, University of Missouri, Columbia, MO, USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA.
- Roy Blunt NextGen Precision Health, University of Missouri, Columbia, MO, USA.
| |
Collapse
|
31
|
Wang Y, Qiu L, Wang B, Guan Z, Dong Z, Zhang J, Cao S, Yang L, Wang B, Gong Z, Zhang L, Ma W, Liu Z, Zhang D, Wang G, Yin P. Structural basis for odorant recognition of the insect odorant receptor OR-Orco heterocomplex. Science 2024; 384:1453-1460. [PMID: 38870272 DOI: 10.1126/science.adn6881] [Citation(s) in RCA: 33] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 05/29/2024] [Indexed: 06/15/2024]
Abstract
Insects detect and discriminate a diverse array of chemicals using odorant receptors (ORs), which are ligand-gated ion channels comprising a divergent odorant-sensing OR and a conserved odorant receptor co-receptor (Orco). In this work, we report structures of the ApOR5-Orco heterocomplex from the pea aphid Acyrthosiphon pisum alone and bound to its known activating ligand, geranyl acetate. In these structures, three ApOrco subunits serve as scaffold components that cannot bind the ligand and remain relatively unchanged. Upon ligand binding, the pore-forming helix S7b of ApOR5 shifts outward from the central pore axis, causing an asymmetrical pore opening for ion influx. Our study provides insights into odorant recognition and channel gating of the OR-Orco heterocomplex and offers structural resources to support development of innovative insecticides and repellents for pest control.
Collapse
Affiliation(s)
- Yidong Wang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Synthetic Biology Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| | - Liang Qiu
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| | - Bing Wang
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Zeyuan Guan
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| | - Zhi Dong
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| | - Jie Zhang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| | - Song Cao
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Synthetic Biology Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Lulu Yang
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Bo Wang
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Zhou Gong
- Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan 430071, China
| | - Liwei Zhang
- College of Grassland Science and Technology, China Agricultural University, Beijing 100193, China
| | - Weihua Ma
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| | - Zhu Liu
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| | - Delin Zhang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| | - Guirong Wang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Synthetic Biology Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Ping Yin
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
32
|
Stevens A, Kashyap S, Crofut EH, Wang SE, Muratore KA, Johnson PJ, Zhou ZH. Structures of Native Doublet Microtubules from Trichomonas vaginalis Reveal Parasite-Specific Proteins as Potential Drug Targets. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.11.598142. [PMID: 38915691 PMCID: PMC11195118 DOI: 10.1101/2024.06.11.598142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Doublet microtubules (DMTs) are flagellar components required for the protist Trichomonas vaginalis ( Tv ) to swim through the human genitourinary tract to cause trichomoniasis, the most common non-viral sexually transmitted disease. Lack of DMT structures has prevented structure-guided drug design to manage Tv infection. Here, we determined the cryo-EM structure of native Tv- DMTs, identifying 29 unique proteins, including 18 microtubule inner proteins and 9 microtubule outer proteins. While the A-tubule is simplistic compared to DMTs of other organisms, the B-tubule features specialized, parasite-specific proteins, like Tv FAP40 and Tv FAP35 that form filaments near the inner and outer junctions, respectively, to stabilize DMTs and enable Tv locomotion. Notably, a small molecule, assigned as IP6, is coordinated within a pocket of Tv FAP40 and has characteristics of a drug molecule. This first atomic model of the Tv -DMT highlights the diversity of eukaryotic motility machinery and provides a structural framework to inform the rational design of therapeutics.
Collapse
|
33
|
Kellogg GE. Three-Dimensional Interaction Homology: Deconstructing Residue-Residue and Residue-Lipid Interactions in Membrane Proteins. Molecules 2024; 29:2838. [PMID: 38930903 PMCID: PMC11207109 DOI: 10.3390/molecules29122838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Revised: 06/09/2024] [Accepted: 06/10/2024] [Indexed: 06/28/2024] Open
Abstract
A method is described to deconstruct the network of hydropathic interactions within and between a protein's sidechain and its environment into residue-based three-dimensional maps. These maps encode favorable and unfavorable hydrophobic and polar interactions, in terms of spatial positions for optimal interactions, relative interaction strength, as well as character. In addition, these maps are backbone angle-dependent. After map calculation and clustering, a finite number of unique residue sidechain interaction maps exist for each backbone conformation, with the number related to the residue's size and interaction complexity. Structures for soluble proteins (~749,000 residues) and membrane proteins (~387,000 residues) were analyzed, with the latter group being subdivided into three subsets related to the residue's position in the membrane protein: soluble domain, core-facing transmembrane domain, and lipid-facing transmembrane domain. This work suggests that maps representing residue types and their backbone conformation can be reassembled to optimize the medium-to-high resolution details of a protein structure. In particular, the information encoded in maps constructed from the lipid-facing transmembrane residues appears to paint a clear picture of the protein-lipid interactions that are difficult to obtain experimentally.
Collapse
Affiliation(s)
- Glen E Kellogg
- Department of Medicinal Chemistry, Virginia Commonwealth University, Richmond, VA 23298-0540, USA
| |
Collapse
|
34
|
Liu J, Eastep GN, Cvirkaite-Krupovic V, Rich-New ST, Kreutzberger MAB, Egelman EH, Krupovic M, Wang F. Two distinct archaeal type IV pili structures formed by proteins with identical sequence. Nat Commun 2024; 15:5049. [PMID: 38877064 PMCID: PMC11178852 DOI: 10.1038/s41467-024-45062-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 01/10/2024] [Indexed: 06/16/2024] Open
Abstract
Type IV pili (T4P) represent one of the most common varieties of surface appendages in archaea. These filaments, assembled from small pilin proteins, can be many microns long and serve diverse functions, including adhesion, biofilm formation, motility, and intercellular communication. Here, we determine atomic structures of two distinct adhesive T4P from Saccharolobus islandicus via cryo-electron microscopy (cryo-EM). Unexpectedly, both pili were assembled from the same pilin polypeptide but under different growth conditions. One filament, denoted mono-pilus, conforms to canonical archaeal T4P structures where all subunits are equivalent, whereas in the other filament, the tri-pilus, the same polypeptide exists in three different conformations. The three conformations in the tri-pilus are very different from the single conformation found in the mono-pilus, and involve different orientations of the outer immunoglobulin-like domains, mediated by a very flexible linker. Remarkably, the outer domains rotate nearly 180° between the mono- and tri-pilus conformations. Both forms of pili require the same ATPase and TadC-like membrane pore for assembly, indicating that the same secretion system can produce structurally very different filaments. Our results show that the structures of archaeal T4P appear to be less constrained and rigid than those of the homologous archaeal flagellar filaments that serve as helical propellers.
Collapse
Affiliation(s)
- Junfeng Liu
- Institut Pasteur, Université Paris Cité, Archaeal Virology Unit, Paris, France
| | - Gunnar N Eastep
- Department of Biochemistry and Molecular Genetics, University of Alabama at Birmingham, Birmingham, AL, USA
| | | | - Shane T Rich-New
- Department of Biochemistry and Molecular Genetics, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Mark A B Kreutzberger
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Edward H Egelman
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA, USA.
| | - Mart Krupovic
- Institut Pasteur, Université Paris Cité, Archaeal Virology Unit, Paris, France.
| | - Fengbin Wang
- Department of Biochemistry and Molecular Genetics, University of Alabama at Birmingham, Birmingham, AL, USA.
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA, USA.
| |
Collapse
|
35
|
Giri N, Wang L, Cheng J. Cryo2StructData: A Large Labeled Cryo-EM Density Map Dataset for AI-based Modeling of Protein Structures. Sci Data 2024; 11:458. [PMID: 38710720 PMCID: PMC11074267 DOI: 10.1038/s41597-024-03299-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 04/23/2024] [Indexed: 05/08/2024] Open
Abstract
The advent of single-particle cryo-electron microscopy (cryo-EM) has brought forth a new era of structural biology, enabling the routine determination of large biological molecules and their complexes at atomic resolution. The high-resolution structures of biological macromolecules and their complexes significantly expedite biomedical research and drug discovery. However, automatically and accurately building atomic models from high-resolution cryo-EM density maps is still time-consuming and challenging when template-based models are unavailable. Artificial intelligence (AI) methods such as deep learning trained on limited amount of labeled cryo-EM density maps generate inaccurate atomic models. To address this issue, we created a dataset called Cryo2StructData consisting of 7,600 preprocessed cryo-EM density maps whose voxels are labelled according to their corresponding known atomic structures for training and testing AI methods to build atomic models from cryo-EM density maps. Cryo2StructData is larger than existing, publicly available datasets for training AI methods to build atomic protein structures from cryo-EM density maps. We trained and tested deep learning models on Cryo2StructData to validate its quality showing that it is ready for being used to train and test AI methods for building atomic models.
Collapse
Affiliation(s)
- Nabin Giri
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, 65211, USA
- Roy Blunt NextGen Precision Health, University of Missouri, Columbia, MO, 65211, USA
| | - Liguo Wang
- Laboratory for BioMolecular Structure (LBMS), Brookhaven National Laboratory, Upton, NY, 11973, USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, 65211, USA.
- Roy Blunt NextGen Precision Health, University of Missouri, Columbia, MO, 65211, USA.
| |
Collapse
|
36
|
Walton T, Doran MH, Brown A. Structural determination and modeling of ciliary microtubules. Acta Crystallogr D Struct Biol 2024; 80:220-231. [PMID: 38451206 PMCID: PMC10994176 DOI: 10.1107/s2059798324001815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Accepted: 02/24/2024] [Indexed: 03/08/2024] Open
Abstract
The axoneme, a microtubule-based array at the center of every cilium, has been the subject of structural investigations for decades, but only recent advances in cryo-EM and cryo-ET have allowed a molecular-level interpretation of the entire complex to be achieved. The unique properties of the nine doublet microtubules and central pair of singlet microtubules that form the axoneme, including the highly decorated tubulin lattice and the docking of massive axonemal complexes, provide opportunities and challenges for sample preparation, 3D reconstruction and atomic modeling. Here, the approaches used for cryo-EM and cryo-ET of axonemes are reviewed, while highlighting the unique opportunities provided by the latest generation of AI-guided tools that are transforming structural biology.
Collapse
Affiliation(s)
- Travis Walton
- Department of Biological Chemistry and Molecular Pharmacology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Matthew H. Doran
- Department of Biological Chemistry and Molecular Pharmacology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Alan Brown
- Department of Biological Chemistry and Molecular Pharmacology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
37
|
Jamali K, Käll L, Zhang R, Brown A, Kimanius D, Scheres SHW. Automated model building and protein identification in cryo-EM maps. Nature 2024; 628:450-457. [PMID: 38408488 PMCID: PMC11006616 DOI: 10.1038/s41586-024-07215-4] [Citation(s) in RCA: 155] [Impact Index Per Article: 155.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 02/19/2024] [Indexed: 02/28/2024]
Abstract
Interpreting electron cryo-microscopy (cryo-EM) maps with atomic models requires high levels of expertise and labour-intensive manual intervention in three-dimensional computer graphics programs1,2. Here we present ModelAngelo, a machine-learning approach for automated atomic model building in cryo-EM maps. By combining information from the cryo-EM map with information from protein sequence and structure in a single graph neural network, ModelAngelo builds atomic models for proteins that are of similar quality to those generated by human experts. For nucleotides, ModelAngelo builds backbones with similar accuracy to those built by humans. By using its predicted amino acid probabilities for each residue in hidden Markov model sequence searches, ModelAngelo outperforms human experts in the identification of proteins with unknown sequences. ModelAngelo will therefore remove bottlenecks and increase objectivity in cryo-EM structure determination.
Collapse
Affiliation(s)
| | - Lukas Käll
- Science for Life Laboratory, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Rui Zhang
- Washington University in St Louis, St Louis, MO, USA
| | - Alan Brown
- Blavatnik Institute, Harvard Medical School, Boston, MA, USA
| | | | | |
Collapse
|
38
|
Feng MF, Chen YX, Shen HB. DeepQs: Local quality assessment of cryo-EM density map by deep learning map-model fit score. J Struct Biol 2024; 216:108059. [PMID: 38160703 DOI: 10.1016/j.jsb.2023.108059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Revised: 12/18/2023] [Accepted: 12/28/2023] [Indexed: 01/03/2024]
Abstract
Cryogenic electron microscopy maps are valuable for determining macromolecule structures. A proper quality assessment method is essential for cryo-EM map selection or revision. This article presents DeepQs, a novel approach to estimate local quality for 3D cryo-EM density maps, using a deep-learning algorithm based on map-model fit score. DeepQs is a parameter-free method for users and incorporates structural information between map and its related atomic model into well-trained models by deep learning. More specifically, the DeepQs approach leverages the interplay between map and atomic model through predefined map-model fit score, Q-score. DeepQs can get close results to the ground truth map-model fit scores with only cryo-EM map as input. In experiments, DeepQs demonstrates the lowest root mean square error with standard method Fourier shell correlation metric and high correlation with map-model fit score, Q-score, when compared with other local quality estimation methods in high-resolution dataset (<=5 Å). DeepQs can also be applied to evaluate the quality of the post-processed maps. In both cases, DeepQs runs faster by using GPU acceleration. Our program is available at http://www.csbio.sjtu.edu.cn/bioinf/DeepQs for academic use.
Collapse
Affiliation(s)
- Ming-Feng Feng
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| | - Yu-Xuan Chen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China.
| |
Collapse
|
39
|
Wang X, Zhu H, Terashi G, Taluja M, Kihara D. DiffModeler: Large Macromolecular Structure Modeling in Low-Resolution Cryo-EM Maps Using Diffusion Model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.20.576370. [PMID: 38328203 PMCID: PMC10849514 DOI: 10.1101/2024.01.20.576370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/09/2024]
Abstract
Cryogenic electron microscopy (cryo-EM) has now been widely used for determining multi-chain protein complexes. However, modeling a complex structure is challenging particularly when the map resolution is low, typically in the intermediate resolution range of 5 to 10 Å. Within this resolution range, even accurate structure fitting is difficult, let alone de novo modeling. To address this challenge, here we present DiffModeler, a fully automated method for modeling protein complex structures. DiffModeler employs a diffusion model for backbone tracing and integrates AlphaFold2-predicted single-chain structures for structure fitting. Extensive testing on cryo-EM maps at intermediate resolutions demonstrates the exceptional accuracy of DiffModeler in structure modeling, achieving an average TM-Score of 0.92, surpassing existing methodologies significantly. Notably, DiffModeler successfully modeled a protein complex composed of 47 chains and 13,462 residues, achieving a high TM-Score of 0.94. Further benchmarking at low resolutions (10-20 Å confirms its versatility, demonstrating plausible performance. Moreover, when coupled with CryoREAD, DiffModeler excels in constructing protein-DNA/RNA complex structures for near-atomic resolution maps (0-5 Å), showcasing state-of-the-art performance with average TM-Scores of 0.88 and 0.91 across two datasets.
Collapse
Affiliation(s)
- Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, Indiana, 47907, USA
| | - Han Zhu
- Department of Computer Science, Purdue University, West Lafayette, Indiana, 47907, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907, USA
| | - Manav Taluja
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907, USA
- School of Computer Science and Engineering, Vellore Institute of Technology, Tamil Nadu 642014, India
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, Indiana, 47907, USA
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907, USA
| |
Collapse
|
40
|
Corum MR, Venkannagari H, Hryc CF, Baker ML. Predictive modeling and cryo-EM: A synergistic approach to modeling macromolecular structure. Biophys J 2024; 123:435-450. [PMID: 38268190 PMCID: PMC10912932 DOI: 10.1016/j.bpj.2024.01.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/09/2024] [Accepted: 01/18/2024] [Indexed: 01/26/2024] Open
Abstract
Over the last 15 years, structural biology has seen unprecedented development and improvement in two areas: electron cryo-microscopy (cryo-EM) and predictive modeling. Once relegated to low resolutions, single-particle cryo-EM is now capable of achieving near-atomic resolutions of a wide variety of macromolecular complexes. Ushered in by AlphaFold, machine learning has powered the current generation of predictive modeling tools, which can accurately and reliably predict models for proteins and some complexes directly from the sequence alone. Although they offer new opportunities individually, there is an inherent synergy between these techniques, allowing for the construction of large, complex macromolecular models. Here, we give a brief overview of these approaches in addition to illustrating works that combine these techniques for model building. These examples provide insight into model building, assessment, and limitations when integrating predictive modeling with cryo-EM density maps. Together, these approaches offer the potential to greatly accelerate the generation of macromolecular structural insights, particularly when coupled with experimental data.
Collapse
Affiliation(s)
- Michael R Corum
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Harikanth Venkannagari
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Corey F Hryc
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Matthew L Baker
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas.
| |
Collapse
|
41
|
Lawson CL, Kryshtafovych A, Pintilie GD, Burley SK, Černý J, Chen VB, Emsley P, Gobbi A, Joachimiak A, Noreng S, Prisant M, Read RJ, Richardson JS, Rohou AL, Schneider B, Sellers BD, Shao C, Sourial E, Williams CI, Williams CJ, Yang Y, Abbaraju V, Afonine PV, Baker ML, Bond PS, Blundell TL, Burnley T, Campbell A, Cao R, Cheng J, Chojnowski G, Cowtan KD, DiMaio F, Esmaeeli R, Giri N, Grubmüller H, Hoh SW, Hou J, Hryc CF, Hunte C, Igaev M, Joseph AP, Kao WC, Kihara D, Kumar D, Lang L, Lin S, Maddhuri Venkata Subramaniya SR, Mittal S, Mondal A, Moriarty NW, Muenks A, Murshudov GN, Nicholls RA, Olek M, Palmer CM, Perez A, Pohjolainen E, Pothula KR, Rowley CN, Sarkar D, Schäfer LU, Schlicksup CJ, Schröder GF, Shekhar M, Si D, Singharoy A, Sobolev OV, Terashi G, Vaiana AC, Vedithi SC, Verburgt J, Wang X, Warshamanage R, Winn MD, Weyand S, Yamashita K, Zhao M, Schmid MF, Berman HM, Chiu W. Outcomes of the EMDataResource Cryo-EM Ligand Modeling Challenge. RESEARCH SQUARE 2024:rs.3.rs-3864137. [PMID: 38343795 PMCID: PMC10854310 DOI: 10.21203/rs.3.rs-3864137/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/18/2024]
Abstract
The EMDataResource Ligand Model Challenge aimed to assess the reliability and reproducibility of modeling ligands bound to protein and protein/nucleic-acid complexes in cryogenic electron microscopy (cryo-EM) maps determined at near-atomic (1.9-2.5 Å) resolution. Three published maps were selected as targets: E. coli beta-galactosidase with inhibitor, SARS-CoV-2 RNA-dependent RNA polymerase with covalently bound nucleotide analog, and SARS-CoV-2 ion channel ORF3a with bound lipid. Sixty-one models were submitted from 17 independent research groups, each with supporting workflow details. We found that (1) the quality of submitted ligand models and surrounding atoms varied, as judged by visual inspection and quantification of local map quality, model-to-map fit, geometry, energetics, and contact scores, and (2) a composite rather than a single score was needed to assess macromolecule+ligand model quality. These observations lead us to recommend best practices for assessing cryo-EM structures of liganded macromolecules reported at near-atomic resolution.
Collapse
Affiliation(s)
- Catherine L. Lawson
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | | | - Grigore D. Pintilie
- Departments of Bioengineering and of Microbiology and Immunology, Stanford University, Stanford, CA, USA
| | - Stephen K. Burley
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Rutgers Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ USA
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA USA
| | - Jiří Černý
- Institute of Biotechnology, Czech Academy of Sciences, Vestec, CZ
| | | | - Paul Emsley
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Alberto Gobbi
- Discovery Chemistry, Genentech Inc, South San Francisco, USA
| | - Andrzej Joachimiak
- Structural Biology Center, X-ray Science Division, Argonne National Laboratory, Argonne, IL, USA
| | - Sigrid Noreng
- Structural Biology, Genentech Inc, South San Francisco, USA
| | | | - Randy J. Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | | | | | - Bohdan Schneider
- Institute of Biotechnology, Czech Academy of Sciences, Vestec, CZ
| | | | - Chenghua Shao
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | | | | | | | - Ying Yang
- Structural Biology, Genentech Inc, South San Francisco, USA
| | - Venkat Abbaraju
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Pavel V. Afonine
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Matthew L. Baker
- Department of Biochemistry and Molecular Biology, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Paul S. Bond
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Tom L. Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Tom Burnley
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Arthur Campbell
- Center for Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Renzhi Cao
- Department of Computer Science, Pacific Lutheran University, Tacoma, WA, USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | | | - Kevin D. Cowtan
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Frank DiMaio
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Reza Esmaeeli
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Nabin Giri
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | - Helmut Grubmüller
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Soon Wen Hoh
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Jie Hou
- Department of Computer Science, Saint Louis University, St. Louis, MO, USA
| | - Corey F. Hryc
- Department of Biochemistry and Molecular Biology, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Carola Hunte
- Institute of Biochemistry and Molecular Biology, ZBMZ, Faculty of Medicine and CIBSS - Centre for Integrative Biological Signalling Studies, University of Freiburg, 79104 Freiburg, Germany
| | - Maxim Igaev
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Agnel P. Joseph
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Wei-Chun Kao
- Institute of Biochemistry and Molecular Biology, ZBMZ, Faculty of Medicine and CIBSS - Centre for Integrative Biological Signalling Studies, University of Freiburg, 79104 Freiburg, Germany
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Dilip Kumar
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX, USA
| | - Lijun Lang
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Sean Lin
- Division of Computing & Software Systems, University of Washington, Bothell, WA, USA
| | | | - Sumit Mittal
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
- School of Advanced Sciences and Languages, VIT Bhopal University, Bhopal, India
| | - Arup Mondal
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Nigel W. Moriarty
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Andrew Muenks
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA, USA
| | | | | | - Mateusz Olek
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
- Electron Bio-Imaging Centre, Diamond Light Source, Harwell Science and Innovation Campus, Didcot, UK
| | - Colin M. Palmer
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Alberto Perez
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Emmi Pohjolainen
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Karunakar R. Pothula
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
| | | | - Daipayan Sarkar
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Luisa U. Schäfer
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
| | - Christopher J. Schlicksup
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Gunnar F. Schröder
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
- Physics Department, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Mrinal Shekhar
- Center for Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Dong Si
- Division of Computing & Software Systems, University of Washington, Bothell, WA, USA
| | | | - Oleg V. Sobolev
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Andrea C. Vaiana
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
- Nature’s Toolbox (NTx), Rio Rancho, NM, USA
| | | | - Jacob Verburgt
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Xiao Wang
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | | | - Martyn D. Winn
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Simone Weyand
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | | | - Minglei Zhao
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, USA
| | - Michael F. Schmid
- Division of Cryo-EM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
| | - Helen M. Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Wah Chiu
- Departments of Bioengineering and of Microbiology and Immunology, Stanford University, Stanford, CA, USA
- Division of Cryo-EM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
| |
Collapse
|
42
|
Zhang Z, Cai Y, Zhang B, Zheng W, Freddolino L, Zhang G, Zhou X. DEMO-EM2: assembling protein complex structures from cryo-EM maps through intertwined chain and domain fitting. Brief Bioinform 2024; 25:bbae113. [PMID: 38517699 PMCID: PMC10959074 DOI: 10.1093/bib/bbae113] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 02/10/2024] [Accepted: 02/25/2024] [Indexed: 03/24/2024] Open
Abstract
The breakthrough in cryo-electron microscopy (cryo-EM) technology has led to an increasing number of density maps of biological macromolecules. However, constructing accurate protein complex atomic structures from cryo-EM maps remains a challenge. In this study, we extend our previously developed DEMO-EM to present DEMO-EM2, an automated method for constructing protein complex models from cryo-EM maps through an iterative assembly procedure intertwining chain- and domain-level matching and fitting for predicted chain models. The method was carefully evaluated on 27 cryo-electron tomography (cryo-ET) maps and 16 single-particle EM maps, where DEMO-EM2 models achieved an average TM-score of 0.92, outperforming those of state-of-the-art methods. The results demonstrate an efficient method that enables the rapid and reliable solution of challenging cryo-EM structure modeling problems.
Collapse
Affiliation(s)
- Ziying Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Yaxian Cai
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Biao Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Wei Zheng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Lydia Freddolino
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Guijun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Xiaogen Zhou
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| |
Collapse
|
43
|
The wwPDB Consortium, Turner J, Abbott S, Fonseca N, Pye R, Carrijo L, Duraisamy AK, Salih O, Wang Z, Kleywegt GJ, Morris KL, Patwardhan A, Burley SK, Crichlow G, Feng Z, Flatt JW, Ghosh S, Hudson BP, Lawson CL, Liang Y, Peisach E, Persikova I, Sekharan M, Shao C, Young J, Velankar S, Armstrong D, Bage M, Bueno WM, Evans G, Gaborova R, Ganguly S, Gupta D, Harrus D, Tanweer A, Bansal M, Rangannan V, Kurisu G, Cho H, Ikegawa Y, Kengaku Y, Kim JY, Niwa S, Sato J, Takuwa A, Yu J, Hoch JC, Baskaran K, Xu W, Zhang W, Ma X. EMDB-the Electron Microscopy Data Bank. Nucleic Acids Res 2024; 52:D456-D465. [PMID: 37994703 PMCID: PMC10767987 DOI: 10.1093/nar/gkad1019] [Citation(s) in RCA: 48] [Impact Index Per Article: 48.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 10/18/2023] [Accepted: 10/20/2023] [Indexed: 11/24/2023] Open
Abstract
The Electron Microscopy Data Bank (EMDB) is the global public archive of three-dimensional electron microscopy (3DEM) maps of biological specimens derived from transmission electron microscopy experiments. As of 2021, EMDB is managed by the Worldwide Protein Data Bank consortium (wwPDB; wwpdb.org) as a wwPDB Core Archive, and the EMDB team is a core member of the consortium. Today, EMDB houses over 30 000 entries with maps containing macromolecules, complexes, viruses, organelles and cells. Herein, we provide an overview of the rapidly growing EMDB archive, including its current holdings, recent updates, and future plans.
Collapse
|
44
|
Giri N, Wang L, Cheng J. Cryo2StructData: A Large Labeled Cryo-EM Density Map Dataset for AI-based Modeling of Protein Structures. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.06.14.545024. [PMID: 37398020 PMCID: PMC10312718 DOI: 10.1101/2023.06.14.545024] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
The advent of single-particle cryo-electron microscopy (cryo-EM) has brought forth a new era of structural biology, enabling the routine determination of large biological molecules and their complexes at atomic resolution. The high-resolution structures of biological macromolecules and their complexes significantly expedite biomedical research and drug discovery. However, automatically and accurately building atomic models from high-resolution cryo-EM density maps is still time-consuming and challenging when template-based models are unavailable. Artificial intelligence (AI) methods such as deep learning trained on limited amount of labeled cryo-EM density maps generate inaccurate atomic models. To address this issue, we created a dataset called Cryo2StructData consisting of 7,600 preprocessed cryo-EM density maps whose voxels are labelled according to their corresponding known atomic structures for training and testing AI methods to build atomic models from cryo-EM density maps. It is larger and of higher quality than any existing, publicly available dataset. We trained and tested deep learning models on Cryo2StructData to make sure it is ready for the large-scale development of AI methods for building atomic models from cryo-EM density maps.
Collapse
Affiliation(s)
- Nabin Giri
- University of Missouri, Electrical Engineering and Computer Science, Columbia, 65211, USA
- NextGen Precision Health Institute, Columbia, 65211, USA
| | - Liguo Wang
- Laboratory for Biological Structure, Brookhaven National Laboratory, Upton, NY, 11973, USA
| | - Jianlin Cheng
- University of Missouri, Electrical Engineering and Computer Science, Columbia, 65211, USA
- NextGen Precision Health Institute, Columbia, 65211, USA
| |
Collapse
|
45
|
Giri N, Cheng J. De Novo Atomic Protein Structure Modeling for Cryo-EM Density Maps Using 3D Transformer and Hidden Markov Model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.02.573943. [PMID: 38260535 PMCID: PMC10802328 DOI: 10.1101/2024.01.02.573943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Accurately building three-dimensional (3D) atomic structures from 3D cryo-electron microscopy (cryo-EM) density maps is a crucial step in the cryo-EM-based determination of the structures of protein complexes. Despite improvements in the resolution of 3D cryo-EM density maps, the de novo conversion of density maps into 3D atomic structures for protein complexes that do not have accurate homologous or predicted structures to be used as templates remains a significant challenge. Here, we introduce Cryo2Struct, a fully automated ab initio cryo-EM structure modeling method that utilizes a 3D transformer to identify atoms and amino acid types in cryo-EM density maps first, and then employs a novel Hidden Markov Model (HMM) to connect predicted atoms to build backbone structures of proteins. Tested on a standard test dataset of 128 cryo-EM density maps with varying resolutions (2.1 - 5.6 °A) and different numbers of residues (730 - 8,416), Cryo2Struct built substantially more accurate and complete protein structural models than the widely used ab initio method - Phenix in terms of multiple evaluation metrics. Moreover, on a new test dataset of 500 recently released density maps with varying resolutions (1.9 - 4.0 °A) and different numbers of residues (234 - 8,828), it built more accurate models than on the standard dataset. And its performance is rather robust against the change of the resolution of density maps and the size of protein structures.
Collapse
Affiliation(s)
- Nabin Giri
- Electrical Engineering and Computer Science, University of Missouri, Columbia, 65211, Missouri, USA
- NextGen Precision Health Institute, University of Missouri, Columbia, 65211, Missouri, USA
| | - Jianlin Cheng
- Electrical Engineering and Computer Science, University of Missouri, Columbia, 65211, Missouri, USA
- NextGen Precision Health Institute, University of Missouri, Columbia, 65211, Missouri, USA
| |
Collapse
|
46
|
Terashi G, Wang X, Prasad D, Nakamura T, Kihara D. DeepMainmast: integrated protocol of protein structure modeling for cryo-EM with deep learning and structure prediction. Nat Methods 2024; 21:122-131. [PMID: 38066344 DOI: 10.1038/s41592-023-02099-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 10/22/2023] [Indexed: 12/19/2023]
Abstract
Three-dimensional structure modeling from maps is an indispensable step for studying proteins and their complexes with cryogenic electron microscopy. Although the resolution of determined cryogenic electron microscopy maps has generally improved, there are still many cases where tracing protein main chains is difficult, even in maps determined at a near-atomic resolution. Here we developed a protein structure modeling method, DeepMainmast, which employs deep learning to capture the local map features of amino acids and atoms to assist main-chain tracing. Moreover, we integrated AlphaFold2 with the de novo density tracing protocol to combine their complementary strengths and achieved even higher accuracy than each method alone. Additionally, the protocol is able to accurately assign the chain identity to the structure models of homo-multimers, which is not a trivial task for existing methods.
Collapse
Affiliation(s)
- Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Devashish Prasad
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Tsukasa Nakamura
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
- Department of Computer Science, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
47
|
Zhang Z, Tringides ML, Morgan CE, Miyagi M, Mears JA, Hoppel CL, Yu EW. High-Resolution Structural Proteomics of Mitochondria Using the 'Build and Retrieve' Methodology. Mol Cell Proteomics 2023; 22:100666. [PMID: 37839702 PMCID: PMC10709515 DOI: 10.1016/j.mcpro.2023.100666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 10/06/2023] [Accepted: 10/11/2023] [Indexed: 10/17/2023] Open
Abstract
The application of integrated systems biology to the field of structural biology is a promising new direction, although it is still in the infant stages of development. Here we report the use of single particle cryo-EM to identify multiple proteins from three enriched heterogeneous fractions prepared from human liver mitochondrial lysate. We simultaneously identify and solve high-resolution structures of nine essential mitochondrial enzymes with key metabolic functions, including fatty acid catabolism, reactive oxidative species clearance, and amino acid metabolism. Our methodology also identified multiple distinct members of the acyl-CoA dehydrogenase family. This work highlights the potential of cryo-EM to explore tissue proteomics at the atomic level.
Collapse
Affiliation(s)
- Zhemin Zhang
- Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Marios L Tringides
- Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Christopher E Morgan
- Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Masaru Miyagi
- Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Jason A Mears
- Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Charles L Hoppel
- Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Edward W Yu
- Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA.
| |
Collapse
|
48
|
Gao J, Tong M, Lee C, Gaertig J, Legal T, Bui KH. DomainFit: Identification of Protein Domains in cryo-EM maps at Intermediate Resolution using AlphaFold2-predicted Models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.28.569001. [PMID: 38077012 PMCID: PMC10705406 DOI: 10.1101/2023.11.28.569001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/20/2023]
Abstract
Cryo-electron microscopy (cryo-EM) has revolutionized our understanding of macromolecular complexes, enabling high-resolution structure determination. With the paradigm shift to in situ structural biology recently driven by the ground-breaking development of cryo-focused ion beam milling and cryo-electron tomography, there are an increasing number of structures at sub-nanometer resolution of complexes solved directly within their cellular environment. These cellular complexes often contain unidentified proteins, related to different cellular states or processes. Identifying proteins at resolutions lower than 4 Å remains challenging because the side chains cannot be visualized reliably. Here, we present DomainFit, a program for automated domain-level protein identification from cryo-EM maps at resolutions lower than 4 Å. By fitting domains from artificial intelligence-predicted models such as AlphaFold2-predicted models into cryo-EM maps, the program performs statistical analyses and attempts to identify the proteins forming the density. Using DomainFit, we identified two microtubule inner proteins, one of them, a CCDC81 domain-containing protein, is exclusively localized in the proximal region of the doublet microtubule from the ciliate Tetrahymena thermophila. The flexibility and capability of DomainFit makes it a valuable tool for analyzing in situ structures.
Collapse
Affiliation(s)
- Jerry Gao
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
- Centre de recherche en biologie structurale, McGill University, Montréal, Quebec, Canada
| | - Max Tong
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
- Centre de recherche en biologie structurale, McGill University, Montréal, Quebec, Canada
| | - Chinkyu Lee
- Department of Cellular Biology, University of Georgia, Athens, GA, USA
| | - Jacek Gaertig
- Department of Cellular Biology, University of Georgia, Athens, GA, USA
| | - Thibault Legal
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
- Centre de recherche en biologie structurale, McGill University, Montréal, Quebec, Canada
| | - Khanh Huy Bui
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences, McGill University, Québec, Canada
- Centre de recherche en biologie structurale, McGill University, Montréal, Quebec, Canada
| |
Collapse
|
49
|
Terashi G, Wang X, Prasad D, Nakamura T, Zhu H, Kihara D. Integrated Protocol of Protein Structure Modeling for Cryo-EM with Deep Learning and Structure Prediction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.19.563151. [PMID: 37904978 PMCID: PMC10614963 DOI: 10.1101/2023.10.19.563151] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Structure modeling from maps is an indispensable step for studying proteins and their complexes with cryogenic electron microscopy (cryo-EM). Although the resolution of determined cryo-EM maps has generally improved, there are still many cases where tracing protein main-chains is difficult, even in maps determined at a near atomic resolution. Here, we have developed a protein structure modeling method, called DeepMainmast, which employs deep learning to capture the local map features of amino acids and atoms to assist main-chain tracing. Moreover, since Alphafold2 demonstrates high accuracy in protein structure prediction, we have integrated complementary strengths of de novo density tracing using deep learning with Alphafold2's structure modeling to achieve even higher accuracy than each method alone. Additionally, the protocol is able to accurately assign chain identity to the structure models of homo-multimers.
Collapse
Affiliation(s)
- Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907, USA
| | - Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, Indiana, 47907, USA
| | - Devashish Prasad
- Department of Computer Science, Purdue University, West Lafayette, Indiana, 47907, USA
| | - Tsukasa Nakamura
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907, USA
| | - Han Zhu
- Department of Computer Science, Purdue University, West Lafayette, Indiana, 47907, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907, USA
- Department of Computer Science, Purdue University, West Lafayette, Indiana, 47907, USA
| |
Collapse
|
50
|
Wang X, Terashi G, Kihara D. CryoREAD: de novo structure modeling for nucleic acids in cryo-EM maps using deep learning. Nat Methods 2023; 20:1739-1747. [PMID: 37783885 PMCID: PMC10841814 DOI: 10.1038/s41592-023-02032-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 08/24/2023] [Indexed: 10/04/2023]
Abstract
DNA and RNA play fundamental roles in various cellular processes, where their three-dimensional structures provide information critical to understanding the molecular mechanisms of their functions. Although an increasing number of nucleic acid structures and their complexes with proteins are determined by cryogenic electron microscopy (cryo-EM), structure modeling for DNA and RNA remains challenging particularly when the map is determined at a resolution coarser than atomic level. Moreover, computational methods for nucleic acid structure modeling are relatively scarce. Here, we present CryoREAD, a fully automated de novo DNA/RNA atomic structure modeling method using deep learning. CryoREAD identifies phosphate, sugar and base positions in a cryo-EM map using deep learning, which are traced and modeled into a three-dimensional structure. When tested on cryo-EM maps determined at 2.0 to 5.0 Å resolution, CryoREAD built substantially more accurate models than existing methods. We also applied the method to cryo-EM maps of biomolecular complexes in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).
Collapse
Affiliation(s)
- Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, USA.
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|