1
|
Zhou H, Wekesa JS, Luan Y, Meng J. PRPI-SC: an ensemble deep learning model for predicting plant lncRNA-protein interactions. BMC Bioinformatics 2021; 22:415. [PMID: 34429059 PMCID: PMC8385908 DOI: 10.1186/s12859-021-04328-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2020] [Accepted: 11/09/2020] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND Plant long non-coding RNAs (lncRNAs) play vital roles in many biological processes mainly through interactions with RNA-binding protein (RBP). To understand the function of lncRNAs, a fundamental method is to identify which types of proteins interact with the lncRNAs. However, the models or rules of interactions are a major challenge when calculating and estimating the types of RBP. RESULTS In this study, we propose an ensemble deep learning model to predict plant lncRNA-protein interactions using stacked denoising autoencoder and convolutional neural network based on sequence and structural information, named PRPI-SC. PRPI-SC predicts interactions between lncRNAs and proteins based on the k-mer features of RNAs and proteins. Experiments proved good results on Arabidopsis thaliana and Zea mays datasets (ATH948 and ZEA22133). The accuracy rates of ATH948 and ZEA22133 datasets were 88.9% and 82.6%, respectively. PRPI-SC also performed well on some public RNA protein interaction datasets. CONCLUSIONS PRPI-SC accurately predicts the interaction between plant lncRNA and protein, which plays a guiding role in studying the function and expression of plant lncRNA. At the same time, PRPI-SC has a strong generalization ability and good prediction effect for non-plant data.
Collapse
Affiliation(s)
- Haoran Zhou
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024 Liaoning China
| | - Jael Sanyanda Wekesa
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024 Liaoning China
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian, 116024 Liaoning China
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024 Liaoning China
| |
Collapse
|
2
|
Sutandy FXR, Ebersberger S, Huang L, Busch A, Bach M, Kang HS, Fallmann J, Maticzka D, Backofen R, Stadler PF, Zarnack K, Sattler M, Legewie S, König J. In vitro iCLIP-based modeling uncovers how the splicing factor U2AF2 relies on regulation by cofactors. Genome Res 2018; 28:699-713. [PMID: 29643205 PMCID: PMC5932610 DOI: 10.1101/gr.229757.117] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2017] [Accepted: 02/09/2018] [Indexed: 01/26/2023]
Abstract
Alternative splicing generates distinct mRNA isoforms and is crucial for proteome diversity in eukaryotes. The RNA-binding protein (RBP) U2AF2 is central to splicing decisions, as it recognizes 3′ splice sites and recruits the spliceosome. We establish “in vitro iCLIP” experiments, in which recombinant RBPs are incubated with long transcripts, to study how U2AF2 recognizes RNA sequences and how this is modulated by trans-acting RBPs. We measure U2AF2 affinities at hundreds of binding sites and compare in vitro and in vivo binding landscapes by mathematical modeling. We find that trans-acting RBPs extensively regulate U2AF2 binding in vivo, including enhanced recruitment to 3′ splice sites and clearance of introns. Using machine learning, we identify and experimentally validate novel trans-acting RBPs (including FUBP1, CELF6, and PCBP1) that modulate U2AF2 binding and affect splicing outcomes. Our study offers a blueprint for the high-throughput characterization of in vitro mRNP assembly and in vivo splicing regulation.
Collapse
Affiliation(s)
| | | | - Lu Huang
- Institute of Molecular Biology (IMB) gGmbH, 55128 Mainz, Germany
| | - Anke Busch
- Institute of Molecular Biology (IMB) gGmbH, 55128 Mainz, Germany
| | - Maximilian Bach
- Institute of Molecular Biology (IMB) gGmbH, 55128 Mainz, Germany
| | - Hyun-Seo Kang
- Institute of Structural Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany.,Biomolecular NMR and Center for Integrated Protein Science Munich at Department of Chemistry, Technical University of Munich, 85747 Garching, Germany
| | - Jörg Fallmann
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, 04107 Leipzig, Germany
| | - Daniel Maticzka
- Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110 Freiburg, Germany
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110 Freiburg, Germany.,Centre for Biological Signalling Studies (BIOSS), University of Freiburg, 79104 Freiburg, Germany
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, 04107 Leipzig, Germany
| | - Kathi Zarnack
- Buchmann Institute for Molecular Life Sciences (BMLS), Goethe University Frankfurt, 60438 Frankfurt a.M., Germany
| | - Michael Sattler
- Institute of Structural Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany.,Biomolecular NMR and Center for Integrated Protein Science Munich at Department of Chemistry, Technical University of Munich, 85747 Garching, Germany
| | - Stefan Legewie
- Institute of Molecular Biology (IMB) gGmbH, 55128 Mainz, Germany
| | - Julian König
- Institute of Molecular Biology (IMB) gGmbH, 55128 Mainz, Germany
| |
Collapse
|
3
|
Recognition of the 3' splice site RNA by the U2AF heterodimer involves a dynamic population shift. Proc Natl Acad Sci U S A 2016; 113:E7169-E7175. [PMID: 27799531 DOI: 10.1073/pnas.1605873113] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
An essential early step in the assembly of human spliceosomes onto pre-mRNA involves the recognition of regulatory RNA cis elements in the 3' splice site by the U2 auxiliary factor (U2AF). The large (U2AF65) and small (U2AF35) subunits of the U2AF heterodimer contact the polypyrimidine tract (Py-tract) and the AG-dinucleotide, respectively. The tandem RNA recognition motif domains (RRM1,2) of U2AF65 adopt closed/inactive and open/active conformations in the free form and when bound to bona fide Py-tract RNA ligands. To investigate the molecular mechanism and dynamics of 3' splice site recognition by U2AF65 and the role of U2AF35 in the U2AF heterodimer, we have combined single-pair FRET and NMR experiments. In the absence of RNA, the RRM1,2 domain arrangement is highly dynamic on a submillisecond time scale, switching between closed and open conformations. The addition of Py-tract RNA ligands with increasing binding affinity (strength) gradually shifts the equilibrium toward an open conformation. Notably, the protein-RNA complex is rigid in the presence of a strong Py-tract but exhibits internal motion with weak Py-tracts. Surprisingly, the presence of U2AF35, whose UHM domain interacts with U2AF65 RRM1, increases the population of the open arrangement of U2AF65 RRM1,2 in the absence and presence of a weak Py-tract. These data indicate that the U2AF heterodimer promotes spliceosome assembly by a dynamic population shift toward the open conformation of U2AF65 to facilitate the recognition of weak Py-tracts at the 3' splice site. The structure and RNA binding of the heterodimer was unaffected by cancer-linked myelodysplastic syndrome mutants.
Collapse
|
4
|
Abstract
RRM-containing proteins are involved in most of the RNA metabolism steps. Their functions are closely related to their mode of RNA recognition, which has been studied by structural biologists for more than 20 years. In this chapter, we report on high-resolution structures of single and multi RRM-RNA complexes to explain the numerous strategies used by these domains to interact specifically with a large repertoire of RNA sequences. We show that multiple variations of their canonical fold can be used to adapt to different single-stranded sequences with a large range of affinities. Furthermore, we describe the consequences on RNA binding of the different structural arrangements found in tandem RRMs and higher order RNPs. Importantly, these structures also reveal with very high accuracy the RNA motifs bound specifically by RRM-containing proteins, which correspond very often to consensus sequences identified with genome-wide approaches. Finally, we show how structural and cellular biology can benefit from each other and pave a way for understanding, defining, and predicting a code of RNA recognition by the RRMs.
Collapse
|
5
|
Göbl C, Madl T, Simon B, Sattler M. NMR approaches for structural analysis of multidomain proteins and complexes in solution. PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY 2014; 80:26-63. [PMID: 24924266 DOI: 10.1016/j.pnmrs.2014.05.003] [Citation(s) in RCA: 130] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2014] [Accepted: 05/14/2014] [Indexed: 05/22/2023]
Abstract
NMR spectroscopy is a key method for studying the structure and dynamics of (large) multidomain proteins and complexes in solution. It plays a unique role in integrated structural biology approaches as especially information about conformational dynamics can be readily obtained at residue resolution. Here, we review NMR techniques for such studies focusing on state-of-the-art tools and practical aspects. An efficient approach for determining the quaternary structure of multidomain complexes starts from the structures of individual domains or subunits. The arrangement of the domains/subunits within the complex is then defined based on NMR measurements that provide information about the domain interfaces combined with (long-range) distance and orientational restraints. Aspects discussed include sample preparation, specific isotope labeling and spin labeling; determination of binding interfaces and domain/subunit arrangements from chemical shift perturbations (CSP), nuclear Overhauser effects (NOEs), isotope editing/filtering, cross-saturation, and differential line broadening; and based on paramagnetic relaxation enhancements (PRE) using covalent and soluble spin labels. Finally, the utility of complementary methods such as small-angle X-ray or neutron scattering (SAXS, SANS), electron paramagnetic resonance (EPR) or fluorescence spectroscopy techniques is discussed. The applications of NMR techniques are illustrated with studies of challenging (high molecular weight) protein complexes.
Collapse
Affiliation(s)
- Christoph Göbl
- Biomolecular NMR and Center for Integrated Protein Science Munich at Department Chemie, Technische Universität München, Garching, Germany
| | - Tobias Madl
- Biomolecular NMR and Center for Integrated Protein Science Munich at Department Chemie, Technische Universität München, Garching, Germany; Institute of Structural Biology, Helmholtz Zentrum München, Neuherberg, Germany; Institute of Molecular Biology, University of Graz, Graz, Austria.
| | - Bernd Simon
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Michael Sattler
- Biomolecular NMR and Center for Integrated Protein Science Munich at Department Chemie, Technische Universität München, Garching, Germany; Institute of Structural Biology, Helmholtz Zentrum München, Neuherberg, Germany.
| |
Collapse
|
6
|
Lebars I, Vileno B, Bourbigot S, Turek P, Wolff P, Kieffer B. A fully enzymatic method for site-directed spin labeling of long RNA. Nucleic Acids Res 2014; 42:e117. [PMID: 24981512 PMCID: PMC4150755 DOI: 10.1093/nar/gku553] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Site-directed spin labeling is emerging as an essential tool to investigate the structural and dynamical features of RNA. We propose here an enzymatic method, which allows the insertion of a paramagnetic center at a specific position in an RNA molecule. The technique is based on a segmental approach using a ligation protocol with T4 RNA ligase 2. One transcribed acceptor RNA is ligated to a donor RNA in which a thio-modified nucleotide is introduced at its 5′-end by in vitro transcription with T7 RNA polymerase. The paramagnetic thiol-specific reagent is subsequently attached to the RNA ligation product. This novel strategy is demonstrated by introducing a paramagnetic probe into the 55 nucleotides long RNA corresponding to K-turn and Specifier Loop domains from the Bacillus subtilis tyrS T-Box leader RNA. The efficiency of the coupling reaction and the quality of the resulting spin-labeled RNA were assessed by Mass Spectrometry, Electron Paramagnetic Resonance (EPR) and Nuclear Magnetic Resonance (NMR). This method enables various combinations of isotopic segmental labeling and spin labeling schemes, a strategy that will be of particular interest to investigate the structural and dynamical properties of large RNA complexes by NMR and EPR spectroscopies.
Collapse
Affiliation(s)
- Isabelle Lebars
- Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC), Département de Biologie Structurale, Centre National de la Recherche Scientifique (CNRS) UMR 7104/Institut National de la Santé et de la Recherche Médicale (INSERM) U964/Université de Strasbourg, 1 rue Laurent Fries, BP 10142, 67404 Illkirch cedex, France
| | - Bertrand Vileno
- Institut de Chimie, Laboratoire Propriétés Optiques & Magnétiques des Architectures Moléculaires, Université de Strasbourg, UMR 7177 CNRS, 4 rue Blaise Pascal, CS 90032, 67081 Strasbourg Cedex, France
| | - Sarah Bourbigot
- Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC), Département de Biologie Structurale, Centre National de la Recherche Scientifique (CNRS) UMR 7104/Institut National de la Santé et de la Recherche Médicale (INSERM) U964/Université de Strasbourg, 1 rue Laurent Fries, BP 10142, 67404 Illkirch cedex, France
| | - Philippe Turek
- Institut de Chimie, Laboratoire Propriétés Optiques & Magnétiques des Architectures Moléculaires, Université de Strasbourg, UMR 7177 CNRS, 4 rue Blaise Pascal, CS 90032, 67081 Strasbourg Cedex, France
| | - Philippe Wolff
- Institut de Biologie Moléculaire et Cellulaire, Plateforme Protéomique Strasbourg Esplanade, FRC 1589 CNRS, 15 rue René Descartes, 67084 Strasbourg Cedex, France Institut de Biologie Moléculaire et Cellulaire, Architecture et Réactivité des ARN, Université de Strasbourg, UPR 9002 CNRS, 15 rue René Descartes, 67084 Strasbourg Cedex, France
| | - Bruno Kieffer
- Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC), Département de Biologie Structurale, Centre National de la Recherche Scientifique (CNRS) UMR 7104/Institut National de la Santé et de la Recherche Médicale (INSERM) U964/Université de Strasbourg, 1 rue Laurent Fries, BP 10142, 67404 Illkirch cedex, France
| |
Collapse
|
7
|
Huang JR, Warner LR, Sanchez C, Gabel F, Madl T, Mackereth CD, Sattler M, Blackledge M. Transient electrostatic interactions dominate the conformational equilibrium sampled by multidomain splicing factor U2AF65: a combined NMR and SAXS study. J Am Chem Soc 2014; 136:7068-76. [PMID: 24734879 DOI: 10.1021/ja502030n] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Multidomain proteins containing intrinsically disordered linkers exhibit large-scale dynamic modes that play key roles in a multitude of molecular recognition and signaling processes. Here, we determine the conformational space sampled by the multidomain splicing factor U2AF65 using complementary nuclear magnetic resonance spectroscopy and small-angle scattering data. Available degrees of conformational freedom are initially stochastically sampled and experimental data then used to delineate the potential energy landscape in terms of statistical probability. The spatial distribution of U2AF65 conformations is found to be highly anisotropic, comprising significantly populated interdomain contacts that appear to be electrostatic in origin. This hypothesis is supported by the reduction of signature PREs reporting on expected interfaces with increasing salt concentration. The described spatial distribution reveals the complete spectrum of the unbound forms of U2AF65 that coexist with the small percentage of a preformed RNA-bound domain arrangement required for polypyrimidine-tract recognition by conformational selection. More generally, the proposed approach to describing conformational equilibria of multidomain proteins can be further combined with other experimental data that are sensitive to domain dynamics.
Collapse
Affiliation(s)
- Jie-rong Huang
- University Grenoble Alpes, ‡CNRS, and §CEA, Protein Dynamics and Flexibility, Institut de Biologie Structurale , 38000 Grenoble, France
| | | | | | | | | | | | | | | |
Collapse
|
8
|
Hennig J, Sattler M. The dynamic duo: combining NMR and small angle scattering in structural biology. Protein Sci 2014; 23:669-82. [PMID: 24687405 DOI: 10.1002/pro.2467] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Revised: 03/25/2014] [Accepted: 03/28/2014] [Indexed: 12/12/2022]
Abstract
Structural biology provides essential information for elucidating molecular mechanisms that underlie biological function. Advances in hardware, sample preparation, experimental methods, and computational approaches now enable structural analysis of protein complexes with increasing complexity that more closely represent biologically entities in the cellular environment. Integrated multidisciplinary approaches are required to overcome limitations of individual methods and take advantage of complementary aspects provided by different structural biology techniques. Although X-ray crystallography remains the method of choice for structural analysis of large complexes, crystallization of flexible systems is often difficult and does typically not provide insights into conformational dynamics present in solution. Nuclear magnetic resonance spectroscopy (NMR) is well-suited to study dynamics at picosecond to second time scales, and to map binding interfaces even of large systems at residue resolution but suffers from poor sensitivity with increasing molecular weight. Small angle scattering (SAS) methods provide low resolution information in solution and can characterize dynamics and conformational equilibria complementary to crystallography and NMR. The combination of NMR, crystallography, and SAS is, thus, very useful for analysis of the structure and conformational dynamics of (large) protein complexes in solution. In high molecular weight systems, where NMR data are often sparse, SAS provides additional structural information and can differentiate between NMR-derived models. Scattering data can also validate the solution conformation of a crystal structure and indicate the presence of conformational equilibria. Here, we review current state-of-the-art approaches for combining NMR, crystallography, and SAS data to characterize protein complexes in solution.
Collapse
Affiliation(s)
- Janosch Hennig
- Institute of Structural Biology, Helmholtz Zentrum München, Ingolstädter Landstr.1, D-85764, Neuherberg, Germany; Center for Integrated Protein Science Munich at Chair Biomolecular NMR Spectroscopy, Department Chemie, Technische Universität München, Lichtenbergstr. 4, D-85747, Garching, Germany
| | | |
Collapse
|
9
|
Mackereth CD, Sattler M. Dynamics in multi-domain protein recognition of RNA. Curr Opin Struct Biol 2012; 22:287-96. [DOI: 10.1016/j.sbi.2012.03.013] [Citation(s) in RCA: 90] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2012] [Accepted: 03/25/2012] [Indexed: 12/28/2022]
|
10
|
Madl T, Gabel F, Sattler M. NMR and small-angle scattering-based structural analysis of protein complexes in solution. J Struct Biol 2010; 173:472-82. [PMID: 21074620 DOI: 10.1016/j.jsb.2010.11.004] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2010] [Revised: 11/01/2010] [Accepted: 11/04/2010] [Indexed: 01/14/2023]
Abstract
Structural analysis of multi-domain protein complexes is a key challenge in current biology and a prerequisite for understanding the molecular basis of essential cellular processes. The use of solution techniques is important for characterizing the quaternary arrangements and dynamics of domains and subunits of these complexes. In this respect solution NMR is the only technique that allows atomic- or residue-resolution structure determination and investigation of dynamic properties of multi-domain proteins and their complexes. As experimental NMR data for large protein complexes are sparse, it is advantageous to combine these data with additional information from other solution techniques. Here, the utility and computational approaches of combining solution state NMR with small-angle X-ray and Neutron scattering (SAXS/SANS) experiments for structural analysis of large protein complexes is reviewed. Recent progress in experimental and computational approaches of combining NMR and SAS are discussed and illustrated with recent examples from the literature. The complementary aspects of combining NMR and SAS data for studying multi-domain proteins, i.e. where weakly interacting domains are connected by flexible linkers, are illustrated with the structural analysis of the tandem RNA recognition motif (RRM) domains (RRM1-RRM2) of the human splicing factor U2AF65 bound to a nine-uridine (U9) RNA oligonucleotide.
Collapse
Affiliation(s)
- Tobias Madl
- Institute of Structural Biology, Helmholtz Zentrum München, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
| | | | | |
Collapse
|
11
|
Gans P, Hamelin O, Sounier R, Ayala I, Durá MA, Amero CD, Noirclerc-Savoye M, Franzetti B, Plevin MJ, Boisbouvier J. Stereospecific isotopic labeling of methyl groups for NMR spectroscopic studies of high-molecular-weight proteins. Angew Chem Int Ed Engl 2010; 49:1958-62. [PMID: 20157899 DOI: 10.1002/anie.200905660] [Citation(s) in RCA: 160] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Pierre Gans
- Institut de Biologie Structurale Jean-Pierre Ebel, CEA/CNRS/UJF, 41 rue Jules Horowitz, 38027 Grenoble Cedex, France
| | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Gans P, Hamelin O, Sounier R, Ayala I, Durá M, Amero C, Noirclerc-Savoye M, Franzetti B, Plevin M, Boisbouvier J. Stereospecific Isotopic Labeling of Methyl Groups for NMR Spectroscopic Studies of High-Molecular-Weight Proteins. Angew Chem Int Ed Engl 2010. [DOI: 10.1002/ange.200905660] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
13
|
Simon B, Madl T, Mackereth C, Nilges M, Sattler M. An Efficient Protocol for NMR-Spectroscopy-Based Structure Determination of Protein Complexes in Solution. Angew Chem Int Ed Engl 2010; 49:1967-70. [DOI: 10.1002/anie.200906147] [Citation(s) in RCA: 91] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
14
|
Simon B, Madl T, Mackereth C, Nilges M, Sattler M. An Efficient Protocol for NMR-Spectroscopy-Based Structure Determination of Protein Complexes in Solution. Angew Chem Int Ed Engl 2010. [DOI: 10.1002/ange.200906147] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
15
|
Van Horn WD, Beel AJ, Kang C, Sanders CR. The impact of window functions on NMR-based paramagnetic relaxation enhancement measurements in membrane proteins. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2009; 1798:140-9. [PMID: 19751702 DOI: 10.1016/j.bbamem.2009.08.022] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2009] [Revised: 08/25/2009] [Accepted: 08/31/2009] [Indexed: 11/24/2022]
Abstract
Though challenging, solution NMR spectroscopy allows fundamental interrogation of the structure and dynamics of membrane proteins. One major technical hurdle in studies of helical membrane proteins by NMR is the difficulty of obtaining sufficient long range NOEs to determine tertiary structure. For this reason, long range distance information is sometimes sought through measurement of paramagnetic relaxation enhancements (PRE) of NMR nuclei as a function of distance from an introduced paramagnetic probe. Current PRE interpretation is based on the assumption of Lorentzian resonance lineshapes. However, in order to optimize spectral resolution, modern multidimensional NMR spectra are almost always subjected to resolution-enhancement, leading to distortions in the Lorentizian peak shape. Here it is shown that when PREs are derived using peak intensities (i.e., peak height) and linewidths from both real and simulated spectra that were produced using a wide range of apodization/window functions, that there is little variation in the distances determined (<1 A at the extremes). This indicates that the high degree of resolution enhancement required to obtain well-resolved spectra from helical membrane proteins is compatible with the use of PRE data as a source of distance restraints. While these conclusions are particularly important for helical membrane proteins, they are generally applicable to all PRE measurements made using resolution-enhanced data.
Collapse
Affiliation(s)
- Wade D Van Horn
- Department of Biochemistry and Center for Structural Biology, Vanderbilt University School of Medicine, Nashville, TN 37232-8725, USA
| | | | | | | |
Collapse
|
16
|
Nelissen FHT, Girard FC, Tessari M, Heus HA, Wijmenga SS. Preparation of selective and segmentally labeled single-stranded DNA for NMR by self-primed PCR and asymmetrical endonuclease double digestion. Nucleic Acids Res 2009; 37:e114. [PMID: 19553193 PMCID: PMC2761255 DOI: 10.1093/nar/gkp540] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
We demonstrate a new, efficient and easy-to-use method for enzymatic synthesis of (stereo-)specific and segmental (13)C/(15)N/(2)H isotope-labeled single-stranded DNA in amounts sufficient for NMR, based on the highly efficient self-primed PCR. To achieve this, new approaches are introduced and combined. (i) Asymmetric endonuclease double digestion of tandem-repeated PCR product. (ii) T4 DNA ligase mediated ligation of two ssDNA segments. (iii) In vitro dNTP synthesis, consisting of in vitro rNTP synthesis followed by enzymatic stereo-selective reduction of the C2' of the rNTP, and a one-pot add-up synthesis of dTTP from dUTP. The method is demonstrated on two ssDNAs: (i) a 36-nt three-way junction, selectively (13)C(9)/(15)N(3)/(2)H((1',2'',3',4',5',5''))-dC labeled and (ii) a 39-nt triple-repeat three-way junction, selectively (13)C(9)/(15)N(3)/(2)H((1',2'',3',4',5',5''))-dC and (13)C(9)/(15)N(2)/(2)H((1',2'',3',4',5',5''))-dT labeled in segment C20-C39. Their NMR spectra show the spectral simplification, while the stereo-selective (2)H-labeling in the deoxyribose of the dC-residues, straightforwardly provided assignment of their C1'-H2' and C2'-H2' resonances. The labeling protocols can be extended to larger ssDNA molecules and to more than two segments.
Collapse
Affiliation(s)
- Frank H T Nelissen
- Department of Biophysical Chemistry, Institute for Molecules and Materials, Radboud University Nijmegen, Toernooiveld 1, 6525 ED Nijmegen, the Netherlands
| | | | | | | | | |
Collapse
|
17
|
Abstract
Ribonucleoproteins (RNPs) mediate key cellular functions such as gene expression and its regulation. Whereas most RNP enzymes are stable in composition and harbor preformed active sites, the spliceosome, which removes noncoding introns from precursor messenger RNAs (pre-mRNAs), follows fundamentally different strategies. In order to provide both accuracy to the recognition of reactive splice sites in the pre-mRNA and flexibility to the choice of splice sites during alternative splicing, the spliceosome exhibits exceptional compositional and structural dynamics that are exploited during substrate-dependent complex assembly, catalytic activation, and active site remodeling.
Collapse
Affiliation(s)
- Markus C Wahl
- Makromolekulare Röntgenkristallographie, Max-Planck-Institut für biophysikalische Chemie, Am Fassberg 11, D-37077 Göttingen, Germany.
| | | | | |
Collapse
|
18
|
Jain NU. Use of residual dipolar couplings in structural analysis of protein-ligand complexes by solution NMR spectroscopy. Methods Mol Biol 2009; 544:231-52. [PMID: 19488703 DOI: 10.1007/978-1-59745-483-4_15] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Investigation of structure-function relationships in protein complexes, specifically protein-ligand interactions, carry great significance in elucidating the structural and mechanistic bases of molecular recognition events and their role in regulating cell processes. Nuclear magnetic resonance (NMR) spectroscopy is one of the leading structural and analytical techniques in in-depth studies of protein-ligand interactions. Recent advances in NMR methodology such as transverse relaxation-optimized spectroscopy (TROSY) and residual dipolar couplings (RDCs) measured in liquid crystalline alignment medium, offer a viable alternative to traditional nuclear Overhauser enhancement (NOE)-based approaches for structure determination of large protein complexes. RDCs provide a way to constrain the relative orientation of two molecules in complex with each other by aligning their independently determined order tensors. The potential for utilization of RDCs can be extended to proteins with multiple ligands or even multimeric protein-ligand complexes, where symmetry properties of the protein can be taken advantage of. Availability of effective RDC data collection and analysis protocols can certainly aid this process by their incorporation into structure calculation protocols using intramolecular and intermolecular orientational restraints. This chapter discusses in detail some of these protocols including methods for sample preparation in liquid crystalline media, NMR experiments for RDC data collection, as well as software tools for RDC data analysis and protein-ligand complex structure determination.
Collapse
Affiliation(s)
- Nitin U Jain
- Cellular and Molecular Biology Department, University of Tennessee, 37996-0840, Knoxville, TN, USA.
| |
Collapse
|
19
|
Zhang W, Pochapsky SS, Pochapsky TC, Jain NU. Solution NMR structure of putidaredoxin-cytochrome P450cam complex via a combined residual dipolar coupling-spin labeling approach suggests a role for Trp106 of putidaredoxin in complex formation. J Mol Biol 2008; 384:349-63. [PMID: 18835276 DOI: 10.1016/j.jmb.2008.09.037] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2008] [Revised: 09/01/2008] [Accepted: 09/03/2008] [Indexed: 11/30/2022]
Abstract
The 58-kDa complex formed between the [2Fe-2S] ferredoxin, putidaredoxin (Pdx), and cytochrome P450cam (CYP101) from the bacterium Pseudomonas putida has been investigated by high-resolution solution NMR spectroscopy. Pdx serves as both the physiological reductant and effector for CYP101 in the enzymatic reaction involving conversion of substrate camphor to 5-exo-hydroxycamphor. In order to obtain an experimental structure for the oxidized Pdx-CYP101 complex, a combined approach using orientational data on the two proteins derived from residual dipolar couplings and distance restraints from site-specific spin labeling of Pdx has been applied. Spectral changes for residues in and near the paramagnetic metal cluster region of Pdx in complex with CYP101 have also been mapped for the first time using (15)N and (13)C NMR spectroscopy, leading to direct identification of the residues strongly affected by CYP101 binding. The new NMR structure of the Pdx-CYP101 complex agrees well with results from previous mutagenesis and biophysical studies involving residues at the binding interface such as formation of a salt bridge between Asp38 of Pdx and Arg112 of CYP101, while at the same time identifying key features different from those of earlier modeling studies. Analysis of the binding interface of the complex reveals that the side chain of Trp106, the C-terminal residue of Pdx and critical for binding to CYP101, is located across from the heme-binding loop of CYP101 and forms non-polar contacts with several residues in the vicinity of the heme group on CYP101, pointing to a potentially important role in complex formation.
Collapse
Affiliation(s)
- Wei Zhang
- Biochemistry, Cellular and Molecular Biology Department, M407 Walters Life Sciences, University of Tennessee, Knoxville, TN 37996-0840, USA
| | | | | | | |
Collapse
|
20
|
Gabel F, Simon B, Nilges M, Petoukhov M, Svergun D, Sattler M. A structure refinement protocol combining NMR residual dipolar couplings and small angle scattering restraints. JOURNAL OF BIOMOLECULAR NMR 2008; 41:199-208. [PMID: 18670889 DOI: 10.1007/s10858-008-9258-y] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2008] [Accepted: 05/16/2008] [Indexed: 05/26/2023]
Abstract
We present the implementation of a target function based on Small Angle Scattering data (Gabel et al. Eur Biophys J 35(4):313-327, 2006) into the Crystallography and NMR Systems (CNS) and demonstrate its utility in NMR structure calculations by simultaneous application of small angle scattering (SAS) and residual dipolar coupling (RDC) restraints. The efficiency and stability of the approach are demonstrated by reconstructing the structure of a two domain region of the 31 kDa nuclear export factor TAP (TIP-associated protein). Starting with the high resolution X-ray structures of the two individual TAP domains, the translational and orientational domain arrangement is refined simultaneously. We tested the stability of the protocol against variations of the SAS target parameters and the number of RDCs and their uncertainties. The activation of SAS restraints results in an improved translational clustering of the domain positions and lifts part of the fourfold degeneracy of their orientations (associated with a single alignment tensor). The resulting ensemble of structures reflects the conformational space that is consistent with the experimental SAS and RDC data. The SAS target function is computationally very efficient. SAS restraints can be activated at different levels of precision and only a limited SAS angular range is required. When combined with additional data from chemical shift perturbation, paramagnetic relaxation enhancement or mutational analysis the SAS refinement is an efficient approach for defining the topology of multi-domain and/or multimeric biomolecular complexes in solution based on available high resolution structures (NMR or X-ray) of the individual domains.
Collapse
Affiliation(s)
- F Gabel
- Structural and Computational Biology Unit, EMBL, Meyerhofstrasse 1, Heidelberg, Germany
| | | | | | | | | | | |
Collapse
|
21
|
Pintacuda G, Giraud N, Pierattelli R, Böckmann A, Bertini I, Emsley L. Solid-State NMR Spectroscopy of a Paramagnetic Protein: Assignment and Study of Human Dimeric Oxidized CuII–ZnII Superoxide Dismutase (SOD). Angew Chem Int Ed Engl 2007; 46:1079-82. [PMID: 17191298 DOI: 10.1002/anie.200603093] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Guido Pintacuda
- Laboratoire de Chimie, UMR 5182 CNRS-ENS Lyon, Ecole Normale Supérieure de Lyon, 46 allée d'Italie, 69364 Lyon, France
| | | | | | | | | | | |
Collapse
|
22
|
Pintacuda G, Giraud N, Pierattelli R, Böckmann A, Bertini I, Emsley L. Solid-State NMR Spectroscopy of a Paramagnetic Protein: Assignment and Study of Human Dimeric Oxidized CuII–ZnII Superoxide Dismutase (SOD). Angew Chem Int Ed Engl 2007. [DOI: 10.1002/ange.200603093] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
23
|
Gabel F, Simon B, Sattler M. A target function for quaternary structural refinement from small angle scattering and NMR orientational restraints. EUROPEAN BIOPHYSICS JOURNAL: EBJ 2006; 35:313-27. [PMID: 16416140 DOI: 10.1007/s00249-005-0037-3] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2005] [Revised: 11/25/2005] [Accepted: 12/05/2005] [Indexed: 11/28/2022]
Abstract
We present a novel target function based on atomic coordinates that permits quaternary structural refinement of multi-domain protein-protein or protein-RNA complexes. It requires that the high-resolution structures of the individual domains are known and that small angle scattering (SAS) data as well as NMR orientational restraints from residual dipolar couplings (RDCs) of the complex are available. We show that, when used in combination, the translational and rotational restraints contained in SAS intensities and RDCs, respectively, define a target potential function that permits to determine the overall topology of complexes made up of domains with low internal symmetry. We apply the target function on a modestly anisotropic model system, the Barnase/Barstar complex, and discuss factors that influence the structural refinement such as data errors and the geometrical properties of the individual domains.
Collapse
Affiliation(s)
- Frank Gabel
- Structural and Computational Biology Group, European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany.
| | | | | |
Collapse
|