1
Du Z, Xiao X, Uversky VN. DeepA-RBPBS: A hybrid convolution and recurrent neural network combined with attention mechanism for predicting RBP binding site. J Biomol Struct Dyn 2020; 40:4250-4258. [PMID: 33272122 DOI: 10.1080/07391102.2020.1854861] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 10/22/2022]
Abstract
Inferring the binding sites of RNA-binding proteins (RBPs) is important for understanding the interaction between an RBP and its RNA targets and for deciphering the mechanisms of transcriptional regulation. However, experimental detection of RBP binding sites is still time-intensive and expensive. Algorithms based on machine learning can speed up detection of RBP binding sites. In this article, we propose a new deep learning method, DeepA-RBPBS, which uses RNA sequence and structural features to predict RBP binding sites. DeepA-RBPBS uses a CNN and a BiGRU to extract sequence and structural features without long-term dependence issues. It also utilizes an attention mechanism to enhance the contribution of key features. The comparison shows that DeepA-RBPBS performs better than state-of-the-art predictors. In testing on 31 datasets of CLIP-seq experiments over 19 proteins, its MCC (AUC) is 8% (5%) higher than that of iDeepS, the latest deep-learning-based method. We also apply DeepA-RBPBS to the target RNA data of RBPs related to diabetes (LIN28, RBFOX2, FTO, IGF2BP2, CELF1 and HuR). The results show that DeepA-RBPBS correctly predicted 41,693 samples, whereas iDeepS predicted 31,381 samples. Communicated by Ramaswamy H. Sarma.
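The abstract above reports performance as MCC (Matthews correlation coefficient). As a reading aid, the following minimal sketch shows how MCC is computed from binary confusion-matrix counts. The function name `mcc` and the example counts are illustrative assumptions and are not taken from the paper.

```python
import math

def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from binary confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Conventional fallback when a row or column of the matrix is empty.
        return 0.0
    return (tp * tn - fp * fn) / denom

# Hypothetical counts for one binding-site dataset (not from the paper).
score = mcc(tp=80, tn=70, fp=30, fn=20)
print(round(score, 3))
```

MCC ranges from -1 to 1 and, unlike plain accuracy, stays informative when binding and non-binding samples are imbalanced.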
Affiliation(s)
- Zhihua Du
- Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen University, P.R. China
- Xiangdong Xiao
- Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen University, P.R. China
- Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- Laboratory of New Methods in Biology, Institute for Biological Instrumentation, Russian Academy of Sciences, Moscow, Russia
2
Boyd PS, Brown JB, Brown JD, Catazaro J, Chaudry I, Ding P, Dong X, Marchant J, O’Hern CT, Singh K, Swanson C, Summers MF, Yasin S. NMR Studies of Retroviral Genome Packaging. Viruses 2020; 12:v12101115. [PMID: 33008123 PMCID: PMC7599994 DOI: 10.3390/v12101115] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 08/04/2020] [Revised: 09/18/2020] [Accepted: 09/26/2020] [Indexed: 12/03/2022] Open
Abstract
Nearly all retroviruses selectively package two copies of their unspliced RNA genomes from a cellular milieu that contains a substantial excess of non-viral and spliced viral RNAs. Over the past four decades, combinations of genetic experiments, phylogenetic analyses, nucleotide accessibility mapping, in silico RNA structure predictions, and biophysical experiments were employed to understand how retroviral genomes are selected for packaging. Genetic studies provided early clues regarding the protein and RNA elements required for packaging, and nucleotide accessibility mapping experiments provided insights into the secondary structures of functionally important elements in the genome. Three-dimensional structural determinants of packaging were primarily derived by nuclear magnetic resonance (NMR) spectroscopy. A key advantage of NMR, relative to other methods for determining biomolecular structure (such as X-ray crystallography), is that it is well suited for studies of conformationally dynamic and heterogeneous systems—a hallmark of the retrovirus packaging machinery. Here, we review advances in understanding of the structures, dynamics, and interactions of the proteins and RNA elements involved in retroviral genome selection and packaging that are facilitated by NMR.
3
Abstract
RNA recognition frequently results in conformational changes that optimize intermolecular binding. As a consequence, the overall binding affinity of RNA to its binding partners depends not only on the intermolecular interactions formed in the bound state but also on the energy cost associated with changing the RNA conformational distribution. Measuring these "conformational penalties" is, however, challenging because bound RNA conformations tend to have equilibrium populations in the absence of the binding partner that fall outside detection by conventional biophysical methods. In this study we employ as a model system HIV-1 TAR RNA and its interaction with the ligand argininamide (ARG), a mimic of TAR's cognate protein binding partner, the transactivator Tat. We use NMR chemical shift perturbations and relaxation dispersion in combination with Bayesian inference to develop a detailed thermodynamic model of coupled conformational change and ligand binding. Starting from a comprehensive 12-state model of the equilibrium, we estimate the energies of six distinct detectable thermodynamic states that are not accessible by currently available methods. Our approach identifies a minimum of four RNA intermediates that differ in terms of the TAR conformation and ARG occupancy. The dominant bound TAR conformation features two bound ARG ligands and has an equilibrium population in the absence of ARG that is below detection limit. Consequently, even though ARG binds to TAR with an apparent overall weak affinity (Kd,app ≈ 0.2 mM), it binds the prefolded conformation with a Kd in the nM range. Our results show that conformational penalties can be major determinants of RNA-ligand binding affinity as well as a source of binding cooperativity, with important implications for a predictive understanding of how RNA is recognized and for RNA-targeted drug discovery.
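The link between a weak apparent affinity and a tight affinity for the prefolded conformation can be illustrated with a worked relation. The block below is only a sketch under a strong simplifying assumption: a two-state, single-ligand conformational-selection scheme, not the 12-state, two-ligand model of the study. The symbols R, R*, p* and the 20 nM value are introduced here for illustration and do not come from the abstract.

```latex
% Simplified two-state, single-ligand conformational-selection model (assumption).
% R: dominant free conformation, R^*: binding-competent conformation, L: ligand.
\begin{align}
K_{\mathrm{conf}} &= \frac{[R^*]}{[R]}, \qquad
p^* = \frac{K_{\mathrm{conf}}}{1 + K_{\mathrm{conf}}}, \qquad
K_d = \frac{[R^*][L]}{[R^*L]} \\
K_d^{\mathrm{app}} &= \frac{([R] + [R^*])\,[L]}{[R^*L]} = \frac{K_d}{p^*} \\
\Delta G_{\mathrm{penalty}} &= -RT \ln p^* = RT \ln\frac{K_d^{\mathrm{app}}}{K_d}
\end{align}
% Illustrative numbers only: K_d^{app} = 0.2 mM with a hypothetical K_d = 20 nM
% gives p^* = 10^{-4} and \Delta G_{penalty} \approx 5.5 kcal/mol at 298 K.
```

Under this simplification, a binding-competent population far below typical detection limits is enough to shift a nanomolar intrinsic affinity into the sub-millimolar apparent range.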
Affiliation(s)
- Nicole I. Orlovsky
- Department of Biochemistry, Duke University Medical Center, Durham, North Carolina 27710, United States
- Hashim M. Al-Hashimi
- Department of Biochemistry, Duke University Medical Center, Durham, North Carolina 27710, United States
- Department of Chemistry, Duke University, Durham, North Carolina 27708, United States
- Terrence G. Oas
- Department of Biochemistry, Duke University Medical Center, Durham, North Carolina 27710, United States
- Department of Chemistry, Duke University, Durham, North Carolina 27708, United States
4
Ospina-Villa JD, García-Contreras J, Rosas-Trigueros JL, Ramírez-Moreno E, López-Camarillo C, Zamora-López B, Marchat LA, Zamorano-Carrillo A. Importance of amino acids Leu135 and Tyr236 for the interaction between EhCFIm25 and RNA: a molecular dynamics simulation study. J Mol Model 2018; 24:202. [PMID: 30003410 DOI: 10.1007/s00894-018-3729-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Received: 04/03/2018] [Accepted: 06/19/2018] [Indexed: 11/28/2022]
Abstract
The CFIm25 subunit of the heterotetrameric cleavage factor Im (CFIm) is a critical factor in the formation of the poly(A) tail at the mRNA 3' end, regulating the recruitment of polyadenylation factors, poly(A) site selection, and cleavage/polyadenylation reactions. We previously reported the homologous protein (EhCFIm25) in Entamoeba histolytica, the protozoan causing human amoebiasis, and showed the relevance of the conserved Leu135 and Tyr236 residues for RNA binding. We also identified the GUUG sequence as the recognition site of EhCFIm25. To understand the interaction network that allows EhCFIm25 to maintain its three-dimensional structure and function, here we performed molecular dynamics simulations of wild-type (WT) and mutant proteins, alone or interacting with the GUUG molecule. Our results indicated that in the presence of the GUUG sequence, WT converged more quickly to lower RMSD values than the mutant proteins. However, RMSF values showed that the movements of amino acids of WT and EhCFIm25*L135T were almost identical, whether or not they interacted with the GUUG molecule. Interestingly, EhCFIm25*L135T, which is the only mutant with slight RNA binding activity experimentally, presents the same stabilization of bend structures and alpha helices as WT, notably in the C-terminus. Moreover, WT and EhCFIm25*L135T presented almost the same number of contacts, which mainly involve lysine residues interacting with the G4 nucleotide. Overall, our data provide a clear description of the structural and mechanistic features that govern the RNA binding capacity of EhCFIm25.
Affiliation(s)
- Juan David Ospina-Villa
- Programa Institucional de Biomedicina Molecular, Programa de Doctorado en Ciencias en Biotecnología, ENMH, Instituto Politécnico Nacional, Guillermo Massieu Helguera 239, Fracc. La Escalera, Ticomán, Del. Gustavo A. Madero, CP 07320, Ciudad de México, Mexico
- Juan García-Contreras
- Programa Institucional de Biomedicina Molecular, Programa de Doctorado en Ciencias en Biotecnología, ENMH, Instituto Politécnico Nacional, Guillermo Massieu Helguera 239, Fracc. La Escalera, Ticomán, Del. Gustavo A. Madero, CP 07320, Ciudad de México, Mexico
- Jorge Luis Rosas-Trigueros
- Laboratorio Transdisciplinario de Investigación en Sistemas Evolutivos, ESCOM, Instituto Politécnico Nacional, Av. Juan de Dios Bátiz esq. Miguel Othón de Mendizábal, Col. Lindavista, Del. Gustavo A. Madero, CP 07738, Ciudad de México, Mexico
- Esther Ramírez-Moreno
- Programa Institucional de Biomedicina Molecular, Programa de Doctorado en Ciencias en Biotecnología, ENMH, Instituto Politécnico Nacional, Guillermo Massieu Helguera 239, Fracc. La Escalera, Ticomán, Del. Gustavo A. Madero, CP 07320, Ciudad de México, Mexico
- César López-Camarillo
- Posgrado en Ciencias Genómicas, Universidad Autónoma de la Ciudad de México, San Lorenzo 290, Colonia del Valle, CP 03100, Ciudad de México, Mexico
- Beatriz Zamora-López
- Departamento de Psiquiatría y Salud Mental, Facultad de Medicina, UNAM, Circuito Interior y Cerro del Agua, CP 04510, Ciudad de México, Mexico
- Laurence A Marchat
- Programa Institucional de Biomedicina Molecular, Programa de Doctorado en Ciencias en Biotecnología, ENMH, Instituto Politécnico Nacional, Guillermo Massieu Helguera 239, Fracc. La Escalera, Ticomán, Del. Gustavo A. Madero, CP 07320, Ciudad de México, Mexico
- Absalom Zamorano-Carrillo
- Programa Institucional de Biomedicina Molecular, Programa de Doctorado en Ciencias en Biotecnología, ENMH, Instituto Politécnico Nacional, Guillermo Massieu Helguera 239, Fracc. La Escalera, Ticomán, Del. Gustavo A. Madero, CP 07320, Ciudad de México, Mexico
5
Cross- and Co-Packaging of Retroviral RNAs and Their Consequences. Viruses 2016; 8:v8100276. [PMID: 27727192 PMCID: PMC5086612 DOI: 10.3390/v8100276] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Received: 07/11/2016] [Revised: 10/03/2016] [Accepted: 10/03/2016] [Indexed: 12/23/2022] Open
Abstract
Retroviruses belong to the family Retroviridae and are ribonucleoprotein (RNP) particles that contain a dimeric RNA genome. Retroviral particle assembly is a complex process, and how the virus is able to recognize and specifically capture the genomic RNA (gRNA) among millions of other cellular and spliced retroviral RNAs has been the subject of extensive investigation over the last two decades. The specificity towards RNA packaging requires higher order interactions of the retroviral gRNA with the structural Gag proteins. Moreover, several retroviruses have been shown to have the ability to cross-/co-package gRNA from other retroviruses, despite little sequence homology. This review will compare the determinants of gRNA encapsidation among different retroviruses, followed by an examination of our current understanding of the interaction between diverse viral genomes and heterologous proteins, leading to their cross-/co-packaging. Retroviruses are well-known serious animal and human pathogens, and such a cross-/co-packaging phenomenon could result in the generation of novel viral variants with unknown pathogenic potential. At the same time, however, an enhanced understanding of the molecular mechanisms involved in these specific interactions makes retroviruses an attractive target for anti-viral drugs, vaccines, and vectors for human gene therapy.
6
Abstract
In the last few decades, small regulatory RNA (sRNA) molecules emerged as key regulators in every kingdom of life. Resolving the full targetome of sRNAs has however remained a challenge. To address this, we used an in vivo tagging MS2-affinity purification protocol coupled with RNA sequencing technology, namely MAPS, to assemble full bacterial small RNAs targetomes. The impressive potential of MAPS has been supported by a number of reports. Here, we concisely overview RNA-tagging history that preceded the development of the MAPS assay and expose the range of possible uses of this technology.
Affiliation(s)
- Marie-Claude Carrier
- Department of Biochemistry, RNA Group, Université de Sherbrooke, Sherbrooke, Québec, Canada
- David Lalaouna
- Department of Biochemistry, RNA Group, Université de Sherbrooke, Sherbrooke, Québec, Canada
- Eric Massé
- Department of Biochemistry, RNA Group, Université de Sherbrooke, Sherbrooke, Québec, Canada
7
Ma X, Guo J, Xiao K, Sun X. PRBP: Prediction of RNA-Binding Proteins Using a Random Forest Algorithm Combined with an RNA-Binding Residue Predictor. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:1385-1393. [PMID: 26671809 DOI: 10.1109/tcbb.2015.2418773] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Indexed: 06/05/2023]
Abstract
The prediction of RNA-binding proteins is an incredibly challenging problem in computational biology. Although great progress has been made using various machine learning approaches with numerous features, the problem is still far from being solved. In this study, we attempt to predict RNA-binding proteins directly from amino acid sequences. A novel approach, PRBP, predicts RNA-binding proteins using the information of predicted RNA-binding residues in conjunction with a random forest based method. For a given protein, we first predict its RNA-binding residues and then judge whether the protein binds RNA or not based on information from that prediction. If the protein cannot be identified by the information associated with its predicted RNA-binding residues, then a novel random forest predictor is used to determine if the query protein is an RNA-binding protein. We incorporated features of evolutionary information combined with physicochemical features (EIPP) and an amino acid composition feature to establish the random forest predictor. Feature analysis showed that EIPP contributed the most to the prediction of RNA-binding proteins. The results also showed that the information from the RNA-binding residue prediction improved the overall performance of our RNA-binding protein prediction. It is anticipated that the PRBP method will become a useful tool for identifying RNA-binding proteins. A PRBP Web server implementation is freely available at http://www.cbi.seu.edu.cn/PRBP/.
8
Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection. BIOMED RESEARCH INTERNATIONAL 2015; 2015:425810. [PMID: 26543860 PMCID: PMC4620426 DOI: 10.1155/2015/425810] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Received: 06/24/2015] [Accepted: 09/21/2015] [Indexed: 11/17/2022]
Abstract
The prediction of RNA-binding proteins is one of the most challenging problems in computational biology. Although some studies have investigated this problem, the accuracy of prediction is still not sufficient. In this study, a highly accurate method was developed to predict RNA-binding proteins from amino acid sequences using random forests with the minimum redundancy maximum relevance (mRMR) method, followed by incremental feature selection (IFS). We incorporated conjoint triad features and three novel features: binding propensity (BP), nonbinding propensity (NBP), and evolutionary information combined with physicochemical properties (EIPP). The results showed that these novel features have important roles in improving the performance of the predictor. Using the mRMR-IFS method, our predictor achieved the best performance (86.62% accuracy and 0.737 Matthews correlation coefficient). High prediction accuracy and successful prediction performance suggested that our method can be a useful approach to identify RNA-binding proteins from sequence information.
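The conjoint triad features mentioned above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the commonly used grouping of the 20 amino acids into seven classes, returns raw counts without any normalization, and uses the hypothetical names `CLASSES` and `conjoint_triad_counts`.

```python
from itertools import product

# Seven residue classes commonly used for conjoint triads (assumed grouping).
CLASSES = ["AGV", "ILFP", "YMTS", "HNQW", "RK", "DE", "C"]
AA_TO_CLASS = {aa: i for i, group in enumerate(CLASSES) for aa in group}
TRIADS = list(product(range(7), repeat=3))  # 7 * 7 * 7 = 343 class triples

def conjoint_triad_counts(protein):
    """Count each triple of residue classes over consecutive residues."""
    # Non-standard symbols are dropped, so their neighbours become adjacent.
    classes = [AA_TO_CLASS[aa] for aa in protein.upper() if aa in AA_TO_CLASS]
    counts = dict.fromkeys(TRIADS, 0)
    for triple in zip(classes, classes[1:], classes[2:]):
        counts[triple] += 1
    return [counts[t] for t in TRIADS]

features = conjoint_triad_counts("AGVRK")
print(len(features), sum(features))
```

A fixed-length vector of this kind can be combined with other descriptors before feature ranking and selection.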
9
Tran T, Liu Y, Marchant J, Monti S, Seu M, Zaki J, Yang AL, Bohn J, Ramakrishnan V, Singh R, Hernandez M, Vega A, Summers MF. Conserved determinants of lentiviral genome dimerization. Retrovirology 2015; 12:83. [PMID: 26420212 PMCID: PMC4588261 DOI: 10.1186/s12977-015-0209-x] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Received: 07/22/2015] [Accepted: 09/18/2015] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Retroviruses selectively package two copies of their unspliced genomes by what appears to be a dimerization-dependent RNA packaging mechanism. Dimerization of human immunodeficiency virus Type-1 (HIV-1) genomes is initiated by "kissing" interactions between GC-rich palindromic loop residues of a conserved hairpin (DIS), and is indirectly promoted by long-range base pairing between residues overlapping the gag start codon (AUG) and an upstream Unique 5' element (U5). The DIS and U5:AUG structures are phylogenetically conserved among divergent retroviruses, suggesting conserved functions. However, some studies suggest that the DIS of HIV-2 does not participate in dimerization, and that U5:AUG pairing inhibits, rather than promotes, genome dimerization. We prepared RNAs corresponding to native and mutant forms of the 5' leaders of HIV-1 (NL4-3 strain), HIV-2 (ROD strain), and two divergent strains of simian immunodeficiency virus (SIV; cpz-TAN1 and -US strains), and probed for potential roles of the DIS and U5:AUG base pairing on intrinsic and NC-dependent dimerization by mutagenesis, gel electrophoresis, and NMR spectroscopy.
RESULTS Dimeric forms of the native HIV-2 and SIV leaders were only detectable using running buffers that contained Mg2+, indicating that these dimers are more labile than that of the HIV-1 leader. Mutations designed to promote U5:AUG base pairing promoted dimerization of the HIV-2 and SIV RNAs, whereas mutations that prevented U5:AUG pairing inhibited dimerization. Chimeric HIV-2 and SIV leader RNAs containing the dimer-promoting loop of HIV-1 (DIS) exhibited HIV-1 leader-like dimerization properties, whereas an HIV-1NL4-3 mutant containing the SIVcpzTAN1 DIS loop behaved like the SIVcpzTAN1 leader. The cognate NC proteins exhibited varying abilities to promote dimerization of the retroviral leader RNAs, but none were able to convert labile dimers to non-labile dimers.
CONCLUSIONS The finding that U5:AUG formation promotes dimerization of the full-length HIV-1, HIV-2, SIVcpzUS, and SIVcpzTAN1 5' leaders suggests that these retroviruses utilize a common RNA structural switch mechanism to modulate function. Differences in native and NC-dependent dimerization propensity and lability are due to variations in the compositions of the DIS loop residues rather than other sequences within the leader RNAs. Although NC is a well-known RNA chaperone, its role in dimerization has the hallmarks of a classical riboswitch.
Affiliation(s)
- Thao Tran
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Yuanyuan Liu
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Jan Marchant
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Sarah Monti
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Michelle Seu
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Jessica Zaki
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Ae Lim Yang
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Jennifer Bohn
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Venkateswaran Ramakrishnan
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Rashmi Singh
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Mateo Hernandez
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Alexander Vega
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
- Michael F Summers
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD, 21250, USA.
10
Li JH, Chiu WC, Yao YC, Cheng RP. Effect of arginine methylation on the RNA recognition and cellular uptake of Tat-derived peptides. Bioorg Med Chem 2015; 23:2281-6. [DOI: 10.1016/j.bmc.2015.01.051] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 12/05/2014] [Revised: 01/22/2015] [Accepted: 01/28/2015] [Indexed: 12/16/2022]
11
Livi CM, Blanzieri E. Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures. BMC Bioinformatics 2014; 15:123. [PMID: 24780077 PMCID: PMC4098778 DOI: 10.1186/1471-2105-15-123] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Received: 10/18/2013] [Accepted: 04/16/2014] [Indexed: 12/14/2022] Open
Abstract
Background RNA-binding proteins interact with specific RNA molecules to regulate important cellular processes. It is therefore necessary to identify the RNA interaction partners in order to understand the precise functions of such proteins. Protein-RNA interactions are typically characterized using in vivo and in vitro experiments but these may not detect all binding partners. Therefore, computational methods that capture the protein-dependent nature of such binding interactions could help to predict potential binding partners in silico. Results We have developed three methods to predict whether an RNA can interact with a particular RNA-binding protein using support vector machines and different features based on the sequence (the Oli method), the motif score (the OliMo method) and the secondary structure (the OliMoSS method). We applied these approaches to different experimentally-derived datasets and compared the predictions with RNAcontext and RPISeq. Oli outperformed OliMoSS and RPISeq, confirming our protein-specific predictions and suggesting that tetranucleotide frequencies are appropriate discriminative features. Oli and RNAcontext were the most competitive methods in terms of the area under curve. A precision-recall curve analysis achieved higher precision values for Oli. On a second experimental dataset including real negative binding information, Oli outperformed RNAcontext with a precision of 0.73 vs. 0.59. Conclusions Our experiments showed that features based on primary sequence information are sufficiently discriminating to predict specific RNA-protein interactions. Sequence motifs and secondary structure information were not necessary to improve these predictions. Finally we confirmed that protein-specific experimental data concerning RNA-protein interactions are valuable sources of information that can be used for the efficient training of models for in silico predictions. The scripts are available upon request to the corresponding author.
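The abstract above points to tetranucleotide frequencies as discriminative sequence features. The sketch below shows one plausible way to turn an RNA sequence into such a 256-dimensional vector. It is an illustration only: the name `tetranucleotide_frequencies`, the overlapping-window counting, and the handling of non-ACGU symbols are assumptions, and the authors' scripts may differ.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGU"
KMERS = ["".join(p) for p in product(ALPHABET, repeat=4)]  # 4**4 = 256 tetranucleotides

def tetranucleotide_frequencies(rna):
    """Return the relative frequencies of all 256 overlapping 4-mers."""
    seq = rna.upper().replace("T", "U")  # accept DNA-style input as well
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    # Windows containing symbols outside ACGU are excluded from the total.
    total = sum(counts[k] for k in KMERS)
    return [counts[k] / total if total else 0.0 for k in KMERS]

vector = tetranucleotide_frequencies("GUUGUUGU")
print(len(vector))
```

A vector like this could then be passed to a classifier such as a support vector machine, which is the learning method named in the abstract.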
Affiliation(s)
- Carmen M Livi
- Department of Information Engineering and Computer Science, University of Trento, Via Sommarive 5, Trento, Italy.
12
Mattheis C, Wang H, Schwarzer MC, Frenking G, Agarwal S. Exploring suitable oligoamines for phantom ring-closing condensation polymerization with guanidine hydrochloride. Polym Chem 2013. [DOI: 10.1039/c2py20672b] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Indexed: 11/21/2022]
13
Wang R, Li H. The mysterious RAMP proteins and their roles in small RNA-based immunity. Protein Sci 2012; 21:463-70. [PMID: 22323284 DOI: 10.1002/pro.2044] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Indexed: 11/07/2022]
Abstract
A new class of prokaryotic RNA binding proteins, called Repeat Associated Mysterious Proteins (RAMPs), has recently been identified. These proteins play key roles in a novel type of immunity in which the DNA of the host organism (e.g. a prokaryote) has sequence segments corresponding to the sequences of potential viral invaders. The sequences embedded in the host DNA confer immunity by directing selective destruction of the nucleic acid of the virus using an RNA-based strategy. In this viral defense mechanism, RAMP proteins have multiple functional roles including endoribonucleolytic cleavage and ribonucleoprotein particle assembly. RAMPs contain the classical RNA recognition motif (RRM), often in tandem, and a conserved glycine-rich segment (G-loop) near the carboxyl terminus. However, unlike RRMs that bind single-stranded RNA using their β-sheet surface, RAMPs make use of both sides of the RRM fold and interact with both single-stranded and structured RNA. The unique spatial arrangement of the two RRM folds, facilitated by a hallmark G-loop, is crucial to formation of a composite surface for recognition of specific RNA. Evidence for RNA-dependent oligomerization is also observed in some RAMP proteins that may serve as an important strategy to increase specificity.
Affiliation(s)
- Ruiying Wang
- Department of Chemistry and Biochemistry, Florida State University, Tallahassee, Florida 32306, USA
14
Identification of a minimal region of the HIV-1 5'-leader required for RNA dimerization, NC binding, and packaging. J Mol Biol 2012; 417:224-39. [PMID: 22306406 DOI: 10.1016/j.jmb.2012.01.033] [Citation(s) in RCA: 80] [Impact Index Per Article: 6.7] [Received: 11/15/2011] [Revised: 01/13/2012] [Accepted: 01/21/2012] [Indexed: 11/23/2022]
Abstract
Assembly of human immunodeficiency virus type 1 (HIV-1) particles is initiated in the cytoplasm by the formation of a ribonucleoprotein complex comprising the dimeric RNA genome and a small number of viral Gag polyproteins. Genomes are recognized by the nucleocapsid (NC) domains of Gag, which interact with packaging elements believed to be located primarily within the 5'-leader (5'-L) of the viral RNA. Recent studies revealed that the native 5'-L exists as an equilibrium of two conformers, one in which dimer-promoting residues and NC binding sites are sequestered and packaging is attenuated, and one in which these sites are exposed and packaging is promoted. To identify the elements within the dimeric 5'-L that are important for packaging, we generated HIV-1 5'-L RNAs containing mutations and deletions designed to eliminate substructures without perturbing the overall structure of the leader and examined effects of the mutations on RNA dimerization, NC binding, and packaging. Our findings identify a 159-residue RNA packaging signal that possesses dimerization and NC binding properties similar to those of the intact 5'-L and contains elements required for efficient RNA packaging.
15
Lu K, Heng X, Summers MF. Structural determinants and mechanism of HIV-1 genome packaging. J Mol Biol 2011; 410:609-33. [PMID: 21762803 DOI: 10.1016/j.jmb.2011.04.029] [Citation(s) in RCA: 180] [Impact Index Per Article: 13.8] [Received: 02/28/2011] [Revised: 04/11/2011] [Accepted: 04/11/2011] [Indexed: 11/30/2022]
Abstract
Like all retroviruses, the human immunodeficiency virus selectively packages two copies of its unspliced RNA genome, both of which are utilized for strand-transfer-mediated recombination during reverse transcription, a process that enables rapid evolution under environmental and chemotherapeutic pressures. The viral RNA appears to be selected for packaging as a dimer, and there is evidence that dimerization and packaging are mechanistically coupled. Both processes are mediated by interactions between the nucleocapsid domains of a small number of assembling viral Gag polyproteins and RNA elements within the 5'-untranslated region of the genome. A number of secondary structures have been predicted for regions of the genome that are responsible for packaging, and high-resolution structures have been determined for a few small RNA fragments and protein-RNA complexes. However, major questions regarding the RNA structures (and potentially the structural changes) that are responsible for dimeric genome selection remain unanswered. Here, we review efforts that have been made to identify the molecular determinants and mechanism of human immunodeficiency virus type 1 genome packaging.
Affiliation(s)
- Kun Lu
- Howard Hughes Medical Institute and Department of Chemistry and Biochemistry, University of Maryland Baltimore County, Baltimore, MD 21250, USA
16
Ahmad S, Sarai A. Analysis of electric moments of RNA-binding proteins: implications for mechanism and prediction. BMC STRUCTURAL BIOLOGY 2011; 11:8. [PMID: 21284850 PMCID: PMC3048485 DOI: 10.1186/1472-6807-11-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Received: 06/29/2010] [Accepted: 02/01/2011] [Indexed: 11/24/2022]
Abstract
Background: Protein-RNA interactions play an important role in many biological processes such as gene regulation, replication, protein synthesis and virus assembly. Although many structures of various types of protein-RNA complexes have been determined, the mechanism of protein-RNA recognition remains elusive. We have earlier shown that the simplest electrostatic properties, viz. charge, dipole and quadrupole moments, calculated from backbone atomic coordinates of proteins, are biased relative to other proteins, and that these quantities can be used to identify DNA-binding proteins. The closely related RNA-binding proteins are investigated in this study. In particular, discrimination between various types of RNA-binding proteins, evolutionary conservation of these bulk electrostatic features and the effect of conformational changes upon complex formation are investigated. The basic binding mechanism of a putative RNA-binding protein (HI1333 from Haemophilus influenzae) is suggested as a potential application of this study.
Results: We found that, similar to DNA-binding proteins (DBPs), RNA-binding proteins (RBPs) also show significantly higher values of electric moments. However, higher moments in RBPs are found to depend strongly on their functional class: proteins binding to ribosomal RNA (rRNA) constitute the only class with all three of the properties (charge, dipole and quadrupole moments) being higher than in control proteins. Neural networks were trained using leave-one-out cross-validation to predict RBPs from control data, and to assess pair-wise classification capacity between proteins binding to various RNA types. Discrimination of RBPs from control proteins reached up to 78%, measured as the area under the ROC curve (AUC). Proteins binding to rRNA are found to be best distinguished (AUC = 79%). Changes in dipole and quadrupole moments between unbound and bound structures were small, and these properties are found to be robust under complex formation.
Conclusions: The bulk electric moments of proteins considered here provide insights into target recognition by RNA-binding proteins, as well as the ability to distinguish one type of RBP from others. These results help in understanding the mechanism of protein-RNA recognition and in identifying RNA-binding proteins.
Affiliation(s)
- Shandar Ahmad
- Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Iizuka, Fukuoka, 820-8502, Japan
|
17
|
Netter C, Weber G, Benecke H, Wahl MC. Functional stabilization of an RNA recognition motif by a noncanonical N-terminal expansion. RNA 2009; 15:1305-13. [PMID: 19447915 PMCID: PMC2704084 DOI: 10.1261/rna.1359909] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Indexed: 05/14/2023]
Abstract
RNA recognition motifs (RRMs) constitute versatile macromolecular interaction platforms. They are found in many components of spliceosomes, in which they mediate RNA and protein interactions by diverse molecular strategies. The human U11/U12-65K protein of the minor spliceosome employs a C-terminal RRM to bind hairpin III of the U12 small nuclear RNA (snRNA). This interaction comprises one side of a molecular bridge between the U11 and U12 small nuclear ribonucleoprotein particles (snRNPs) and is reminiscent of the binding of the N-terminal RRMs in the major spliceosomal U1A and U2B'' proteins to hairpins in their cognate snRNAs. Here we show by mutagenesis and electrophoretic mobility shift assays that the beta-sheet surface and a neighboring loop of 65K C-terminal RRM are involved in RNA binding, as previously seen in canonical RRMs like the N-terminal RRMs of the U1A and U2B'' proteins. However, unlike U1A and U2B'', some 30 residues N-terminal of the 65K C-terminal RRM core are additionally required for stable U12 snRNA binding. The crystal structure of the expanded 65K C-terminal RRM revealed that the N-terminal tail adopts an alpha-helical conformation and wraps around the protein toward the face opposite the RNA-binding platform. Point mutations in this part of the protein had only minor effects on RNA affinity. Removal of the N-terminal extension significantly decreased the thermal stability of the 65K C-terminal RRM. These results demonstrate that the 65K C-terminal RRM is augmented by an N-terminal element that confers stability to the domain, and thereby facilitates stable RNA binding.
MESH Headings
- Crystallography, X-Ray
- Electrophoretic Mobility Shift Assay
- Humans
- Models, Molecular
- Protein Conformation
- Protein Structure, Secondary
- RNA, Small Nuclear/genetics
- RNA, Small Nuclear/metabolism
- Ribonucleoprotein, U1 Small Nuclear/genetics
- Ribonucleoprotein, U1 Small Nuclear/metabolism
- Ribonucleoproteins, Small Nuclear/chemistry
- Ribonucleoproteins, Small Nuclear/genetics
- Ribonucleoproteins, Small Nuclear/metabolism
- Spliceosomes
- snRNP Core Proteins/genetics
- snRNP Core Proteins/metabolism
Affiliation(s)
- Catharina Netter
- Max-Planck-Institut für Biophysikalische Chemie, D-37077 Göttingen, Germany
|
18
|
Shazman S, Mandel-Gutfreund Y. Classifying RNA-binding proteins based on electrostatic properties. PLoS Comput Biol 2008; 4:e1000146. [PMID: 18716674 PMCID: PMC2518515 DOI: 10.1371/journal.pcbi.1000146] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.8] [Received: 09/12/2007] [Accepted: 06/26/2008] [Indexed: 01/15/2023]
Abstract
Protein structure can provide new insight into the biological function of a protein and can enable the design of better experiments to learn its biological roles. Moreover, deciphering the interactions of a protein with other molecules can contribute to the understanding of the protein's function within cellular processes. In this study, we apply a machine learning approach for classifying RNA-binding proteins based on their three-dimensional structures. The method is based on characterizing unique properties of electrostatic patches on the protein surface. Using an ensemble of general protein features and specific properties extracted from the electrostatic patches, we have trained a support vector machine (SVM) to distinguish RNA-binding proteins from other positively charged proteins that do not bind nucleic acids. Specifically, the method was applied on proteins possessing the RNA recognition motif (RRM) and successfully classified RNA-binding proteins from RRM domains involved in protein–protein interactions. Overall the method achieves 88% accuracy in classifying RNA-binding proteins, yet it cannot distinguish RNA from DNA binding proteins. Nevertheless, by applying a multiclass SVM approach we were able to classify the RNA-binding proteins based on their RNA targets, specifically, whether they bind a ribosomal RNA (rRNA), a transfer RNA (tRNA), or messenger RNA (mRNA). Finally, we present here an innovative approach that does not rely on sequence or structural homology and could be applied to identify novel RNA-binding proteins with unique folds and/or binding motifs.
Gene expression in all living organisms is regulated by a complex set of events at both transcriptional and posttranscriptional levels. RNA-binding proteins play a key role in posttranscriptional events including splicing, stability, transport, and translation. Nowadays, there is increasing evidence that many other cellular processes may be mediated by RNA. Identifying new proteins involved in interaction with RNA is thus essential to unraveling the cellular processes in which these interactions are involved. In the current study we present a successful computational approach for classifying RNA-binding proteins and distinguishing them from other proteins based on structural and electrostatic properties. We test the method on a unique protein domain, the RNA recognition motif (RRM), which mediates both RNA and protein interactions. We show that we can discriminate RNA-binding RRMs from protein-binding RRMs. Further, we demonstrate that we can classify known RNA-binding proteins based on their RNA target (mRNA, rRNA, or tRNA). Our method does not rely on any kind of evolutionary information and thus can be applied to identify RNA-binding proteins with novel modes of RNA recognition.
Affiliation(s)
- Shula Shazman
- Faculty of Biology, Technion—Israel Institute of Technology, Haifa, Israel
|
19
|
Jacob DT, DeStefano JJ. A new role for HIV nucleocapsid protein in modulating the specificity of plus strand priming. Virology 2008; 378:385-96. [PMID: 18632127 DOI: 10.1016/j.virol.2008.06.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Received: 05/05/2008] [Revised: 05/26/2008] [Accepted: 06/05/2008] [Indexed: 11/28/2022]
Abstract
The current study indicates a new role for HIV nucleocapsid protein (NC) in modulating the specificity of plus strand priming. RNase H cleavage by reverse transcriptase (RT) during minus strand synthesis gives rise to RNA fragments that could potentially be used as primers for synthesis of the plus strand, leading to the initiation of priming from multiple points as has been observed for other retroviruses. For HIV, the central and 3' polypurine tracts (PPTs) are the major sites of plus strand initiation. Using reconstituted in vitro assays, results showed that NC greatly reduced the efficiency of extension of non-PPT RNA primers, but not PPT. Experiments mimicking HIV replication showed that RT generated and used both PPT and non-PPT RNAs to initiate "plus strand" synthesis, but non-PPT usage was strongly inhibited by NC. The results support a role for NC in specifying primer usage during plus strand synthesis.
Affiliation(s)
- Deena T Jacob
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland 20742, USA
|
20
|
Nucleocapsid protein function in early infection processes. Virus Res 2008; 134:39-63. [PMID: 18279991 DOI: 10.1016/j.virusres.2007.12.006] [Citation(s) in RCA: 124] [Impact Index Per Article: 7.8] [Received: 10/10/2007] [Revised: 12/13/2007] [Accepted: 12/13/2007] [Indexed: 01/15/2023]
Abstract
The role of nucleocapsid protein (NC) in the early steps of retroviral replication appears largely that of a facilitator for reverse transcription and integration. Using a wide variety of cell-free assay systems, the properties of mature NC proteins (e.g. HIV-1 p7(NC) or MLV p10(NC)) as nucleic acid chaperones have been extensively investigated. The effect of NC on tRNA annealing, reverse transcription initiation, minus-strand-transfer, processivity of reverse transcription, plus-strand-transfer, strand-displacement synthesis, 3' processing of viral DNA by integrase, and integrase-mediated strand-transfer has been determined by a large number of laboratories. Interestingly, these reactions can all be accomplished to varying degrees in the absence of NC; some are facilitated by both viral and non-viral proteins and peptides that may or may not be involved in vivo. What is one to conclude from the observation that NC is not strictly required for these necessary reactions to occur? NC likely enhances the efficiency of each of these steps, thereby vastly improving the productivity of infection. In other words, one of the major roles of NC is to enhance the effectiveness of early infection, thereby increasing the probability of productive replication and ultimately of retrovirus survival.
|
21
|
Zheng S, Robertson TA, Varani G. A knowledge-based potential function predicts the specificity and relative binding energy of RNA-binding proteins. FEBS J 2007; 274:6378-91. [PMID: 18005254 DOI: 10.1111/j.1742-4658.2007.06155.x] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.5] [Indexed: 11/28/2022]
Abstract
RNA-protein interactions are fundamental to gene expression. Thus, the molecular basis for the sequence dependence of protein-RNA recognition has been extensively studied experimentally. However, there have been very few computational studies of this problem, and no sustained attempt has been made towards using computational methods to predict or alter the sequence-specificity of these proteins. In the present study, we provide a distance-dependent statistical potential function derived from our previous work on protein-DNA interactions. This potential function discriminates native structures from decoys, successfully predicts the native sequences recognized by sequence-specific RNA-binding proteins, and recapitulates experimentally determined relative changes in binding energy due to mutations of individual amino acids at protein-RNA interfaces. Thus, this work demonstrates that statistical models allow the quantitative analysis of protein-RNA recognition based on their structure and can be applied to modeling protein-RNA interfaces for prediction and design purposes.
Affiliation(s)
- Suxin Zheng
- Department of Chemistry, University of Washington, Seattle, WA 98195, USA
|
22
|
Li H. Complexes of tRNA and maturation enzymes: shaping up for translation. Curr Opin Struct Biol 2007; 17:293-301. [PMID: 17580114 DOI: 10.1016/j.sbi.2007.05.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Received: 01/13/2007] [Revised: 03/27/2007] [Accepted: 05/25/2007] [Indexed: 11/29/2022]
Abstract
Several significant structures of transfer ribonucleic acid (tRNA) maturation enzymes complexed with precursor tRNA or fragments thereof have been published recently, providing detailed knowledge of enzyme-tRNA recognition and catalytic strategies. In addition to reinforcing the general principles of RNA-protein interaction, the new structures highlight both the features of composite RNA recognition by multiple enzyme subunits and the pronounced RNA structural flexibility in or near the active site in all cases. These structural principles provide plausible explanations for the exquisite specificity and catalytic power of these enzymes and, in the case of evolutionary adaptation, for the ability of some enzymes to develop novel specificities.
Affiliation(s)
- Hong Li
- Department of Chemistry and Biochemistry, Institute of Molecular Biophysics, Florida State University, Tallahassee, FL 32306, USA.
|
23
|
Eshete M, Marchbank MT, Deutscher SL, Sproat B, Leszczynska G, Malkiewicz A, Agris PF. Specificity of Phage Display Selected Peptides for Modified Anticodon Stem and Loop Domains of tRNA. Protein J 2007; 26:61-73. [PMID: 17237992 DOI: 10.1007/s10930-006-9046-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Indexed: 11/30/2022]
Abstract
Protein recognition of RNA has been studied using Peptide Phage Display Libraries, but in the absence of RNA modifications. Peptides from two libraries, selected for binding the modified anticodon stem and loop (ASL) of human tRNA(Lys3) having 2-thiouridine (s(2)U34) and pseudouridine (psi39), bound the modified human ASL(Lys3)(s(2)U34; psi39) preferentially and had significant homology with RNA binding proteins. Selected peptides were narrowed to a manageable number using a less sensitive, but inexpensive assay before conducting intensive characterization. The affinity and specificity of the best binding peptide (with an N-terminal fluorescein) were characterized by fluorescence spectrophotometry. The peptide exhibited the highest binding affinity for ASL(Lys3)(s(2)U34; psi39), followed by the hypermodified ASL(Lys3) (mcm(5)s(2)U34; ms(2)t(6)A37) and the unmodified ASL(Lys3), but bound poorly to singly modified ASL(Lys3) constructs (psi39, ms(2)t(6)A37, s(2)U34), ASL(Lys1,2) (t(6)A37) and Escherichia coli ASL(Glu) (s(2)U34). Thus, RNA modifications are potentially important recognition elements for proteins and can be targets for selective recognition by peptides.
Affiliation(s)
- Matthewos Eshete
- Department of Molecular and Structural Biochemistry, North Carolina State University, 128 Polk Hall, Campus Box 7622, Raleigh, NC, 27695-7622, USA
|
24
|
Narayanan N, Gorelick RJ, DeStefano JJ. Structure/function mapping of amino acids in the N-terminal zinc finger of the human immunodeficiency virus type 1 nucleocapsid protein: residues responsible for nucleic acid helix destabilizing activity. Biochemistry 2006; 45:12617-28. [PMID: 17029416 PMCID: PMC4829079 DOI: 10.1021/bi060925c] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Indexed: 11/28/2022]
Abstract
The nucleocapsid protein (NC) of HIV-1 is 55 amino acids in length and possesses two CCHC-type zinc fingers. Finger one (N-terminal) contributes significantly more to helix destabilizing activity than finger two (C-terminal). Five amino acids differ between the two zinc fingers. To determine at the amino acid level the reason for the apparent distinction between the fingers, each different residue in finger one was incrementally replaced by the one at the corresponding location in finger two. Mutants were analyzed in annealing assays with unstructured and structured substrates. Three groupings emerged: (1) those similar to wild-type levels (N17K, A25M), (2) those with diminished activity (I24Q, N27D), and (3) mutant F16W, which had substantially greater helix destabilizing activity than that of the wild type. Unlike I24Q and the other mutants, N27D was defective in DNA binding. Only I24Q and N27D showed reduced strand transfer in in vitro assays. Double and triple mutants F16W/I24Q, F16W/N27D, and F16W/I24Q/N27D all showed defects in DNA binding, strand transfer, and helix destabilization, suggesting that the I24Q and N27D mutations have a dominant negative effect and abolish the positive influence of F16W. Results show that amino acid differences at positions 24 and 27 contribute significantly to finger one's helix destabilizing activity.
Affiliation(s)
- Nirupama Narayanan
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD
- Robert J. Gorelick
- AIDS Vaccine Program, SAIC-Frederick, Inc., NCI at Frederick, Frederick, MD
- Jeffrey J. DeStefano
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD
|
25
|
Nie M, Htun H. Different modes and potencies of translational repression by sequence-specific RNA-protein interaction at the 5'-UTR. Nucleic Acids Res 2006; 34:5528-40. [PMID: 17023487 PMCID: PMC1635260 DOI: 10.1093/nar/gkl584] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Indexed: 12/26/2022]
Abstract
To determine whether sequence-specific RNA–protein interaction at the 5′-untranslated region (5′-UTR) can potently repress translation in mammalian cells, a bicistronic translational repression assay was developed to permit direct assessment of RNA–protein interaction and translational repression in transiently transfected living mammalian cells. Changes in cap-dependent yellow fluorescent protein (YFP) and internal ribosome entry sequence (IRES)-dependent cyan fluorescent protein (CFP) translation were monitored by fluorescence microscopy. Selective repression of YFP or coordinate repression of both YFP and CFP translation occurred, indicating two distinct modes by which RNA-binding proteins repress translation through the 5′-UTR. Interestingly, a single-stranded RNA-binding protein from Bacillus subtilis, tryptophan RNA-binding attenuation protein (TRAP), showed potent translational repression, dependent on the level of TRAP expression and position of its cognate binding site within the bicistronic reporter transcript. As the first of its class to be examined in mammalian cells, its potency in repression of translation through the 5′-UTR may be a general feature for this class of single-stranded RNA-binding proteins. Finally, a one-hybrid screen based on translational repression through the 5′-UTR identified linkers supporting full-translational repression as well as a range of partial repression by TRAP within the context of a fusion protein.
Affiliation(s)
- Minghua Nie
- Department of Obstetrics and Gynecology, Molecular Biology Institute, University of California Los Angeles-Jonsson Comprehensive Cancer Center, 22-168 CHS, David Geffen School of Medicine at UCLA, 10833 Le Conte Avenue, Box 951740, Los Angeles, CA 90095-1740, USA
- Department of Molecular and Medical Pharmacology, Molecular Biology Institute, University of California Los Angeles-Jonsson Comprehensive Cancer Center, 22-168 CHS, David Geffen School of Medicine at UCLA, 10833 Le Conte Avenue, Box 951740, Los Angeles, CA 90095-1740, USA
- Han Htun
- Department of Obstetrics and Gynecology, Molecular Biology Institute, University of California Los Angeles-Jonsson Comprehensive Cancer Center, 22-168 CHS, David Geffen School of Medicine at UCLA, 10833 Le Conte Avenue, Box 951740, Los Angeles, CA 90095-1740, USA
- Department of Molecular and Medical Pharmacology, Molecular Biology Institute, University of California Los Angeles-Jonsson Comprehensive Cancer Center, 22-168 CHS, David Geffen School of Medicine at UCLA, 10833 Le Conte Avenue, Box 951740, Los Angeles, CA 90095-1740, USA
- To whom correspondence should be addressed. Tel: +1 310 206 3015; Fax: +1 310 206 3670;
|
26
|
Hagan NA, Fabris D. Dissecting the protein-RNA and RNA-RNA interactions in the nucleocapsid-mediated dimerization and isomerization of HIV-1 stemloop 1. J Mol Biol 2006; 365:396-410. [PMID: 17070549 PMCID: PMC1847390 DOI: 10.1016/j.jmb.2006.09.081] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Received: 08/02/2006] [Revised: 09/21/2006] [Accepted: 09/27/2006] [Indexed: 10/24/2022]
Abstract
The specific binding of HIV-1 nucleocapsid protein (NC) to the different forms assumed in vitro by the stemloop 1 (Lai variant) of the genome's packaging signal has been investigated using electrospray ionization-Fourier transform mass spectrometry (ESI-FTMS). The simultaneous observation of protein-RNA and RNA-RNA interactions in solution has provided direct information about the role of NC in the two-step model of RNA dimerization and isomerization. In particular, two distinct binding sites have been identified on the monomeric stemloop structure, corresponding to the apical loop and stem-bulge motifs. These sites share similar binding affinities that are intermediate between those of stemloop 3 (SL3) and the putative stemloop 4 (SL4) of the packaging signal. Binding to the apical loop, which contains the dimerization initiation site (DIS), competes directly with the annealing of self-complementary sequences to form a metastable kissing-loop (KL) dimer. In contrast, binding to the stem-bulge affects indirectly the monomer-dimer equilibrium by promoting the rearrangement of KL into the more stable extended duplex (ED) conformer. This process is mediated by the duplex-melting activity of NC, which destabilizes the intramolecular base-pairs surrounding the KL stem-bulges and enables their exchange to form the inter-strand pairs that define the ED structure. In this conformer, high-affinity binding takes place at stem-bulge sites that are identical to those present in the monomeric and KL forms. In this case, however, the NC-induced "breathing" does not result in dissociation of the double-stranded structure because of the large number of intermolecular base-pairs. 
The different binding modes manifested by conformer-specific mutants have shown that NC can also provide low affinity interactions with the bulged-out adenine bases flanking the DIS region of the ED conformer, thus supporting the hypothesis that these exposed nucleotides may constitute "base-grips" for protein contacts during the late stages of the viral lifecycle.
Affiliation(s)
- Nathan A. Hagan
- University of Maryland Baltimore County, Department of Chemistry and Biochemistry, 1000 Hilltop Circle, Baltimore, MD 21228 USA, Tel. (410) 455-3053, Fax (410) 455-2608,
- Daniele Fabris
- University of Maryland Baltimore County, Department of Chemistry and Biochemistry, 1000 Hilltop Circle, Baltimore, MD 21228 USA, Tel. (410) 455-3053, Fax (410) 455-2608,
|
27
|
Comprehensive Alanine-scanning Mutagenesis of Escherichia coli CsrA Defines Two Subdomains of Critical Functional Importance. J Biol Chem 2006. [DOI: 10.1016/s0021-9258(19)84098-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 11/24/2022]
|
28
|
Mercante J, Suzuki K, Cheng X, Babitzke P, Romeo T. Comprehensive alanine-scanning mutagenesis of Escherichia coli CsrA defines two subdomains of critical functional importance. J Biol Chem 2006; 281:31832-42. [PMID: 16923806 DOI: 10.1074/jbc.m606057200] [Citation(s) in RCA: 92] [Impact Index Per Article: 5.1] [Indexed: 11/06/2022]
Abstract
The RNA-binding protein CsrA (carbon storage regulator) of Escherichia coli is a global regulator of gene expression and is representative of the CsrA/RsmA family of bacterial proteins. These proteins act by regulating mRNA translation and stability and are antagonized by binding to small noncoding RNAs. Although the RNA target sequence and structure for CsrA binding have been well defined, little information exists concerning the protein requirements for RNA recognition. The three-dimensional structures of three CsrA/RsmA proteins were recently solved, revealing a novel protein fold consisting of two interdigitated monomers. Here, we performed comprehensive alanine-scanning mutagenesis on csrA of E. coli and tested the 58 resulting mutants for regulation of glycogen accumulation, motility, and biofilm formation. Quantitative effects of these mutations on expression of glgCA'-'lacZ, flhDC'-'lacZ, and pgaA'-'lacZ translational fusions were also examined, and eight of the mutant proteins were purified and tested for RNA binding. These studies identified two regions of the amino acid sequence that were critical for regulation and RNA binding, located within the first (beta1, residues 2-7) and containing the last (beta5, residues 40-47) beta-strands of CsrA. The beta1 and beta5 strands of opposite monomers lie adjacent and parallel to each other in the three-dimensional structure of this protein. Given the symmetry of the CsrA dimer, these findings imply that two distinct RNA binding surfaces or functional subdomains lie on opposite sides of the protein.
Affiliation(s)
- Jeffrey Mercante
- Department of Microbiology and Immunology, Emory University School of Medicine, Atlanta, Georgia 30322, USA
|
29
|
Le TT, Harlepp S, Guet CC, Dittmar K, Emonet T, Pan T, Cluzel P. Real-time RNA profiling within a single bacterium. Proc Natl Acad Sci U S A 2005; 102:9160-4. [PMID: 15967986 PMCID: PMC1166617 DOI: 10.1073/pnas.0503311102] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.1] [Indexed: 11/18/2022]
Abstract
Characterizing the dynamics of specific RNA levels requires real-time RNA profiling in a single cell. We show that the combination of a synthetic modular genetic system with fluorescence correlation spectroscopy allows us to directly measure in real time the activity of any specific promoter in prokaryotes. Using a simple inducible gene expression system, we found that induced RNA levels within a single bacterium of Escherichia coli exhibited a pulsating profile in response to a steady input of inducer. The genetic deletion of an efflux pump system, a key determinant of antibiotic resistance, altered the pulsating transcriptional dynamics and caused overexpression of induced RNA. In contrast with population measurements, real-time RNA profiling permits identifying relationships between genotypes and transcriptional dynamics that are accessible only at the level of the single cell.
Affiliation(s)
- Thuc T Le
- Institute for Biophysical Dynamics and The James Franck Institute and Department of Biochemistry and Molecular Biology, University of Chicago, 5640 South Ellis Avenue, Chicago, IL 60637, USA
|
30
|
Pustowka A, Dietz J, Ferner J, Baumann M, Landersz M, Königs C, Schwalbe H, Dietrich U. Identification of peptide ligands for target RNA structures derived from the HIV-1 packaging signal psi by screening phage-displayed peptide libraries. Chembiochem 2004; 4:1093-7. [PMID: 14523928 DOI: 10.1002/cbic.200300681] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Indexed: 11/09/2022]
Affiliation(s)
- Anette Pustowka
- Georg-Speyer-Haus, Institute for Biomedical Research, Paul-Ehrlich-Strasse 42-44, 60596 Frankfurt, Germany
|
31
|
Abstract
RNA is an ancient and highly versatile molecule that plays fundamental roles in all living organisms. Its molecular functions range from being a mediator of genetic information to the regulation of essential cellular processes. These functions are often accomplished in close association with RNA binding proteins. Over the past few years, a considerable number of high-resolution three-dimensional structures of important protein-RNA complexes have been determined. Here, we wish to discuss recent examples and highlight principles and distinct features of single-stranded RNA recognition by conserved RNA binding domains.
Affiliation(s)
- Ana C Messias
- Structural and Computational Biology, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
|
32
|
McCracken S, Longman D, Johnstone IL, Cáceres JF, Blencowe BJ. An evolutionarily conserved role for SRm160 in 3'-end processing that functions independently of exon junction complex formation. J Biol Chem 2003; 278:44153-60. [PMID: 12944400 DOI: 10.1074/jbc.m306856200] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.6] [Indexed: 11/06/2022]
Abstract
SRm160 (the SR-related nuclear matrix protein of 160 kDa) functions as a splicing coactivator and 3'-end cleavage-stimulatory factor. It is also a component of the splicing-dependent exon-junction complex (EJC), which has been implicated in coupling of pre-mRNA splicing with mRNA turnover and mRNA export. We have investigated whether the association of SRm160 with the EJC is important for efficient 3'-end cleavage. The EJC components RNPS1, REF, UAP56, and Y14 interact with SRm160. However, when these factors were tethered to transcripts, only SRm160 and RNPS1 stimulated 3'-end cleavage. Whereas SRm160 stimulated cleavage to a similar extent in the presence or absence of an active intron, stimulation of 3'-end cleavage by tethered RNPS1 is dependent on an active intron. Assembly of an EJC adjacent to the cleavage and polyadenylation signal in vitro did not significantly affect cleavage efficiency. These results suggest that SRm160 stimulates cleavage independently of its association with EJC components and that the cleavage-stimulatory activity of RNPS1 may be an indirect consequence of its ability to stimulate splicing. Using RNA interference (RNAi) in Caenorhabditis elegans, we determined whether interactions between SRm160 and the cleavage machinery are important in a whole organism context. Simultaneous RNAi of SRm160 and the cleavage factor CstF-50 (Cleavage stimulation factor 50-kDa subunit) resulted in late embryonic developmental arrest. In contrast, RNAi of CstF-50 in combination with RNPS1 or REFs did not result in an apparent phenotype. Our combined results provide evidence for an evolutionarily conserved interaction between SRm160 and the 3'-end cleavage machinery that functions independently of EJC formation.
Affiliation(s)
- Susan McCracken
- Banting and Best Department of Medical Research, C. H. Best Institute, University of Toronto, Toronto, Ontario M5G 1L6, Canada
|
33
|
Heath MJ, Derebail SS, Gorelick RJ, DeStefano JJ. Differing roles of the N- and C-terminal zinc fingers in human immunodeficiency virus nucleocapsid protein-enhanced nucleic acid annealing. J Biol Chem 2003; 278:30755-63. [PMID: 12783894 DOI: 10.1074/jbc.m303819200] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.5] [Indexed: 11/06/2022]
Abstract
The replication process of human immunodeficiency virus requires a number of nucleic acid annealing steps facilitated by the hybridization and helix-destabilizing activities of human immunodeficiency virus nucleocapsid (NC) protein. NC contains two CCHC zinc finger motifs numbered 1 and 2 from the N terminus. The amino acids surrounding the CCHC residues differ between the two zinc fingers. Assays were performed to investigate the activities of the fingers by determining the effect of mutant and wild-type proteins on annealing of 42-nucleotide RNA and DNA complements. The mutants 1.1 NC and 2.2 NC had duplications of the N- and C-terminal zinc fingers in positions 1 and 2. The mutant 2.1 NC had the native zinc fingers with their positions switched. Annealing assays were completed with unstructured and highly structured oligonucleotide complements. 2.2 NC had a near wild-type level of annealing of unstructured nucleic acids, whereas it was completely unable to stimulate annealing of highly structured nucleic acids. In contrast, 1.1 NC was able to stimulate annealing of both unstructured and structured substrates, but to a lesser degree than the wild-type protein. Results suggest that finger 1 has a greater role in unfolding of strong secondary structures, whereas finger 2 serves an accessory role that leads to a further increase in the rate of annealing.
Affiliation(s)
- Megan J Heath
- Department of Cell Biology and Molecular Genetics, University of Maryland College Park, College Park, Maryland 20742, USA
|
34
|
Villescas-Diaz G, Zacharias M. Sequence context dependence of tandem guanine:adenine mismatch conformations in RNA: a continuum solvent analysis. Biophys J 2003; 85:416-25. [PMID: 12829496 PMCID: PMC1303097 DOI: 10.1016/s0006-3495(03)74486-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Indexed: 11/19/2022]
Abstract
Guanine:adenine (G:A) mismatches and in particular tandem G:A (tG:A) mismatches are frequently observed in biological RNA molecules and can serve as sites for tertiary interaction, metal binding and protein recognition. Depending on the surrounding sequence, tG:A mismatches can adopt different basepairing topologies. In the sequence context (5'-) GGAC (tandem G:A in bold) a face-to-face (imino or Watson-Crick-like) pairing is preferred whereas in the CGAG context, G and A adopt a sheared arrangement. Systematic conformational searches with a generalized Born continuum model and molecular dynamics simulations including explicit water molecules and ions have been used to generate face-to-face and sheared tG:A mismatches in both CGAG and GGAC sequence contexts. Conformations from both approaches were evaluated using the same force field and a Poisson-Boltzmann continuum solvent model. Although the substate analysis predicted the sheared arrangement to be energetically preferred in both sequence contexts, a significantly greater preference for the sheared form was found for the CGAG context. In agreement with the experimental observation, the analysis of molecular dynamics trajectories indicated a preference for the sheared form in the case of the CGAG context and a preference for the face-to-face form in the case of the GGAC context. The computational studies made it possible to identify energetic contributions that stabilize or destabilize the face-to-face and sheared tandem mismatch topologies. The calculated nonpolar solvation and Lennard-Jones packing interaction were found to stabilize the sheared topology independent of the sequence context. Electrostatic contributions are predicted to make the most significant contribution to the sequence context dependence of the structural preference of tG:A mismatches.
|
35
|
Augustin MA, Reichert AS, Betat H, Huber R, Mörl M, Steegborn C. Crystal structure of the human CCA-adding enzyme: insights into template-independent polymerization. J Mol Biol 2003; 328:985-94. [PMID: 12729736 DOI: 10.1016/s0022-2836(03)00381-4] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.0] [Indexed: 11/18/2022]
Abstract
All tRNA molecules carry the invariant sequence CCA at their 3'-terminus for amino acid attachment. The post-transcriptional addition of CCA is carried out by ATP(CTP):tRNA nucleotidyltransferase, also called CCase. This enzyme catalyses a unique template-independent but sequence-specific nucleotide polymerization reaction. In order to reveal the molecular mechanism of this activity, we solved the crystal structure of human CCase by single isomorphous replacement. The structure reveals a four domain architecture with a cluster of conserved residues forming a positively charged cleft between the first two domains. Structural homology of the N-terminal CCase domain to other nucleotidyltransferases could be exploited for modeling a tRNA-substrate complex. The model places the tRNA 3'-end into the N-terminal nucleotidyltransferase site, close to a patch of conserved residues that provide the binding sites for CTP and ATP. Based on our results, we introduce a corkscrew model for CCA addition that includes a fixed active site and a traveling tRNA-binding region formed by flexible parts of the protein.
Affiliation(s)
- Martin A Augustin
- Max-Planck-Institut für Biochemie, Abteilung Strukturforschung, Am Klopferspitz 18A, D-82152 Martinsried, Germany.
|
36
|
Szymczyna BR, Bowman J, McCracken S, Pineda-Lucena A, Lu Y, Cox B, Lambermon M, Graveley BR, Arrowsmith CH, Blencowe BJ. Structure and function of the PWI motif: a novel nucleic acid-binding domain that facilitates pre-mRNA processing. Genes Dev 2003; 17:461-75. [PMID: 12600940 PMCID: PMC196000 DOI: 10.1101/gad.1060403] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.0] [Indexed: 01/19/2023]
Abstract
The PWI motif is a highly conserved domain of unknown function in the SRm160 splicing and 3'-end cleavage-stimulatory factor, as well as in several other known or putative pre-mRNA processing components. We show here that the PWI motif is a new type of RNA/DNA-binding domain that has an equal preference for single- and double-stranded nucleic acids. Deletion of the motif prevents SRm160 from binding RNA and stimulating 3'-end cleavage, and its substitution with a heterologous RNA-binding domain restores these functions. The NMR solution structure of the SRm160-PWI motif reveals a novel, four-helix bundle and represents the first example of an alpha-helical fold that can bind single-stranded (ss)RNA. Structure-guided mutagenesis indicates that the same surface is involved in RNA and DNA binding and requires the cooperative action of a highly conserved, adjacent basic region. Thus, the PWI motif is a novel type of nucleic acid-binding domain that likely has multiple important functions in pre-mRNA processing, including SRm160-dependent stimulation of 3'-end formation.
Affiliation(s)
- Blair R Szymczyna
- Ontario Cancer Institute, University of Toronto, Toronto, Ontario, Canada M5G 2M9
|
37
|
Da Poian AT, Johnson JE, Silva JL. Protein-RNA interactions and virus stability as probed by the dynamics of tryptophan side chains. J Biol Chem 2002; 277:47596-602. [PMID: 12359712 DOI: 10.1074/jbc.m209174200] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Indexed: 11/06/2022]
Abstract
The correlation between dynamics and stability of icosahedral viruses was studied by steady-state and time-resolved fluorescence approaches. We compared the environment and dynamics of tryptophan side chains of empty capsids and ribonucleoprotein particles of two icosahedral viruses from the comovirus group: cowpea mosaic virus (CPMV) and bean pod mottle virus (BPMV). We found a great difference between tryptophan fluorescence emission spectra of the ribonucleoprotein particles and the empty capsids of BPMV. For CPMV, time-resolved fluorescence revealed differences in the tryptophan environments of the capsid protein. The excited-state lifetimes of tryptophan residues were significantly modified by the presence of RNA in the capsid. More than half of the emission of the tryptophans in the ribonucleoprotein particles of CPMV originates from a single exponential decay that can be explained by a similar, nonpolar environment in the local structure of most of the tryptophans, even though they are physically located in different regions of the x-ray structure. CPMV particles without RNA lost this discrete component of emission. Anisotropy decay measurements demonstrated that tryptophans rotate faster in empty particles when compared with the ribonucleoprotein particles. The increased structural breathing facilitates the denaturation of the empty particles. Our studies bring new insights into the intricate interactions between protein and RNA where part of the missing structural information on the nucleic acid molecule is compensated for by the dynamics.
Affiliation(s)
- Andrea T Da Poian
- Departamento de Bioquímica Medica and Centro Nacional de Ressonancia Magnetica Nuclear de Macromoleculas, Instituto de Ciências Biomédicas, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-590, Brazil
|
38
|
Cox JC, Hayhurst A, Hesselberth J, Bayer TS, Georgiou G, Ellington AD. Automated selection of aptamers against protein targets translated in vitro: from gene to aptamer. Nucleic Acids Res 2002; 30:e108. [PMID: 12384610 PMCID: PMC137152 DOI: 10.1093/nar/gnf107] [Citation(s) in RCA: 139] [Impact Index Per Article: 6.3] [Indexed: 12/27/2022]
Abstract
Reagents for proteome research must of necessity be generated by high throughput methods. Aptamers are potentially useful as reagents to identify and quantitate individual proteins, yet are currently produced for the most part by manual selection procedures. We have developed automated selection methods, but must still individually purify protein targets. Therefore, we have attempted to select aptamers against protein targets generated by in vitro transcription and translation of individual genes. In order to specifically immobilize the protein targets for selection, they are also biotinylated in vitro. As a proof of this method, we have selected aptamers against translated human U1A, a component of the nuclear spliceosome. Selected sequences demonstrated exquisite mimicry of natural binding sequences and structures. These results not only reveal a potential path to the high throughput generation of aptamers, but also yield insights into the incredible specificity of the U1A protein for its natural RNA ligands.
Affiliation(s)
- J Colin Cox
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, TX 78712, USA
|
39
|
Postsynthetic guanidinylation of primary amino groups in the minor and major grooves of oligonucleotides. Tetrahedron Lett 2002. [DOI: 10.1016/s0040-4039(02)01732-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Indexed: 11/24/2022]
|
40
|
Katsamba PS, Bayramyan M, Haworth IS, Myszka DG, Laird-Offringa IA. Complex role of the beta 2-beta 3 loop in the interaction of U1A with U1 hairpin II RNA. J Biol Chem 2002; 277:33267-74. [PMID: 12082087 DOI: 10.1074/jbc.m200304200] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.2] [Indexed: 11/06/2022]
Abstract
RNA recognition motifs (RRMs) are characterized by highly conserved regions located centrally on a β-sheet, which forms the RNA binding surface. Variable flanking regions, such as the loop connecting β-strands 2 and 3, are thought to be important in determining the RNA-binding specificities of individual RRMs. The N-terminal RRM of the spliceosomal U1A protein mediates binding to an RNA hairpin (U1hpII) in the U1 small nuclear RNA. In this complex, the β2-β3 loop protrudes through the 10-nucleotide RNA loop. Shortening of the RNA loop strongly perturbs binding, suggesting that an optimal "fit" of the β2-β3 loop into the RNA loop is an important factor in complexation. To understand this interaction further, we mutated or deleted loop residues Lys50 and Met51, which protrude centrally into the RNA loop but do not make any direct contacts to the bases. Using BIACORE, we analyzed the ability of these U1A mutants to bind to wild type RNAs, or RNAs with shortened loops. Alanine replacement mutations only modestly affected binding to wild type U1hpII. Interestingly, simultaneous replacement of Lys50 and Met51 with alanine appeared to alleviate the loss of binding caused by shortening of the RNA loop. Deletion of Lys50 or Met51 caused a dramatic loss in stability of the U1A·U1hpII complex. However, deletion of both residues simultaneously was much less deleterious. Simulated annealing molecular dynamics analyses suggest this is due to the ability of this mutant to rearrange flanking amino acids to substitute for the two deleted residues. The double deletion mutant also exhibited substantially reduced negative effects of RNA loop shortening, suggesting the rearranged loop is better able to accommodate a short RNA loop. Our results indicate that one of the roles of the β2-β3 loop is to provide a steric fit into the RNA loop, thereby stabilizing the RNA·protein complex.
Affiliation(s)
- Phinikoula S Katsamba
- Norris Cancer Center/University of Southern California, Keck School of Medicine, Los Angeles, California 90089-9176, USA
|
41
|
Urbaneja MA, Wu M, Casas-Finet JR, Karpel RL. HIV-1 nucleocapsid protein as a nucleic acid chaperone: spectroscopic study of its helix-destabilizing properties, structural binding specificity, and annealing activity. J Mol Biol 2002; 318:749-64. [PMID: 12054820 DOI: 10.1016/s0022-2836(02)00043-8] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.2] [Indexed: 10/16/2022]
Abstract
Assembly of infectious retroviral particles involves recognition of specific sequences on the viral RNA by the nucleocapsid (NC) domain of the Gag polyprotein, and subsequent stoichiometric binding of the processed NC protein along the entire length of the RNA. NC proteins also act as nucleic acid chaperones. They accelerate nucleic acid hybridization and strand exchange, which may be critical during the initial stages of reverse transcription. In order to better understand these properties, we have studied the nucleic acid helix-destabilizing (t(m)-depressing) and binding activities of HIV-1 NCp7 protein with a variety of substrates, and the real-time kinetics of NC-induced strand exchange. At low ionic strength (0.01 M Na phosphate, pH 7.0) and saturating levels of protein, NCp7 displays moderate helix-destabilizing activity on double-stranded DNA. Saturating levels of NCp7 lowered the t(m) of a synthetic 28 base-pair 28(+)/28(-) oligonucleotide duplex by about 10 deg. C (51 to 41 degrees C). The presence of single-stranded calf thymus DNA (equimolar with duplex) eliminated the t(m) depression, whereas double-stranded calf thymus DNA only altered the t(m) of the 28-mer duplex by about 2 deg. C. Similar effects were seen with duplexes with single-stranded overhangs or internal single-stranded gaps. Binding experiments utilizing intrinsic tryptophan quenching indicated significant affinity (K(d) about 0.1 microM) for both single-stranded and double-stranded forms of the 28-mer in 0.01 M sodium phosphate at 25 degrees C, although long-chain (calf thymus double-stranded) DNA displayed a much lower affinity. The effects of NCp7 on the kinetics of nucleic acid annealing, strand exchange, and strand displacement were determined by use of oligonucleotides with end-labeled fluorophores serving as donor-acceptor pairs. NCp7 accelerated all these reactions. In the strand exchange reaction, an imperfect duplex, 28(+)/21(-), was reacted with a perfect complement, 28(-). The kinetics of 28(+)/28(-) annealing in this reaction did not conform to a simple bimolecular model, but could be well fit to the sum of two exponential decays. Addition of stoichiometric levels of NCp7 increased the rate constants of both components, and significantly increased the fraction of exchange associated with the rapid process. Increasing levels of 28(-) also increased the rapid fraction, as well as the rapid rate constant. This concentration dependence indicates that, although the kinetic decays appear biexponential, at least one of the steps is bimolecular. Simple annealing reactions, 28(+) with 28(-), could be fit to single-exponential decays, and their magnitudes in the presence of NCp7 were comparable to the rapid step of annealing observed for exchange reactions, suggesting that this step is connected with annealing. Strand dissociation during exchange was monitored by placing the fluorescent acceptor on the 21(-) strand. The results, though complex, suggest that the slow step of exchange is largely associated with the dissociation of the shorter oligonucleotide. Analogous experiments were performed with variants of these oligonucleotides, and the results are in line with the 28(+)/21(-)/28(-) experiments. On the basis of an analysis of the effect of increasing levels of 28(-) on the formation of the perfect 28 bp duplex from the imperfect duplex, we propose that NCp7 forms a ternary complex intermediate with imperfect duplex and 28(-), and suggest several ways by which such an intermediate would facilitate strand exchange.
Affiliation(s)
- María A Urbaneja
- AIDS Vaccine Program, SAIC-Frederick, Building 535-424, National Cancer Institute-Frederick, Frederick, MD 21702-1201, USA
|
42
|
Burkhardt C, Zacharias M. Modelling ion binding to AA platform motifs in RNA: a continuum solvent study including conformational adaptation. Nucleic Acids Res 2001; 29:3910-8. [PMID: 11574672 PMCID: PMC60250 DOI: 10.1093/nar/29.19.3910] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Indexed: 11/15/2022]
Abstract
Binding of monovalent and divalent cations to two adenine-adenine platform structures from the Tetrahymena group I intron ribozyme has been studied using continuum solvent models based on the generalised Born and the finite-difference Poisson-Boltzmann approaches. The adenine-adenine platform RNA motif forms an experimentally characterised monovalent ion binding site important for ribozyme folding and function. Qualitative agreement between calculated and experimental ion placements and binding selectivity was obtained. The inclusion of solvation effects turned out to be important to obtain low energy structures and ion binding placements in agreement with the experiment. The calculations indicate that differences in solvation of the isolated ions contribute to the calculated ion binding preference. However, Coulomb attraction and van der Waals interactions due to ion size differences and RNA conformational adaptation also influence the calculated ion binding affinity. The calculated alkali ion binding selectivity for both platforms followed the order K(+) > Na(+) > Rb(+) > Cs(+) > Li(+) (Eisenman series VI) in the case of allowing RNA conformational relaxation during docking. With rigid RNA an Eisenman series V was obtained (K(+) > Rb(+) > Na(+) > Cs(+) > Li(+)). Systematic energy minimisation docking simulations starting from several hundred initial placements of potassium ions on the surface of platform containing RNA fragments identified a coordination geometry in agreement with the experiment as the lowest energy binding site. The approach could be helpful to identify putative ion binding sites in nucleic acid structures determined at low resolution or with experimental methods that do not allow identification of ion binding sites.
Affiliation(s)
- C Burkhardt
- AG Theoretische Biophysik, Institut für Molekulare Biotechnologie, Beutenbergstrasse 11, D-07745 Jena, Germany
|
43
|
Wang W, Donini O, Reyes CM, Kollman PA. Biomolecular simulations: recent developments in force fields, simulations of enzyme catalysis, protein-ligand, protein-protein, and protein-nucleic acid noncovalent interactions. Annu Rev Biophys Biomol Struct 2001; 30:211-43. [PMID: 11340059 DOI: 10.1146/annurev.biophys.30.1.211] [Citation(s) in RCA: 389] [Impact Index Per Article: 16.9] [Indexed: 12/27/2022]
Abstract
Computer modeling has been developed and widely applied in studying molecules of biological interest. The force field is the cornerstone of computer simulations, and many force fields have been developed and successfully applied in these simulations. Two interesting areas are (a) studying enzyme catalytic mechanisms using a combination of quantum mechanics and molecular mechanics, and (b) studying macromolecular dynamics and interactions using molecular dynamics (MD) and free energy (FE) calculation methods. Enzyme catalysis involves forming and breaking of covalent bonds and requires the use of quantum mechanics. Noncovalent interactions appear ubiquitously in biology, but here we confine ourselves to review only noncovalent interactions between protein and protein, protein and ligand, and protein and nucleic acids.
Affiliation(s)
- W Wang
- Graduate Group in Biophysics, University of California San Francisco, California 94143, USA.
|
44
|
Collins BM, Harrop SJ, Kornfeld GD, Dawes IW, Curmi PM, Mabbutt BC. Crystal structure of a heptameric Sm-like protein complex from archaea: implications for the structure and evolution of snRNPs. J Mol Biol 2001; 309:915-23. [PMID: 11399068 DOI: 10.1006/jmbi.2001.4693] [Citation(s) in RCA: 79] [Impact Index Per Article: 3.4] [Indexed: 01/30/2023]
Abstract
The Sm/Lsm proteins associate with small nuclear RNA to form the core of small nuclear ribonucleoproteins, required for processes as diverse as pre-mRNA splicing, mRNA degradation and telomere formation. The Lsm proteins from archaea are likely to represent the ancestral Sm/Lsm domain. Here, we present the crystal structure of the Lsm alpha protein from the thermophilic archaeon Methanobacterium thermoautotrophicum at 2.0 Å resolution. The Lsm alpha protein crystallizes as a heptameric ring comprised of seven identical subunits interacting via beta-strand pairing and hydrophobic interactions. The heptamer can be viewed as a propeller-like structure in which each blade consists of a seven-stranded antiparallel beta-sheet formed from neighbouring subunits. There are seven slots on the inner surface of the heptamer ring, each of which is lined by Asp, Asn and Arg residues that are highly conserved in the Sm/Lsm sequences. These conserved slots are likely to form the RNA-binding site. In archaea, the gene encoding Lsm alpha is located next to the L37e ribosomal protein gene in a putative operon, suggesting a role for the Lsm alpha complex in ribosome function or biogenesis.
Affiliation(s)
- B M Collins
- Department of Chemistry, Macquarie University, NSW 2109, Australia
|
45
|
Geese WJ, Waring RB. A comprehensive characterization of a group IB intron and its encoded maturase reveals that protein-assisted splicing requires an almost intact intron RNA. J Mol Biol 2001; 308:609-22. [PMID: 11350164 DOI: 10.1006/jmbi.2001.4609] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Indexed: 11/22/2022]
Abstract
The group I intron (AnCOB) of the mitochondrial apocytochrome b gene from Aspergillus nidulans encodes a bi-functional maturase protein that is also a DNA endonuclease. Although the AnCOB intron self-splices, the encoded maturase protein greatly facilitates splicing, in part, by stabilizing RNA tertiary structure. To determine their role in self-splicing and in protein-assisted splicing, several peripheral RNA sub-domains in the 313 nucleotide intron were deleted (P2, P9, P9.1) or truncated (P5ab, P6a). The sequence in two helices (P2 and P9) was also inverted. Except for P9, the deleted regions are not highly conserved among group I introns and are often dispensable for catalytic activity. Nevertheless, despite the very tight binding of AnCOB RNA to the maturase and the high activity of the bimolecular complex (the rate of 5' splice-site cleavage was >20 min(-1) with guanosine as the cofactor), the intron was surprisingly sensitive to these modifications. Several mutations inactivated splicing completely and virtually all impaired splicing to varying degrees. Mutants containing comparatively small deletions in various regions of the intron significantly decreased binding affinity (generally >10(4)-fold), indicating that none of the domains that remained constitutes the primary recognition site of the maturase. The data argue that tight binding requires tertiary interactions that can be maintained by only a relatively intact intron RNA, and that the binding mechanism of the maturase differs from those of two other well-characterized group I intron splicing factors, CYT-18 and Cpb2. A model is proposed in which the protein promotes widespread cooperative folding of an RNA lacking extensive initial tertiary structure.
Affiliation(s)
- W J Geese
- Department of Biology, Temple University, Philadelphia, PA 19122, USA
|
46
|
Jones S, Daley DT, Luscombe NM, Berman HM, Thornton JM. Protein-RNA interactions: a structural analysis. Nucleic Acids Res 2001; 29:943-54. [PMID: 11160927 PMCID: PMC29619 DOI: 10.1093/nar/29.4.943] [Citation(s) in RCA: 314] [Impact Index Per Article: 13.7] [Indexed: 11/13/2022]
Abstract
A detailed computational analysis of 32 protein-RNA complexes is presented. A number of physical and chemical properties of the intermolecular interfaces are calculated and compared with those observed in protein-double-stranded DNA and protein-single-stranded DNA complexes. The interface properties of the protein-RNA complexes reveal the diverse nature of the binding sites. van der Waals contacts played a more prevalent role than hydrogen bond contacts, and preferential binding to guanine and uracil was observed. The positively charged residue, arginine, and the single aromatic residues, phenylalanine and tyrosine, all played key roles in the RNA binding sites. A comparison between protein-RNA and protein-DNA complexes showed that whilst base and backbone contacts (both hydrogen bonding and van der Waals) were observed with equal frequency in the protein-RNA complexes, backbone contacts were more dominant in the protein-DNA complexes. Although similar modes of secondary structure interactions have been observed in RNA and DNA binding proteins, the current analysis emphasises the differences that exist between the two types of nucleic acid binding protein at the atomic contact level.
Affiliation(s)
- S Jones
- Biomolecular Structure and Modelling Unit, Department of Biochemistry and Molecular Biology, University College, Gower Street, London WC1E 6BT, UK
|
47
|
Tok JB, Des Jean RC, Fenker J. Binding of a cyclic BIV beta-Tat peptide with its TAR RNA construct. Bioorg Med Chem Lett 2001; 11:43-6. [PMID: 11140729 DOI: 10.1016/s0960-894x(00)00591-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Indexed: 11/17/2022]
Abstract
The ability of RNA structures to adopt diverse yet complex tertiary structures has resulted in numerous fascinating RNA-protein recognition events. It was recently reported that a close relative of the HIV Rev peptide, namely a 17 residue Tat peptide from bovine immunodeficiency virus (BIV), is able to bind to the 28 nucleotide BIV TAR RNA construct. Here we report that by simply converting the 17 residue beta-ribbon peptide structure to a 19 residue cyclopeptide, the binding affinity (Kd) of the resulting cyclopeptide to the TAR RNA target, observed by fluorescence binding study, was enhanced approximately 5-fold.
MESH Headings
- Amino Acid Sequence
- Animals
- Base Sequence
- Cattle
- Fluorescein
- Fluorescence
- Gene Products, tat/chemistry
- Gene Products, tat/metabolism
- HIV Long Terminal Repeat/genetics
- Immunodeficiency Virus, Bovine/genetics
- Immunodeficiency Virus, Bovine/metabolism
- Models, Molecular
- Molecular Sequence Data
- Nucleic Acid Conformation
- Peptide Fragments/chemistry
- Peptide Fragments/metabolism
- Peptides, Cyclic/chemistry
- Peptides, Cyclic/metabolism
- Protein Binding
- RNA, Viral/chemistry
- RNA, Viral/genetics
- RNA, Viral/metabolism
- RNA-Binding Proteins/chemistry
- RNA-Binding Proteins/metabolism
- Thermodynamics
Affiliation(s)
- J B Tok
- Department of Chemistry, Indiana University-Purdue University, Fort Wayne, 46805, USA.
|
48
|
Abstract
The aminoacyl-tRNA synthetases are an ancient group of enzymes that catalyze the covalent attachment of an amino acid to its cognate transfer RNA. The question of specificity, that is, how each synthetase selects the correct individual or isoacceptor set of tRNAs for each amino acid, has been referred to as the second genetic code. A wealth of structural, biochemical, and genetic data on this subject has accumulated over the past 40 years. Although there are now crystal structures of sixteen of the twenty synthetases from various species, there are only a few high resolution structures of synthetases complexed with cognate tRNAs. Here we review briefly the structural information available for synthetases, and focus on the structural features of tRNA that may be used for recognition. Finally, we explore in detail the insights into specific recognition gained from classical and atomic group mutagenesis experiments performed with tRNAs, tRNA fragments, and small RNAs mimicking portions of tRNAs.
Affiliation(s)
- P J Beuning
- Department of Chemistry, University of Minnesota, Minneapolis, MN 55455, USA
|
49
|
Reyes CM, Kollman PA. Structure and thermodynamics of RNA-protein binding: using molecular dynamics and free energy analyses to calculate the free energies of binding and conformational change. J Mol Biol 2000; 297:1145-58. [PMID: 10764579 DOI: 10.1006/jmbi.2000.3629] [Citation(s) in RCA: 145] [Impact Index Per Article: 6.0] [Indexed: 11/22/2022]
Abstract
An adaptive binding mechanism, requiring large conformational rearrangements, occurs commonly with many RNA-protein associations. To explore this process of reorganization, we have investigated the conformational change upon spliceosomal U1A-RNA binding with molecular dynamics (MD) simulations and free energy analyses. We computed the energetic cost of conformational change in U1A-hairpin and U1A-internal loop binding using a hybrid of molecular mechanics and continuum solvent methods. Encouragingly, in all four free energy comparisons (two slightly different proteins, two different RNAs), the free macromolecule was more stable than the bound form by the physically reasonable value of approximately 10 kcal/mol. We calculated the absolute binding free energies for both complexes to be in the same range as that found experimentally.
Affiliation(s)
- C M Reyes
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94122-0446, USA
|
50
|
Abstract
The powerful explanatory paradigm of molecular biology, which requires form to co-evolve with function, has again proven successful: over the past two decades, a wealth of biological functions has been uncovered for RNA. Previously considered a mere mediator of the genetic code, RNA is now acknowledged as a key player in a wide variety of cellular processes. Along with the discovery of novel biological functions of RNA molecules, a number of RNA three-dimensional structures have been solved that demonstrate the molecular adaptability that allows RNA to take part in these functions. A distinct repertoire of molecular motifs provides a basis for the assembly of complex RNA tertiary architectures.
Affiliation(s)
- T Hermann
- Cellular Biochemistry and Biophysics Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10021, USA.
|