1
|
Abstract
Local folding in mRNAs is closely associated w ith biological functions. In this study, we reveal the whole distribution of local thermodynamic stability in the complete genome of the poliovirus P3/Leon/37 and the single-stranded RNA sequences that corresponds to the nucleotide sequence of the complete genome sequence (1 667 867 bp) of Helicobacter pylori (H. pylori) strain 26695. Local thermodynamic stability in the RNA sequences is measured by two standard z -scores, significance score and stability score. To estimate the distribution of thermodynamic stability, a model based on the non-central Student's t distribution has been developed. Significant patterns of extremes that are either much more stable or unstable than expected by chance are detected. Our results indicate that the highly stable and statistically more significant folding regions are predominantly in non-coding sequences in the two genome sequences. Moreover, the highly unstable folding regions, on the contrary, are predominantly in the protein coding sequences of H. pylori. The observed differences across the complete genomic sequences are statistically very significant by a chi2-test. These extreme patterns may be useful in searching for target sequences for long-chain antisense RNA and for locating potential RNA functional elements involved in the regulation of gene expression including translation, mRNA localization and metabolism.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Experimental and Computational Biology, Division of Basic Sciences, National Cancer Institute, NIH, Bldg 469, Room 151, Frederick, MD 21702, USA.
| | | | | | | |
Collapse
|
2
|
Abstract
The translational control involving internal ribosome binding occurs in poliovirus (PV), human rhinoviruses (HRV), encephalomyocarditis virus (EMCV), foot-and-mouth disease virus (FMDV), and hepatitis A virus (HAV). Internal ribosome binding utilizes cis-acting genetic elements of approximately 450 nucleotides (nt) termed the internal ribosome entry sites (IRES) found in these picornaviral 5'-untranslated region (5'UTR). Although these IRES elements are quite different in their primary sequence, a similar folding structure with a conserved 3' structural core exists in the IRES. Phylogenetic analysis and RNA folding of the 5' UTR of picornaviruses, including PV types 1-3, coxsackievirus types A and B, swine vesicular disease virus, echoviruses, enteroviruses (human and bovine), HRV, HAV, EMCV, mengovirus, Theiler's murine encephalomyelitis viruses, FMDV, and equine rhinoviruses, indicates that the predicted conserved structural core is indeed a general structural feature for all members of the picornavirus family. The evolution of a common structural core likely occurred by the gradual addition or deletion of structural domains and elements to preserve a similar tertiary structure that facilitates the utilization of the IRES in specific host-cell environments.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, Division of Cancer Biology Diagnosis and Centers, National Cancer Institute, NIH, Frederick, Maryland 21702, USA.
| | | |
Collapse
|
3
|
Le SY, Liu WM, Maizel JV. Phylogenetic evidence for the improved RNA higher-order structure in internal ribosome entry sequences of HCV and pestiviruses. Virus Genes 2001; 17:279-95. [PMID: 9926403 DOI: 10.1023/a:1008073905920] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The strong requirement for a small segment of the 5'-proximal coding sequence of hepatitis C virus (HCV) is one of the most remarkable features in the internal initiation of HCV mRNA translation. Phylogenetic analysis and RNA folding indicate a common RNA structure of the 5' untranslated region (UTR) of HCV and the animal pestiviruses, including HCV types 1-11, bovine viral diarrhea (BVDV), border disease virus (BDV) and hog cholera (HoCV). Although the common RNA structure shares similar features to that proposed for the internal ribosome entry sequence (IRES) of picornavirus, phylogenetic evidence suggests four new tertiary interactions between conserved terminal hairpin loops and between the terminal hairpin loop of F2b and the short coding sequence for HCV and pestiviruses. We suggest that the higher-order structures of IRES cis-acting elements for HCV and animal pestivirus are composed of stem-loop structures B-C, domains E-H, stem-loop structure J and four additional tertiary interactions. The common structure of IRES elements for these viruses forms a compact structure by these tertiary interactions and stem stacking. The active structural core is centered in the junction domain of E-H that is also conserved in all members of picornaviruses. Our model suggests that the requirement for a small segment of the 5' coding sequence is to form the distinct tertiary structure that facilitates the cis-acting function of the HCV IRES in the internal initiation of the translational control.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Experimental and Computational Biology, National Cancer Institute, NIH, Frederick, Maryland 21702, USA.
| | | | | |
Collapse
|
4
|
Abstract
The finite element method (FEM) has been extensively used in evaluating the interfacial status of biomaterials. We used FEM to explore the microscopic debonding mechanism of the dentin/hybrid layer/resin adhesive interface. The stress status of the local material was used as an index to judge whether the adhesive interface would develop a debonding mechanism. To generate the local stress concentration, the thermal boundary condition was applied to the model which has the phenomenon of the coefficient of thermal expansion (CTE) mismatch. The thermal boundary condition was used to emulute a previous study conducted with a laser thermoacoustic technique (LTAT). The materials, Scotchbond MP, Optibond, and Tenure bonding systems, used in the previous experiment were also tested in this study. The results show that interfacial debonding in the finite element model occurred through the hybrid layer for both the Scotchbond MP and Tenure systems, as well as within the adhesive layer itself for the Optibond system. These findings are compatible with observations by SEM obtained by LTAT. Another transformed model was created to test the "elastic cavity wall" concept. The result also confirms the importance of the elastic cavity wall concept. These compatible results between FEM and LTAT indicate that FEM can be a very useful supplement to thermoacoustic testing.
Collapse
Affiliation(s)
- S Y Le
- Graduate Institute of Oral Rehabilitation Sciences, Taipei Medical College, Taiwan.
| | | | | | | | | | | | | |
Collapse
|
5
|
Abstract
Human endogenous retrovirus K (HERV-K) is the name given to an approximately 30-million-year-old family of endogenous retroviruses present at >50 copies per haploid human genome. Previously, the HERV-K were shown to encode a nuclear RNA export factor, termed K-Rev, that is the functional equivalent of the H-Rev protein encoded by human immunodeficiency virus type 1. HERV-K was also shown to contain a cis-acting target element, the HERV-K Rev response element (K-RRE), that allowed the nuclear export of linked RNA transcripts in the presence of either K-Rev or H-Rev. Here, we demonstrate that the functionally defined K-RRE coincides with a statistically highly significant unusual RNA folding region and present a potential RNA secondary structure for the approximately 416-nt K-RRE. Both in vitro and in vivo assays of sequence specific RNA binding were used to map two primary binding sites for K-Rev, and one primary binding site for H-Rev, within the K-RRE. Of note, all three binding sites map to discrete predicted RNA stem-loop subdomains within the larger K-RRE structure. Although almost the entire 416-nt K-RRE was required for the activation of nuclear RNA export in cells expressing K-Rev, mutational inactivation of the binding sites for K-Rev resulted in the selective loss of the K-RRE response to K-Rev but not to H-Rev. Together, these data strongly suggest that the K-RRE, like the H-RRE, coincides with an extensive RNA secondary structure and identify specific sites within the K-RRE that can recruit either K-Rev or H-Rev to HERV-K RNA transcripts.
Collapse
Affiliation(s)
- J Yang
- Department of Genetics and Howard Hughes Medical Institute, Duke University Medical Center, Durham, North Carolina 27710, USA
| | | | | | | |
Collapse
|
6
|
Chen S, Le SY, Newton DL, Maizel JV, Rybak SM. A gender-specific mRNA encoding a cytotoxic ribonuclease contains a 3' UTR of unusual length and structure. Nucleic Acids Res 2000; 28:2375-82. [PMID: 10871370 PMCID: PMC102719 DOI: 10.1093/nar/28.12.2375] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
A cDNA (2855 nt) encoding a putative cytotoxic ribonuclease (rapLR1) related to the antitumor protein onconase was cloned from a library derived from the liver of gravid female amphibian Rana pipiens. The cDNA was mainly comprised (83%) of 3' untranslated region (UTR). Secondary structure analysis predicted two unusual folding regions (UFRs) in the RNA 3' UTR. Two of these regions (711-1442 and 1877-2130 nt) contained remarkable, stalk-like, stem-loop structures greater than 38 and 12 standard deviations more stable than by chance, respectively. Secondary structure modeling demonstrated similar structures in the 3' UTRs of other species at low frequencies (0.01-0.3%). The size of the rapLR1 cDNA corresponded to the major hybridizing RNA cross-reactive with a genomic clone encoding onconase (3.6 kb). The transcript was found only in liver mRNA from female frogs. In contrast, immunoreactive onconase protein was detected only in oocytes. Deletion of the 3' UTR facilitated the in vitro translation of the rapLR1 cDNA. Taken together these results suggest that these unusual UFRs may affect mRNA metabolism and/or translation.
Collapse
Affiliation(s)
- S Chen
- Laboratory of Experimental and Computational Biology, Division of Basic Science, SAIC Frederick, National Cancer Institute-Frederick Cancer Research and Development Center, Frederick, MD 21702, USA
| | | | | | | | | |
Collapse
|
7
|
Pozner A, Goldenberg D, Negreanu V, Le SY, Elroy-Stein O, Levanon D, Groner Y. Transcription-coupled translation control of AML1/RUNX1 is mediated by cap- and internal ribosome entry site-dependent mechanisms. Mol Cell Biol 2000; 20:2297-307. [PMID: 10713153 PMCID: PMC85390 DOI: 10.1128/mcb.20.7.2297-2307.2000] [Citation(s) in RCA: 106] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
AML1/RUNX1 belongs to the runt domain transcription factors that are important regulators of hematopoiesis and osteogenesis. Expression of AML1 is regulated at the level of transcription by two promoters, distal (D) and proximal (P), that give rise to mRNAs bearing two distinct 5' untranslated regions (5'UTRs) (D-UTR and P-UTR). Here we show that these 5'UTRs act as translation regulators in vivo. AML1 mRNAs bearing the uncommonly long (1,631-bp) P-UTR are poorly translated, whereas those with the shorter (452-bp) D-UTR are readily translated. The low translational efficiency of the P-UTR is attributed to its length and the cis-acting elements along it. Transfections and in vitro assays with bicistronic constructs demonstrate that the D-UTR mediates cap-dependent translation whereas the P-UTR mediates cap-independent translation and contains a functional internal ribosome entry site (IRES). The IRES-containing bicistronic constructs are more active in hematopoietic cell lines that normally express the P-UTR-containing mRNAs. Furthermore, we show that the IRES-dependent translation increases during megakaryocytic differentiation but not during erythroid differentiation, of K562 cells. These results strongly suggest that the function of the P-UTR IRES-dependent translation in vivo is to tightly regulate the translation of AML1 mRNAs. The data show that AML1 expression is regulated through usage of alternative promoters coupled with IRES-mediated translation control. This IRES-mediated translation regulation adds an important new dimension to the fine-tuned control of AML1 expression.
Collapse
Affiliation(s)
- A Pozner
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76000, Israel
| | | | | | | | | | | | | |
Collapse
|
8
|
Abstract
In this study we apply a genetic algorithm to a set of RNA sequences to find common RNA secondary structures. Our method is a three-step procedure. At the first stage of the procedure for each sequence, a genetic algorithm is used to optimize the structures in a population to a certain degree of stability. In this step, the free energy of a structure is the fitness criterion for the algorithm. Next, for each structure, we define a measure of structural conservation with respect to those in other sequences. We use this measure in a genetic algorithm to improve the structural similarity among sequences for the structures in the population of a sequence. Finally, we select those structures satisfying certain conditions of structural stability and similarity as predicted common structures for a set of RNA sequences. We have obtained satisfactory results from a set of tRNA, 5S rRNA, rev response elements (RRE) of HIV-1 and RRE of HIV-2/SIV, respectively.
Collapse
Affiliation(s)
- J H Chen
- Advanced Biomedical Computing Center, SAIC, NCI/FCRDC, Frederick, MD 21702, USA.
| | | | | |
Collapse
|
9
|
Sella O, Gerlitz G, Le SY, Elroy-Stein O. Differentiation-induced internal translation of c-sis mRNA: analysis of the cis elements and their differentiation-linked binding to the hnRNP C protein. Mol Cell Biol 1999; 19:5429-40. [PMID: 10409733 PMCID: PMC84385 DOI: 10.1128/mcb.19.8.5429] [Citation(s) in RCA: 90] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
In previous reports we showed that the long 5' untranslated region (5' UTR) of c-sis, the gene encoding the B chain of platelet-derived growth factor, has translational modulating activity due to its differentiation-activated internal ribosomal entry site (D-IRES). Here we show that the 5' UTR contains three regions with a computer-predicted Y-shaped structure upstream of an AUG codon, each of which can confer some degree of internal translation by itself. In nondifferentiated cells, the entire 5' UTR is required for maximal basal IRES activity. The elements required for the differentiation-sensing ability (i.e., D-IRES) were mapped to a 630-nucleotide fragment within the central portion of the 5' UTR. Even though the region responsible for IRES activation is smaller, the full-length 5' UTR is capable of mediating the maximal translation efficiency in differentiated cells, since only the entire 5' UTR is able to confer the maximal basal IRES activity. Interestingly, a 43-kDa protein, identified as hnRNP C, binds in a differentiation-induced manner to the differentiation-sensing region. Using UV cross-linking experiments, we show that while hnRNP C is mainly a nuclear protein, its binding activity to the D-IRES is mostly nuclear in nondifferentiated cells, whereas in differentiated cells such binding activity is associated with the ribosomal fraction. Since the c-sis 5' UTR is a translational modulator in response to cellular changes, it seems that the large number of cross-talking structural entities and the interactions with regulated trans-acting factors are important for the strength of modulation in response to cellular changes. These characteristics may constitute the major difference between strong IRESs, such as those seen in some viruses, and IRESs that serve as translational modulators in response to developmental signals, such as that of c-sis.
Collapse
Affiliation(s)
- O Sella
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | | | | | | |
Collapse
|
10
|
Le SY, Chen JH, Pattabiraman N, Maizel JV. Ion-RNA interactions in the RNA pseudoknot of a ribosomal frameshifting site: molecular modeling studies. J Biomol Struct Dyn 1998; 16:1-11. [PMID: 9745889 DOI: 10.1080/07391102.1998.10508221] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
The three-dimensional (3-D) structure of a RNA pseudoknot that causes the efficient ribosomal frameshifting in the gag-pro region of mouse mammary tumor virus (MMTV) has been determined recently by nuclear magnetic resonance (NMR) studies. But since the structure refinement in the studies did not use metal ions and waters, it is not clear how metal ions participate in the stabilization of the pseudoknot, and what kind of ion-RNA interactions dominate in the tertiary contacts for the RNA pseudoknotting. Based on the reported structure data of the pseudoknot VPK of MMTV, we gradually refined the structure by restrained molecular dynamics (MD) using NMR distance restraints. Restrained MD simulation of the RNA pseudoknot was performed with sodium ions and water molecules. Our results are in good agreement with known NMR data and delineate the importance of the metal ion coordination in the stability of the pseudoknot. In the non-coaxially stacking pseudoknot, stem 1 (S1), stem 2 (S2), and the intervening A14 involves unconventional stacking of base pairs coordinated by Na+ and/or bridging water molecules. A6 and G7 of loop L1 make a perfect base stacking in the major groove and are further stabilized by coordinated Na+ ions and water molecules. The first 4-nucleotide (nt) ACUC of loop L2 form a sharp turn and the following 4-nt AAAA cross the minor groove of S1 and are steadied by interactions with the nucleotides of S , bridging water molecules and coordinated Na+ ions. Our studies suggest that the metal ion plays a crucial role in the RNA pseudoknotting of VPK. In the stacking interior of S1 and S2, the Na+ ion is positioned in the major groove and interacts directly with the carbonyl group O6 of G28 and carbonyl group O4 of U13 in the wobble base pair U13:G28. The ion-RNA interactions in MMTV VPK not only stabilize the RNA pseudoknot but also modify the electrostatic properties of the nucleotides at the critical parts of the pseudoknot VPK.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Experimental and Computational Biology, DBS, FCRDC, National Cancer Institute, NIH, Frederick, MD 21701, USA.
| | | | | | | |
Collapse
|
11
|
Akiri G, Nahari D, Finkelstein Y, Le SY, Elroy-Stein O, Levi BZ. Regulation of vascular endothelial growth factor (VEGF) expression is mediated by internal initiation of translation and alternative initiation of transcription. Oncogene 1998; 17:227-36. [PMID: 9674707 DOI: 10.1038/sj.onc.1202019] [Citation(s) in RCA: 200] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Vascular Endothelial Growth Factor (VEGF) is a very potent angiogenic agent that has a central role in normal physiological angiogenesis as well as in tumor angiogenesis. VEGF expression is induced by hypoxia and hypoglycemia, and thus was suggested to promote neovascularization during tumor outgrowth. Yet, the molecular mechanism that governs VEGF expression is not fully characterized. VEGF induction is attributed in part to increased levels of transcription and RNA stability. Previously, we demonstrated that the 5' Untranslated Region (5' UTR) of VEGF has an important regulatory role in its expression. VEGF has an exceptionally long 5' UTR (1038 bp) which is highly rich in G+C nucleotides. This suggests that secondary structures in the 5' UTR might be essential for VEGF expression through transcriptional and post-transcriptional control mechanisms, as demonstrated for other growth factors. In this communication, we provide evidence that a computer predicted Internal Ribosome Entry Site (IRES) structure is biologically active and is located at the 3' end of the UTR. In addition, the results demonstrate that an alternative transcriptional initiation site for VEGF exists in the 5' UTR of VEGF. This alternative initiation site is 633 bp downstream of the main transcription start site and the resulting 5' UTR includes mainly the IRES structure. Therefore, our results suggest that VEGF is subjected to regulation at either translational level through a mechanism of ribosome internal initiation and/or transcriptional level through alternative initiation.
Collapse
Affiliation(s)
- G Akiri
- Department of Food Engineering and Biotechnology, Technion, Haifa, Israel
| | | | | | | | | | | |
Collapse
|
12
|
Bernstein J, Sella O, Le SY, Elroy-Stein O. PDGF2/c-sis mRNA leader contains a differentiation-linked internal ribosomal entry site (D-IRES). J Biol Chem 1997; 272:9356-62. [PMID: 9083072 DOI: 10.1074/jbc.272.14.9356] [Citation(s) in RCA: 138] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
It has become clear that a given cell type can qualitatively and quantitatively affect the expression of the platelet-derived growth factor B (PDGF2/c-sis) gene at multiple levels. In a previous report, we showed that PDGF2/c-sis 5'-untranslated region has a translational modulating activity during megakaryocytic differentiation of K562 cells. This study points to the mechanism used for this translational modulation. The unusual mRNA leader, which imposes a major barrier to conventional ribosomal scanning, was found to contain an internal ribosomal entry site that becomes more potent in differentiating cells and was termed differentiation-linked internal ribosomal entry site (D-IRES). The D-IRES element defines a functional role for the cumbersome 1022-nucleotide-long mRNA leader and accounts for its uncommon, evolutionary conserved architecture. The differentiation-linked enhancement of internal translation, which provides an additional step to the fine tuning of PDGF2/c-sis gene expression, might be employed by numerous critical regulatory genes with unusual mRNA leaders and might have widespread implications for cellular growth and development.
Collapse
Affiliation(s)
- J Bernstein
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | | | | | | |
Collapse
|
13
|
Abstract
The 5'-non-translated regions (5'NTR) of human immunoglobulin heavy chain binding protein (BiP), Antennapedia (Antp) ofDrosophilaand human fibroblast growth factor 2 (FGF-2) mRNAs are reported to mediate translation initiation by an internal ribosome binding mechanism. In this study, we investigate predicted features of the higher order structures folded in these 5'NTR sequences. Statistical analyses of RNA folding detected a 92 nt unusual folding region (UFR) from 129 to 220, close to the initiator AUG in the BiP mRNA. Details of the structural analyses show that the UFR forms a Y-type stem-loop structure with an additional stem-loop in the 3'-end resembling the common structure core found in the internal ribosome entry site (IRES) elements of picornavirus. The Y-type structural motif is also conserved among a number of divergent BiP mRNAs. We also find two RNA elements in the 5'-leader sequence of human FGF-2. The first RNA element (96 nt) is 2 nt upstream of the first CUG start codon located in the reported IRES element of human FGF-2. The second (107 nt) is immediately upstream of the authentic initiator AUG of the main open reading frame. Intriguingly, the folded RNA structural motif in the two RNA elements is conserved in other members of FGF family and shares the same structural features as that found in the 5'NTR of divergent BiP mRNAs. We suggest that the common RNA structural motif conserved in the diverse BiP and FGF-2 mRNAs has a general function in the internal ribosome binding mechanism of cellular mRNAs.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, Division of Basic Sciences, National Cancer Institute, NIH, Building 469, Room 151, Frederick, MD 21702, USA.
| | | |
Collapse
|
14
|
Le SY, Siddiqui A, Maizel JV. A common structural core in the internal ribosome entry sites of picornavirus, hepatitis C virus, and pestivirus. Virus Genes 1996; 12:135-47. [PMID: 8879130 DOI: 10.1007/bf00572952] [Citation(s) in RCA: 52] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Cap-independent translations of viral RNAs of enteroviruses and rhinoviruses, cardioviruses and aphthoviruses, hepatitis A and C viruses (HAV and HCV), and pestivirus are initiated by the direct binding of 40S ribosomal subunits to a cis-acting genetic element termed the internal ribosome entry site (IRES) or ribosome landing pad (RLP) in the 5' noncoding region (5'NCR). RNA higher ordered structure models for these IRES elements were derived by a combined approach using thermodynamic RNA folding, Monte Carlo simulation, and phylogenetic comparative analysis. The structural differences among the three groups of picornaviruses arise not only from point mutations, but also from the addition or deletion of structural domains. However, a common core can be identified in the proposed structural models of these IRES elements from enteroviruses and rhinoviruses, cardioviruses and aphthoviruses, and HAV. The common structural core identified within the picornavirus IRES is also conserved in the 5'NCR of the divergent viruses, HCV, and pestiviruses. Furthermore, the proposed structural motif shares a structural feature similar to that observed in the catalytic core of the group 1 intron. The conserved structural motif from these divergent sequences that looks like the common core region of group 1 introns is probably a crucial element involved in the IRES-dependent translation.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, National Cancer Institute, NIH, Frederick, MD 21702, USA.
| | | | | |
Collapse
|
15
|
Wang C, Le SY, Ali N, Siddiqui A. An RNA pseudoknot is an essential structural element of the internal ribosome entry site located within the hepatitis C virus 5' noncoding region. RNA 1995; 1:526-537. [PMID: 7489514 PMCID: PMC1482419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Translation of the human hepatitis C virus (HCV) RNA genome occurs by a mechanism known as "internal ribosome entry." This unusual strategy of translation is employed by naturally uncapped picornaviral genomic RNAs and several cellular mRNAs. A common feature of these RNAs is a relatively long 5' noncoding region (NCR) that folds into a complex secondary structure harboring an internal ribosome entry site (IRES). Evidence derived from the use of dicistronic expression systems, combined with an extensive mutational analysis, demonstrated the presence of an IRES within the HCV 5'NCR. The results of our continued mutational analysis to map the critical structural elements of the HCV IRES has led to the identification of a pseudoknot structure upstream of the initiator AUG. The evidence presented in this study is based upon the mutational analysis of the putative pseudoknot structure. This is further substantiated by biochemical and enzymatic probing of the wild-type and mutant 5'NCR. Further, the thermodynamic calculations, based upon a modified RNAKNOT program, are consistent with the presence of a pseudoknot structure located upstream of the initiator AUG. Maintenance of this structural element is critical for internal initiation of translation. The pseudoknot structure in the 5'NCR represents a highly conserved feature of all HCV subtypes and members of the pestivirus family, including hog cholera virus and bovine viral diarrhea virus.
Collapse
Affiliation(s)
- C Wang
- Department of Microbiology, University of Colorado Health Sciences Center, Denver 80262, USA
| | | | | | | |
Collapse
|
16
|
Abstract
A statistically significant folding region is identified in the 5' untranslated region (5'-UTR) of hepatitis C virus (HCV), bovine viral diarrhea virus and hog cholera virus. This unusual folding region (UFR) detected in HCV encompasses 199 nucleotides (nt) and coincides with the reported internal ribosome entry site or ribosome landing pad (RLP), as determined by the 5' and 3' deletions [Tsukiyama-Kohara et al., J. Virol. 66 (1992) 1476-1483]. The RNA structure predicted in the UFR of HCV consists of a large stem-loop and a pseudoknot. The proposed structural model is consistent with RNase sensitivity studies [Brown et al., Nucleic Acids Res. 20 (1992) 5041-5045]. Moreover, the structure is highly conserved among these divergent HCV and pestivirus RNAs. The covariation of paired bases in the helical regions offers support for the proposed structural models. The pseudoknot predicted in these UFR shares a similar structural feature to those proposed in the RLP of cardioviruses, aphthoviruses and hepatitis A virus. Based on the common structural motif, a putative base-pairing model between HCV RNA and 18S rRNA, as well as pestiviral RNAs and 18S rRNA are suggested. Intriguingly, the proposed base-pairing models in this study are comparable to those proposed in picornaviruses in terms of their folded shape and location of the predicted complementary sequences between viral RNAs and 18S rRNA. Taken together, we suggest that the common base-pairing model between the UFR detected in the 5'-UTR of pestivirus and HCV and 18S rRNA have a general function in the internal initiation of cap-independent translation.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, National Cancer Institute, NIH, Frederick, MD 21702, USA
| | | | | |
Collapse
|
17
|
Abstract
We have developed a procedure, composed of a set of computer programs, for predicting common RNA structures of homologous sequences. Given a set of homologous RNAs, these programs perform a multiple sequence alignment, generate a list of possible helical stems that are thermodynamically favored in RNA folding from a selected individual sequence, establish a conserved stem list by inspecting the equivalent base pairings and/or conserved helical stems from the derived alignment of homologous RNAs, and build common RNA secondary structures with the maximum scores (i.e., compensatory base changes and number of base pairs, etc.). The approach is a combination of phylogenetic and thermodynamic methods and has been applied to the prediction of common folding structures of the 5' untranslated regions in a number of positive RNA viruses.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, National Cancer Institute, NIH, Frederick, Maryland 21702, USA
| | | | | |
Collapse
|
18
|
Le SY, Pattabiraman N, Maizel JV. RNA tertiary structure of the HIV RRE domain II containing non-Watson-Crick base pairs GG and GA: molecular modeling studies. Nucleic Acids Res 1994; 22:3966-76. [PMID: 7937119 PMCID: PMC308397 DOI: 10.1093/nar/22.19.3966] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
We have used molecular modeling techniques to model the RNA tertiary structure of the viral RNA element (referred to as domain II of Rev responsive element, RRE) bound by the Rev protein of HIV. In this study, the initial three-dimensional model was built from its established RNA secondary structure, including three non-Watson-Crick G:G, G:A and G:U base pairs. Molecular dynamics (MD) simulations were performed with hydrated or unhydrated sodium ions. Our results indicate that the non-Watson-Crick base pairs in the simulation with unhydrated sodium ions and water are more stable than those with hydrated sodium ions only. The RNA can maintain its compact double helical structure throughout the course of the MD simulations with water and unhydrated sodium ions, although the non-Watson-Crick base pairs and two bulge loops show much more flexibility and conformational distortion than the classical RNA helical region. The distinct distortion of the sugar-phosphate backbone significantly widens the RNA major groove so that the major groove is readily accessible for hydrogen bonding by specific Rev binding. This model emphasizes the importance of specific hydrogen bonding in the stabilization of the three-dimensional structure of the HIV Rev core binding element, not only between the nucleotide bases, but also among the ribose hydroxyls, phosphate anionic oxygens, base oxygens and nitrogens, and bridging water molecules. Moreover, our results suggest that sodium ions play an important role in the formation of base pairs G:G and G:A of the RRE by a manner similar to the arginine of the Rev-RRE complex.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, Division of Cancer Biology, Diagnosis and Centers, National Cancer Institute, NIH, Frederick, MD 21702
| | | | | |
Collapse
|
19
|
Abstract
Infectious bronchitis virus (IBV) mRNA3 encodes three small proteins, 3a, 3b, and 3c, at its 5' end. Recently, it was demonstrated that initiation of protein 3c is dependent on the upstream sequence. Monte Carlo simulations of RNA folding in this tricistronic mRNA3 indicate that a highly significant folding region occurs prior to the initiator AUG of 3c. The unusual folding region (UFR) of 265 nucleotides (nt) contains the coding sequences of proteins 3a and 3b. Details of the structural analyses show that five highly significant RNA stem-loops in the UFR can be modeled into a compact superstructure by the interaction of two predicted pseudoknot structures. The folded superstructure comprising nt 44 to 330, with additional 22 nt downstream from this UFR, is suggested to serve as a ribosome landing pad (or an internal ribosomal entry site) in the cap-independent translation of the 3c of IBV. Intriguingly, the proposed structural motif of this coronavirus shares structural features similar to those proposed in a number of picornavirus mRNAs. Based on the common structural features, a plausible base pairing model between mRNA3 and 18 S rRNA is suggested, which is consistent with a general mechanism for regulation of internal initiation described in many picornaviruses.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, National Cancer Institute, NIH, Frederick, Maryland 21702
| | | | | |
Collapse
|
20
|
Le SY, Chen JH, Sonenberg N, Maizel JV. Conserved tertiary structural elements in the 5' nontranslated region of cardiovirus, aphthovirus and hepatitis A virus RNAs. Nucleic Acids Res 1993; 21:2445-51. [PMID: 8389442 PMCID: PMC309545 DOI: 10.1093/nar/21.10.2445] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
Statistical analyses of RNA folding in 5' nontranslated regions (5'NTR) of encephalomyocarditis virus, Theiler's murine encephalomyelitis virus, foot-and-mouth disease virus, and hepatitis A virus indicate that two highly significant folding regions occur in the 5' and 3' portions of the 5'NTR. The conserved tertiary structural elements are predicted in the unusual folding regions (UFR) for these viral RNAs. The theoretical, common structural elements predicted in the 3' parts of the 5'NTR occur in a cis-acting element that is critical for internal ribosome binding. These structural motifs are expected to be highly significant from extensive Monte Carlo simulations. Nucleotides (nt) in the conserved single-stranded polypyrimidine tract for these RNAs are involved in a distinctively tertiary interaction that is located at about 15 nt prior to the initiator AUG. Intriguingly, the proposed common tertiary structure in this study shares a similar structural feature to that proposed in human enteroviruses and rhinoviruses. Based on these common structural features, plausible base pairing models between these viral RNAs and 18 S rRNA are suggested, which are consistent with a general mechanism for regulation of internal initiation of cap-independent translation.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, National Cancer Institute, NIH, Frederick, MD 21702
| | | | | | | |
Collapse
|
21
|
Abstract
In this paper we present a new method for predicting a set of RNA secondary structures that are thermodynamically favored in RNA folding simulations. This method uses a large number of 'simulated energy rules' (SER) generated by perturbing the free energy parameters derived experimentally within the range of the experimental errors. The structure with the lowest free energy is computed for each SER. Structural comparisons are used to avoid multiple generation of similar structures. Computed structures are evaluated using the energy distribution of the lowest free energy structures derived in the simulation. Predicted be graphically displayed with their occurring frequencies in the simulation by dot-plot representations. On average, about 90% of phylogenetic helixes in the known models of tRNA, Group I self-splicing intron, and Escherichia coli 16 S rRNA, were predicted using the method.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, National Cancer Institute, NIH, Frederick, MD 21702
| | | | | |
Collapse
|
22
|
Abstract
A combination of comparative sequence analysis and thermodynamic methods reveals the conservation of tertiary structure elements in the 5' untranslated region (UTR) of human enteroviruses and rhinoviruses. The predicted common structural elements occur in the 3' end of a segment that is critical for internal ribosome binding, termed "ribosome landing pad" (RLP), of polioviruses. Base pairings between highly conserved 17-nucleotide (nt) and 21-nt sequences in the 5' UTR of human enteroviruses and rhinoviruses constitute a predicted pseudoknot that is significantly more stable than those that can be formed from a large set of randomly shuffled sequences. A conserved single-stranded polypyrimidine tract is located between two conserved tertiary elements. R. Nicholson, J. Pelletier, S.-Y. Le, and N. Sonenberg (1991, J. Virol. 65, 5886-5894) demonstrated that the point mutations of 3-nt UUU out of an essential 4-nt pyrimidine stretch sequence UUUC abolished translation. Structural analysis of the mutant sequence indicates that small point mutations within the short polypyrimidine sequence would destroy the tertiary interaction in the predicted, highly ordered structure. The proposed common tertiary structure can offer experimentalists a model upon which to extend the interpretations for currently available data. Based on these structural features possible base-pairing models between human enteroviruses and 18 S rRNA and between human rhinoviruses and 18 S rRNA are proposed. The proposed common structure implicates a biological function for these sequences in translational initiation.
Collapse
MESH Headings
- Base Composition
- Base Sequence
- Binding Sites
- Conserved Sequence
- Enterovirus/genetics
- Enterovirus/metabolism
- Humans
- Models, Genetic
- Models, Molecular
- Molecular Sequence Data
- Nucleic Acid Conformation
- Poliovirus/genetics
- Poliovirus/metabolism
- RNA, Ribosomal, 18S/genetics
- RNA, Ribosomal, 18S/metabolism
- RNA, Viral/genetics
- RNA, Viral/metabolism
- Regulatory Sequences, Nucleic Acid
- Rhinovirus/genetics
- Rhinovirus/metabolism
- Sequence Homology, Nucleic Acid
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, National Cancer Institute, NIH, Frederick, Maryland 21702
| | | | | | | |
Collapse
|
23
|
Abstract
The RNA pseudoknot has been proposed as a significant structural motif in a wide range of biological processes of RNAs. A pseudoknot involves intramolecular pairing of bases in a hairpin loop with bases outside the stem of the loop to form a second stem and loop region. In this study, we propose a method for searching and predicting pseudoknots that are likely to have functional meaning. In our procedure, the orthodox hairpin structure involved in the pseudoknot is required to be both statistically significant and relatively stable to the others in the sequence. The bases outside the stem of the hairpin loop in the predicted pseudoknot are not entangled with any formation of a highly stable secondary structure in the sequence. Also, the predicted pseudoknot is significantly more stable than those that can be formed from a large set of scrambled sequences under the assumption that the energy contribution from a pseudoknot is proportional to the size of second loop region and planar energy contribution from second stem region. A number of functional pseudoknots that have been reported before can be identified and predicted from their sequences by our method.
Collapse
Affiliation(s)
- J H Chen
- Advanced Scientific Computer Laboratory, Program Resources, Inc., NCI/FCRF, Frederick, MD 21702
| | | | | |
Collapse
|
24
|
Le SY, Shapiro BA, Chen JH, Nussinov R, Maizel JV. RNA pseudoknots downstream of the frameshift sites of retroviruses. Genet Anal Tech Appl 1991; 8:191-205. [PMID: 1663382 PMCID: PMC7128882 DOI: 10.1016/1050-3862(91)90013-h] [Citation(s) in RCA: 29] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 04/15/1991] [Revised: 07/30/1991] [Accepted: 07/30/1991] [Indexed: 12/28/2022]
Abstract
RNA pseudoknot structural motifs could have implications for a wide range of biological processes of RNAs. In this study, the potential RNA pseudoknots just downstream from the known and suspected retroviral frame-shift sites were predicted in the Rous sarcoma virus, primate immunodeficiency viruses (HIV-1, HIV-2, and SIV), equine infectious anemia virus, visna virus, bovine leukemia virus, human T-cell leukemia virus (types I and II), mouse mammary tumor virus, Mason-Pfizer monkey virus, and simian SRV-1 type-D retrovirus. Also, the putative RNA pseudoknots were detected in the gag-pol overlaps of two retrotransposons of Drosophila, 17.6 and gypsy, and the mouse intracisternal A particle. For each sequence, the thermodynamic stability and statistical significance of the secondary structure involved in the predicted tertiary structure were assessed and compared. Our results show that the stem-loop structures in the pseudoknots are both thermodynamically highly stable and statistically significant relative to other such configurations that potentially occur in the gag-pol or gag-pro and pro-pol junction domains of these viruses (300 nucleotides upstream and downstream from the possible frameshift sites are included). Moreover, the structural features of the predicted pseudoknots following the frameshift site of pro-pol overlaps of the HTLV-1 and HTLV-2 retroviruses are structurally well conserved. The occurrence of eight compensatory base changes in the tertiary interaction of the two related sequences allow the conservation of their tertiary structures in spite of the sequence divergence. The results support the possible control mechanism for frameshifting proposed by Brierley et al. and Jacks et al.
Collapse
Affiliation(s)
- S Y Le
- Institute of Biological Sciences, National Research Council of Canada, Ottawa
| | | | | | | | | |
Collapse
|
25
|
Nicholson R, Pelletier J, Le SY, Sonenberg N. Structural and functional analysis of the ribosome landing pad of poliovirus type 2: in vivo translation studies. J Virol 1991; 65:5886-94. [PMID: 1656077 PMCID: PMC250251 DOI: 10.1128/jvi.65.11.5886-5894.1991] [Citation(s) in RCA: 160] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
The naturally uncapped genomic and mRNAs of poliovirus initiate translation by an internal ribosome-binding mechanism. The mRNA 5' untranslated region (UTR) of poliovirus is approximately 750 nucleotides in length and has seven to eight (depending on the serotype) AUG codons upstream of the initiator AUG. The sequence required for internal ribosome binding has been termed the ribosome landing pad (RLP). To better understand the mechanisms of internal initiation, we have determined the boundaries and critical elements of the RLP of poliovirus type 2 (Lansing strain) in vivo. By using deletion analysis, we demonstrate the existence of a core RLP in the poliovirus mRNA 5' UTR whose boundaries are between nucleotides 134 and 155 at the 5' end and nucleotides 556 and 585 at the 3' end. Sequences flanking the core RLP affect translational activity. The importance of several stem-loop structures in the RLP for internal initiation has been determined. Mutation of the phylogenetically conserved loop sequences in the proximal stem-loop structure of the RLP (stem-loop structure III; nucleotides 127 to 165) abolished internal translation. However, deletion of the second stem-loop in the RLP (stem-loop structure IV; nucleotides 189 to 223) reduced internal translation by only 50%. Internal deletions encompassing nucleotides 240 to 300, 350 to 380, or 450 to 480, predicted to disrupt stem-loop structure V and possibly VI, also abrogated internal initiation. Small point mutations within a short polypyrimidine sequence, highly conserved among all picornaviruses, abolished translation. A conservation of distance between the conserved polypyrimidine tract and a downstream AUG could play an important role in the mechanism of internal initiation.
Collapse
Affiliation(s)
- R Nicholson
- Department of Biochemistry, McGill University, Montreal, Canada
| | | | | | | |
Collapse
|
26
|
Chen TA, Smith MM, Le SY, Sternglanz R, Allfrey VG. Nucleosome fractionation by mercury affinity chromatography. Contrasting distribution of transcriptionally active DNA sequences and acetylated histones in nucleosome fractions of wild-type yeast cells and cells expressing a histone H3 gene altered to encode a cysteine 110 residue. J Biol Chem 1991; 266:6489-98. [PMID: 2007598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
A technique for the separation of transcriptionally active and inactive nucleosomes by mercury affinity chromatography has been applied to study the nucleosomal distribution of DNA sequences from the GAL1, ACT1, HIS4, MAT alpha, and HMRa genes of yeast. In mammalian cells, the method has been shown to separate active from inactive nucleosomes and to fractionate the active nucleosomes into two classes, one retained on the mercury column because of salt-labile associations with certain thiol-reactive non-histone proteins, and the other bound by covalent linkage of the cysteine 110 thiol groups of histone H3 molecules to the mercurated support. The first class of nucleosomes is elutable in 0.5 M NaCl; the second is displaced by 10 mM dithiothreitol (DTT) (Walker, J., Chen, T. A., Sterner, R., Berger, M., Winston, F., and Allfrey, V.G. (1990) J. Biol. Chem. 265, 5736-5746). We show that, in wild-type yeast cells, in which histone H3 lacks cysteinyl residues, very little DNA and a negligible complement of nucleosomes appear in the DTT-eluate, confirming the requirement for the H3-thiols in the mercury-binding reaction. Moreover, the DTT-eluted fraction is seriously deficient in the actively transcribed GAL1, ACT1, HIS4, and MAT alpha DNA sequences. Site-directed mutagenesis was employed to create an H3 gene containing a cysteine codon in place of the alanine codon at position 110 of the yeast H3 amino acid sequence. A strain was constructed containing the mutant histone H3 gene instead of the normal H3 gene. Subsequent fractionations of the mutant nucleosomes by mercury-affinity chromatography revealed a characteristic nucleosome peak in the DTT-eluted fraction. Its content of transcribed GAL1, ACT1, and HIS4 DNA sequences was 20- to 500-fold higher than that of the corresponding DTT-eluted fraction of wild-type yeast. Although this result is in accord with the finding that, in mammalian cells, the thiol groups of histone H3 become accessible when nucleosomes "unfold" during transcription, we find that nucleosomes containing the GAL1 DNA sequences of the yeast H3-mutant also bind to the mercury column when that gene is not being expressed. We conclude that many yeast nucleosomes are maintained in a "primed," potentially active state, possibly due to the very high constitutive levels of acetylation of the core histones. However, the nucleosomes of the HMRa gene, which is not expressed in a MAT alpha yeast strain, are virtually absent from the DTT-eluted nucleosome fractions of the H3-mutant cells, indicating that prolonged silencing of the gene is accompanied by compaction and loss of H3-thiol reactivity of its nucleosomes.
Collapse
Affiliation(s)
- T A Chen
- Laboratory of Cell Biology, Rockefeller University, New York, New York 10021
| | | | | | | | | |
Collapse
|
27
|
Chen TA, Smith MM, Le SY, Sternglanz R, Allfrey VG. Nucleosome fractionation by mercury affinity chromatography. Contrasting distribution of transcriptionally active DNA sequences and acetylated histones in nucleosome fractions of wild-type yeast cells and cells expressing a histone H3 gene altered to encode a cysteine 110 residue. J Biol Chem 1991. [DOI: 10.1016/s0021-9258(18)38145-6] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
|
28
|
Abstract
A new approach is proposed for determining common RNA secondary structures within a set of homologous RNAs. The approach is a combination of phylogenetic and thermodynamic methods which is based on the prediction of optimal and suboptimal secondary structures, topological similarity searches and phylogenetic comparative analysis. The optimal and suboptimal RNA secondary structures are predicted by energy minimization. Structural comparison of the predicted RNA secondary structures is used to find conserved structures that are topologically similar in all these homologous RNAs. The validity of the conserved structural elements found is then checked by phylogenetic comparison of the sequences. This procedure is used to predict common structures of ribonuclease P (RNAase P) RNAs.
Collapse
Affiliation(s)
- S Y Le
- Institute for Biological Sciences, National Research Council of Canada, Ottawa, Ontario
| | | |
Collapse
|
29
|
Abstract
We have developed a method for detecting more stable and significant folding regions relative to others in the sequence. The algorithm is based on the calculation of the lowest free energy of RNA secondary structures and Monte Carlo simulation. For any given RNA segment, the stability and statistical significance of RNA folding are assessed by two measures: the stability score and the significance score. The stability score measures the degree of thermodynamic stability of the segment between all possible biological segments in the RNA sequence. The significance score characterizes the specific arrangement of the nucleotides in the segment that could imply a structural role for the sequence information. Using these two measures, we are able to detect a series of distinct folding regions where highly stable and statistically significant secondary structures occur in human immunodeficiency virus (HIV) and simian immunodeficiency virus (SIV) sequences.
Collapse
Affiliation(s)
- S Y Le
- Institute for Biological Sciences, National Research Council of Canada, Ottawa
| | | | | |
Collapse
|
30
|
Abstract
A total of 4051 suboptimal secondary structures are predicted by folding the 5' non-coding region of ten polioviruses, five human rhinoviruses and three coxsackieviruses using our new suboptimal folding algorithm for the prediction of both optimal and suboptimal RNA secondary structures. A comparative analysis of these RNA secondary structures reveals the conservation of common secondary structure that can be supported by phylogenetic data. The thermodynamic stability and statistical significance of these predicted, conserved helical elements are assessed and significant structure motifs in the 5' non-coding region are proposed. The possible roles of these structure motifs in the virus life cycle are discussed.
Collapse
Affiliation(s)
- S Y Le
- Institute for Biological Sciences, National Research Council of Canada, Ottawa, Ontario
| | | |
Collapse
|
31
|
Tiley LS, Brown PH, Le SY, Maizel JV, Clements JE, Cullen BR. Visna virus encodes a post-transcriptional regulator of viral structural gene expression. Proc Natl Acad Sci U S A 1990; 87:7497-501. [PMID: 2170981 PMCID: PMC54774 DOI: 10.1073/pnas.87.19.7497] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Visna virus is an ungulate lentivirus that is distantly related to the primate lentiviruses, including human immunodeficiency virus type 1 (HIV-1). Replication of HIV-1 and of other complex primate retroviruses, including human T-cell leukemia virus type I (HTLV-I), requires the expression in trans of a virally encoded post-transcriptional activator of viral structural gene expression termed Rev (HIV-1) or Rex (HTLV-I). We demonstrate that the previously defined L open reading frame of visna virus encodes a protein, here termed Rev-V, that is required for the cytoplasmic expression of the incompletely spliced RNA that encodes the viral envelope protein. Transactivation by Rev-V was shown to require a cis-acting target sequence that coincides with a predicted RNA secondary structure located within the visna virus env gene. However, Rev-V was unable to function by using the structurally similar RNA target sequences previously defined for Rev or Rex and, therefore, displays a distinct sequence specificity. Remarkably, substitution of this visna virus target sequence in place of the HIV-1 Rev response element permitted the Rev-V protein to efficiently rescue the expression of HIV-1 structural proteins, including Gag, from a Rev- proviral clone. These results suggest that the post-transcriptional regulation of viral structural gene expression may be a characteristic feature of complex retroviruses.
Collapse
Affiliation(s)
- L S Tiley
- Howard Hughes Medical Institute, Duke University Medical Center, Durham, NC 27710
| | | | | | | | | | | |
Collapse
|
32
|
Abstract
The simplest dynamic algorithm for planar RNA folding searches for the maximum number of base pairs. The algorithm uses O(n3) steps. The more general case, where different weights (energies) are assigned to stacked base pairs and to the various types of single-stranded region topologies, requires a considerably longer computation time because of the partial backtracking involved. Limiting the loop size reduces the running time back to O(n3). Reduction in the number of steps in the calculations of the various RNA topologies has recently been suggested, thereby improving the time behavior. Here we show how a "jumping" procedure can be used to speed up the computation, not only for the maximal number of base pairs algorithm, but for the minimal energy algorithm as well.
Collapse
Affiliation(s)
- R Nussinov
- Sackler Institute for Molecular Medicine, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv, Israel
| | | | | | | |
Collapse
|
33
|
Le SY, Malim MH, Cullen BR, Maizel JV. A highly conserved RNA folding region coincident with the Rev response element of primate immunodeficiency viruses. Nucleic Acids Res 1990; 18:1613-23. [PMID: 2326200 PMCID: PMC330534 DOI: 10.1093/nar/18.6.1613] [Citation(s) in RCA: 75] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
A series of unusual folding regions (UFR) immediately 3' to the cleavage site of the outer membrane protein (OMP) and transmembrane protein (TMP) were detected in the envelope gene RNA of the human immunodeficiency virus (HIV-1, HIV-2) and simian immunodeficiency virus (SIV) by an extensive Monte Carlo simulation. These RNA secondary structures were predicted to be both highly stable and statistically significant. In the calculation, twenty-five different sequence isolates of HIV-1, three isolates of HIV-2 and eight sequences of SIV were included. Although significant sequence divergence occurs in the env coding regions of these viruses, a distinct UFR of 234-nt is consistently located ten nucleotides 3' to the cleavage site of the OMP/TMP in HIV-1, and a 216-nt UFR occurs forty-six and forty-nine nucleotides downstream from the OMP/TMP cleavage site of HIV-2 and SIV, respectively. Compensatory base changes in the helical stem regions of these conserved RNA secondary structures are identified. These results support the hypothesis that these special RNA folding regions are functionally important and suggest that the role of this sequence as the Rev response element (RRE) is mediated by secondary structure as well as primary RNA sequence.
Collapse
Affiliation(s)
- S Y Le
- Division of Cancer Biology and Diagnosis, National Cancer Institute, NCI/FCRF, National Institutes of Health, Frederick, MD 21701
| | | | | | | |
Collapse
|
34
|
Abstract
In our recent series of papers, we have used the structures of statistical significance from Monte Carlo simulations to improve the predictions of secondary structure of RNA and to analyze the possible role of locally significant structures in the life cycle of human immunodeficiency virus. Because of intensive computational requirements for Monte Carlo simulation, it becomes impractical even using a supercomputer to assess the significance of a structure with a window size greater than 200 along an RNA sequence of 1000 bases or more. In this paper, we have developed a new procedure that drastically reduces the time needed to assess the significance of structures. In fact, the efficiency of this new method allows us to assess structures on the VAX as well as the CRAY.
Collapse
Affiliation(s)
- J H Chen
- Division of Cancer Biology and Diagnosis, National Cancer Institute, National Institutes of Health, Frederick, MD 21701
| | | | | | | | | |
Collapse
|
35
|
Abstract
The trans-activator (Tat) proteins of the related but distinct type 1 and type 2 human immunodeficiency viruses (HIV-1 and HIV-2) display incomplete functional reciprocity. One possible explanation of this observation, suggested by computer analysis of potential RNA secondary structures within the viral trans-activation response (TAR) elements, is that HIV-2 Tat requires the presentation of two viral RNA stem-loop sequences for full activity whereas HIV-1 Tat is maximally active upon presentation of a single stem-loop structure. Here, we demonstrate that the HIV-2 long terminal repeat indeed contains two functionally independent TAR elements. However, the second (3') TAR element of HIV-2 is significantly less active than the 5' TAR element and is functionally masked in the context of an intact HIV-2 long terminal repeat. Evidence is presented suggesting that the activities of these two HIV-2 TAR elements reflect, at least in part, their relative distances from the site of transcription initiation. Although the HIV-2 TAR element proximal to the viral mRNA cap site appears to be sufficient for effective trans activation by HIV-2 Tat in vitro, this functional redundancy may nevertheless serve to enhance HIV-2 replication in infected cells in vivo.
Collapse
Affiliation(s)
- R Fenrick
- Howard Hughes Medical Institute, Department of Medicine, Duke University Medical Center, Durham, North Carolina 27710
| | | | | | | | | | | |
Collapse
|
36
|
Malim MH, Böhnlein S, Fenrick R, Le SY, Maizel JV, Cullen BR. Functional comparison of the Rev trans-activators encoded by different primate immunodeficiency virus species. Proc Natl Acad Sci U S A 1989; 86:8222-6. [PMID: 2682638 PMCID: PMC298252 DOI: 10.1073/pnas.86.21.8222] [Citation(s) in RCA: 84] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The known primate lentiviruses can be divided into two subgroups consisting of the human immunodeficiency virus type 1 (HIV-1) isolates and the related HIV type 2 (HIV-2) and simian immunodeficiency virus (SIV) isolates. HIV-1 has been shown to encode a post-transcriptional trans-activator of viral structural gene expression, termed Rev, that is essential for viral replication in culture. Here, we demonstrate that HIV-2 and SIVmac also encode functional Rev proteins. As in the case of HIV-1, these Rev trans-activators are shown to induce the cytoplasmic expression of the unspliced viral transcripts that encode the viral structural proteins. Unexpectedly, the Rev proteins of HIV-2 and SIVmac proved incapable of activating the cytoplasmic expression of unspliced HIV-1 transcripts, whereas HIV-1 Rev was fully functional in the HIV-2/SIV system. This nonreciprocal complementation may imply a direct role for Rev in mediating the recognition of its viral RNA target sequence.
Collapse
Affiliation(s)
- M H Malim
- Howard Hughes Medical Institute, Duke University Medical Center, Durham, NC 27710
| | | | | | | | | | | |
Collapse
|
37
|
Abstract
To facilitate comparison of RNA secondary structures each structure is represented as an ordered labeled tree. Several alternate secondary structures yielding a set of trees can be computed for any given RNA molecule (sequence). Frequently recurring subtrees are searched in this set of trees. The consensus structure motifs are then selected and used to construct a secondary structure model of the RNA. Given the difficulties involved in RNA secondary structure calculations, this procedure may significantly improve our predictive capabilities. In addition, the change of secondary structures between two different RNA sequences is described as a transformation of ordered trees. The transferable ratio of tree A from tree B is defined as a proportion of the largest common subtrees in trees A and B occurring in tree A. The method is applied to the study of the mechanism of human alpha 1 globin pre-mRNA splicing. In the study, two tentative splicing mechanisms, A and B, with different orders of intron excision from alpha 1 globin pre-mRNA have been stimulated. A possible relationship between the structural features of the secondary structures and the order of intron excision in the pathway of precursor splicing of human alpha 1 globin is discussed.
Collapse
Affiliation(s)
- S Y Le
- Division of Cancer Biology and Diagnosis, National Cancer Institute, Frederick, Maryland 21701
| | | | | |
Collapse
|
38
|
Hanly SM, Rimsky LT, Malim MH, Kim JH, Hauber J, Duc Dodon M, Le SY, Maizel JV, Cullen BR, Greene WC. Comparative analysis of the HTLV-I Rex and HIV-1 Rev trans-regulatory proteins and their RNA response elements. Genes Dev 1989; 3:1534-44. [PMID: 2482226 DOI: 10.1101/gad.3.10.1534] [Citation(s) in RCA: 169] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
The Rex proteins of types I and II human T-cell leukemia viruses (HTLV-I, HTLV-II) are required for expression of the viral structural gene products, gag and env and, thus, are essential for the replication of these pathogenic retroviruses. The action of Rex is sequence specific, requiring the presence of a cis-acting Rex response element located in the 3' long terminal repeat. This element corresponds to a predicted RNA secondary structure and functions in an orientation-dependent but position-independent manner. Rex acts through this response element to stimulate the nuclear export of the unspliced or singly spliced viral mRNA species encoding the virion structural proteins that are normally excluded from the cytoplasm. Although the Rex proteins of HTLV-I and HTLV-II can also function via the related Rev response element present in the env gene of the type I human immunodeficiency virus (HIV-1), the analogous HIV-1 Rev protein is unable to act on the HTLV-I Rex response element. This nonreciprocal pattern of genetic complementation by Rex and Rev suggests that these viral trans-regulators may interact directly with their RNA response elements.
Collapse
Affiliation(s)
- S M Hanly
- Howard Hughes Medical Institute, Duke University Medical Center, Durham, North Carolina 27710
| | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Le SY, Chen JH, Maizel JV. Thermodynamic stability and statistical significance of potential stem-loop structures situated at the frameshift sites of retroviruses. Nucleic Acids Res 1989; 17:6143-52. [PMID: 2549508 PMCID: PMC318267 DOI: 10.1093/nar/17.15.6143] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
RNA stem-loop structures situated just 3' to the frameshift sites of the retroviral gag-pol or gag-pro and pro-pol regions may make important contributions to frame-shifting in retroviruses. In this study, the thermodynamic stability and statistical significance of such secondary structural features relative to others in the sequence have been assessed using a newly developed method that combines calculations of the lowest free energy of formation of RNA secondary structures and the Monte Carlo simulations. Our results show that stem-loop structures situated just 3' to the frameshift sites are both highly stable and statistically significant relative to others in the gag-pol or gag-pro and pro-pol junction domains (both 300 nucleotides upstream and downstream from the possible frameshift sites are included) of Rous sarcoma virus (RSV), human immunodeficiency virus (HIV-1), bovine leukemia virus (BLV), human T-cell leukemia virus type II (HTLV-II), and mouse mammary tumor virus (MMTV). No other more stable, or significant folding regions are predicted in these domains.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, National Cancer Institute, Frederick, MD 21701
| | | | | |
Collapse
|
40
|
Le SY, Owens J, Nussinov R, Chen JH, Shapiro B, Maizel JV. RNA secondary structures: comparison and determination of frequently recurring substructures by consensus. Comput Appl Biosci 1989; 5:205-10. [PMID: 2475224 DOI: 10.1093/bioinformatics/5.3.205] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
A method for assessing the preserved stem-loops of RNA secondary structures is presented. Frequently recurring helical stems in a set of secondary structures resulting from the simulated folding process of a given RNA are assessed and consensus structural motifs can then be selected to construct a secondary structure of the RNA. Alternatively, it can be applied to a series of 'optimal' and 'suboptimal' secondary structures computed using the dynamic program developed by Williams and Tinoco. To demonstrate the power and the usefulness of the program we give examples of this procedure.
Collapse
Affiliation(s)
- S Y Le
- Division of Cancer Biology and Diagnosis, National Institutes of Health, Frederick, MD 21701
| | | | | | | | | | | |
Collapse
|
41
|
Abstract
We have developed a statistical method that is designed for analyzing potential RNA folded substructures. The statistical significance of RNA folding is assessed by the segment score. The segment score is defined as the difference between the lowest free energy calculated for the real biological sequence and the mean of the lowest free energies from random permutations of the real segment sequence, divided by the standard deviation of the random sample. This procedure was applied to the well-studied Escherichia coli 16S rRNA and potato spindle tuber viroid (PSTV) RNA. The results showed that the predictions of the locally significant secondary structures in these two molecules are in accord with the universally conserved local secondary structure elements (Gutell, Weiser & Noller, 1985, Prog. Nucl. Acid Res. molec. Biol. 32, 155-216; Riesner & Gross, 1985, A. Rev. Biochem. 54, 531-564). In addition, a statistical analysis indicated that the lowest free energies of a random sample set follow an approximately normal distribution. A reasonable size for the random sample set was determined statistically. Moreover, the statistical evaluation has been carried out using three different sets of energy rules--two sets (Salser, 1977, Cold Spring Harb. Symp. Quant Biol. 42, 985-1002; Freier, Kierzek, Jaeger, Sugimoto, Caruthers, Neilson & Turner, 1986, Proc. natn. Acad. Sci. U.S.A. 83, 9373-9377) take into account stacking energies and are based on experimental data and their computational extension (Salser, 1977)--the third set is a simplistic "unitary matrix" approach, where any base-pair is given a weight of "minus one" and an unpaired based is "zero". The Freier energy rules usually yield the strongest indication of significant folding region. However, the results derived from paired comparisons test don't provide sufficient evidence for concluding that a different set of energy rules is effective in changing the segment score level for local stem-loop structures in the 16S rRNA.
Collapse
Affiliation(s)
- S Y Le
- Division of Cancer Biology and Diagnosis, National Cancer Institute, Frederick, Maryland 21701
| | | |
Collapse
|
42
|
Le SY, Chen JH, Chatterjee D, Maizel JV. Sequence divergence and open regions of RNA secondary structures in the envelope regions of the 17 human immunodeficiency virus isolates. Nucleic Acids Res 1989; 17:3275-88. [PMID: 2726458 PMCID: PMC317728 DOI: 10.1093/nar/17.8.3275] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Genetic variation during the course of infection of an individual is a remarkable feature of the acquired immune deficiency syndrome (AIDS) disease. This variation has been studied for the envelope protein encoding regions of seventeen different sequences from various isolates of human immunodeficiency virus (HIV) using multiple sequence comparison and calculation of variability. The open regions with little intramolecular base pairing in these envelope sequences are predicted by a recently developed statistical method. The minimum length L for a run of hypervariable sites, conserved sites, or open regions that gives significance at the 1% (or 0.1%) level is then determined by a scan statistical method. The results show that significant clusters of open regions predicted at the RNA levels correlate with significant clusters of hypervariable sites in the HIV envelope gene. Those significant genomic variations in HIVs seem to be manifested mainly in the extracellular portion of the envelope protein. Twelve potential antigenic determinants are predicted using an antigenic index method. Interestingly, the majority of the significant hypervariable regions in the exterior envelope protein (gp120) were predicted potential epitopes.
Collapse
Affiliation(s)
- S Y Le
- Division of Cancer Biology and Diagnosis, National Cancer Institute, Frederick, MD 21701
| | | | | | | |
Collapse
|
43
|
Malim MH, Hauber J, Le SY, Maizel JV, Cullen BR. The HIV-1 rev trans-activator acts through a structured target sequence to activate nuclear export of unspliced viral mRNA. Nature 1989; 338:254-7. [PMID: 2784194 DOI: 10.1038/338254a0] [Citation(s) in RCA: 991] [Impact Index Per Article: 28.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Human immunodeficiency virus type 1 (HIV-1) replication requires the expression of two classes of viral mRNA. The early class of HIV-1 transcripts is fully spliced and encodes viral regulatory gene products. The functional expression of one of these nuclear regulatory proteins, termed Rev (formerly Art or Trs), induces the cytoplasmic expression of the incompletely spliced, late class of HIV-1 mRNAs that encode the viral structural proteins, including Gag and Env. Here, we provide evidence that this induction reflects the export from the cell nucleus to the cytoplasm of a pool of unspliced viral RNA constitutively expressed in the nucleus. The hypothesis that Rev acts on RNA transport, rather than splicing, is further supported by the observation that the cytoplasmic expression of a non-spliceable HIV-1 env gene sequence is also subject to Rev regulation. Here we show that this Rev response requires a specific target sequence which coincides with a complex RNA secondary structure present in the env gene. The response to Rev is fully maintained when this sequence is relocated to other exonic or intronic locations within env but is ablated by inversion. These results indicate that the HIV-1 rev gene product induces HIV-1 structural gene expression by activating the sequence-specific nuclear export of incompletely spliced HIV-1 RNA species.
Collapse
Affiliation(s)
- M H Malim
- Howard Hughes Medical Institute, Duke University Medical Center, Durham, North Carolina 27710
| | | | | | | | | |
Collapse
|
44
|
Le SY, Chen JH, Nussinov R, Maizel JV. An improved secondary structure computation method and its application to intervening sequence in the human alpha-like globin mRNA precursors. Comput Appl Biosci 1988; 4:337-44. [PMID: 3046713 DOI: 10.1093/bioinformatics/4.3.337] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Current secondary structure prediction computations have a serious drawback. The calculated thermodynamically most stable structure often differs from that observed in solution or in crystal form. In this paper we suggest a way to partially over-come some of these limitations by simulating the RNA folding process and calculating the frequencies of occurrence of the various substructures obtained. The frequently recurring substructures are then selected to construct the secondary structure of the whole RNA. 142 tRNA molecules and an E. coli 16S rRNA molecule have been examined by this method. The percentage of successful prediction of the correct helices are significantly higher than those calculated previously. The secondary structures of intervening sequences (IVSs) excised from human alpha-like globin pre-mRNAs are also computed. Thus, in this method the secondary structures obtained are composed of the statistically more significant substructures. This has also been demonstrated by using randomly shuffled sequences. The secondary structures of each of the randomized sequences are computed and their mean and standard deviations are used in evaluating the significance of the substructures obtained in the folding of the biological sequence. Some potentially appealing structural features aligning adjacent exons for ligation have been found.
Collapse
Affiliation(s)
- S Y Le
- Laboratory of Mathematical Biology, National Cancer Institute, Frederick, MD 21701
| | | | | | | |
Collapse
|
45
|
Le SY, Chen JH, Braun MJ, Gonda MA, Maizel JV. Stability of RNA stem-loop structure and distribution of non-random structure in the human immunodeficiency virus (HIV-I). Nucleic Acids Res 1988; 16:5153-68. [PMID: 3387221 PMCID: PMC336724 DOI: 10.1093/nar/16.11.5153] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
The stability of potential RNA stem-loop structures in human immunodeficiency virus isolates, HTLV-III and ARV, has been calculated, and the relevance to the local significant secondary structures in the sequence has been tested statistically using a Monte Carlo simulation method. Potentially significant structures exist in the 5'non-coding region, the boundary regions between the protein coding frames, and the 3' non-coding region. The locally optimal secondary structure occurring in the 5' terminal region has been assessed using different overlapping segment sizes and the Monte Carlo method. The results show that the most favorable structure for the 5' mRNA leader sequence of HIV has two stem-loops folded at nucleotides 5-104 in the R region (stem-loop I, 5-54 and stem-loop II, 58-104). A large fluctuation of segment score of the local optimal secondary structure also occurs in the boundary between the exterior glycosylated protein or outer membrane protein and transmembrane protein coding region. This finding is surprising since no RNA signals or RNA processing are expected to occur at this site. In addition, regions of the genome predicted to have significantly more open structure at the RNA level correlate closely with hypervariable sites found in these viral genomes. The possible importance of local secondary structure to the biological function of the human immunodeficiency virus genome is discussed.
Collapse
Affiliation(s)
- S Y Le
- Division of Cancer Biology and Diagnosis, National Cancer Institute, Frederick, MD 21701
| | | | | | | | | |
Collapse
|
46
|
Le SY, Currey KM, Nussinov R, Maizel JV. Studies of frequently recurring substructures in human alpha-like globin mRNA precursors. Comput Biomed Res 1987; 20:563-82. [PMID: 3691069 DOI: 10.1016/0010-4809(87)90026-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
In general, the results obtained from secondary structure prediction algorithms are often inconsistent with those obtained experimentally. The reason for this disagreement is that the experimentally determined structures have higher free energies (as judged by the currently used "energy rules") than the predicted ones. To overcome this limitation we have developed a new approach which incorporates the frequencies of occurrence of substructures in the growing mRNA chain. This has been accomplished by simulating the folding process of pre-mRNAs. Using this approach we have significantly improved current helical structural prediction for 142 analyzed tRNAs and 16 S rRNA. We have next applied this method to the human alpha-like globins. Comparison of the structures obtained by running the currently used algorithms with those computed by the new method indicates that the final most stable secondary structure contains some infrequently occurring substructures. In addition, some of the frequently recurring substructures are not included in the final structure. Comparison of the simulated folding processes of the human alpha-like globin pre-mRNAs reveals some conserved helices and hairpin loop structures in those frequently recurring substructures. Among these several compensating base changes (transitions and transversions) have been identified.
Collapse
Affiliation(s)
- S Y Le
- Division of Cancer Biology and Diagnosis, National Cancer Institute, Frederick, Maryland 21701
| | | | | | | |
Collapse
|