201
|
Rodriguez J, Vives L, Jordà M, Morales C, Muñoz M, Vendrell E, Peinado MA. Genome-wide tracking of unmethylated DNA Alu repeats in normal and cancer cells. Nucleic Acids Res 2007; 36:770-84. [PMID: 18084025 PMCID: PMC2241897 DOI: 10.1093/nar/gkm1105] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Methylation of the cytosine is the most frequent epigenetic modification of DNA in mammalian cells. In humans, most of the methylated cytosines are found in CpG-rich sequences within tandem and interspersed repeats that make up to 45% of the human genome, being Alu repeats the most common family. Demethylation of Alu elements occurs in aging and cancer processes and has been associated with gene reactivation and genomic instability. By targeting the unmethylated SmaI site within the Alu sequence as a surrogate marker, we have quantified and identified unmethylated Alu elements on the genomic scale. Normal colon epithelial cells contain in average 25 486 ± 10 157 unmethylated Alu's per haploid genome, while in tumor cells this figure is 41 995 ± 17 187 (P = 0.004). There is an inverse relationship in Alu families with respect to their age and methylation status: the youngest elements exhibit the highest prevalence of the SmaI site (AluY: 42%; AluS: 18%, AluJ: 5%) but the lower rates of unmethylation (AluY: 1.65%; AluS: 3.1%, AluJ: 12%). Data are consistent with a stronger silencing pressure on the youngest repetitive elements, which are closer to genes. Further insights into the functional implications of atypical unmethylation states in Alu elements will surely contribute to decipher genomic organization and gene regulation in complex organisms.
Collapse
Affiliation(s)
- Jairo Rodriguez
- Institut d'Investigació Biomèdica de Bellvitge (IDIBELL), L'Hospitalet, Catalonia, Spain
| | | | | | | | | | | | | |
Collapse
|
202
|
Coenye T, Drevinek P, Mahenthiralingam E, Shah SA, Gill RT, Vandamme P, Ussery DW. Identification of putative noncoding RNA genes in the Burkholderia cenocepacia J2315 genome. FEMS Microbiol Lett 2007; 276:83-92. [PMID: 17937666 DOI: 10.1111/j.1574-6968.2007.00916.x] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Noncoding RNA (ncRNA) genes are not involved in the production of mRNA and proteins, but produce transcripts that function directly as structural or regulatory RNAs. In the present study, the presence of ncRNA genes in the genome of Burkholderia cenocepacia J2315 was evaluated by combining comparative genomics (alignment-based) and predicted secondary structure approaches. Two hundred and thirteen putative ncRNA genes were identified in the B. cenocepacia J2315 genome and upregulated expression of four of these could be confirmed by microarray analysis. Most of the ncRNA gene transcripts have a marked predicted secondary structure that may facilitate interaction with other molecules. Several B. cenocepacia J2315 ncRNAs seem to be related to previously characterized ncRNAs involved in regulation of various cellular processes, while the function of many others remains unknown. The presence of a large number of ncRNA genes in this organism may help to explain its complexity, phenotypic variability and ability to survive in a remarkably wide range of environments.
Collapse
Affiliation(s)
- Tom Coenye
- Laboratorium voor Microbiologie, Universiteit Gent, Gent, Belgium.
| | | | | | | | | | | | | |
Collapse
|
203
|
Metabolic signature of breast cancer cell line MCF-7: profiling of modified nucleosides via LC-IT MS coupling. BMC BIOCHEMISTRY 2007; 8:25. [PMID: 18047657 PMCID: PMC2219991 DOI: 10.1186/1471-2091-8-25] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2007] [Accepted: 11/29/2007] [Indexed: 11/22/2022]
Abstract
Background Cancer, like other diseases accompanied by strong metabolic disorders, shows characteristic effects on cell turnover rate, activity of modifying enzymes and DNA/RNA modifications, resulting also in elevated amounts of excreted modified nucleosides. For a better understanding of the impaired RNA metabolism in breast cancer cells, we screened these metabolites in the cell culture supernatants of the breast cancer cell line MCF-7 and compared it to the human mammary epithelial cells MCF-10A. The nucleosides were isolated and analyzed via 2D-chromatographic techniques: In the first dimension by cis-diol specific boronate affinity extraction and subsequently by reversed phase chromatography coupled to an ion trap mass spectrometer. Results Besides the determination of ribonucleosides, additional compounds with cis-diol structure, deriving from cross-linked biochemical pathways, like purine-, histidine- and polyamine metabolism were detected. In total, 36 metabolites were identified by comparison of fragmentation patterns and retention time. Relation to the internal standard isoguanosine yielded normalized area ratios for each identified compound and enabled a semi-quantitative metabolic signature of both analyzed cell lines. 13 of the identified 26 modified ribonucleosides were elevated in the cell culture supernatants of MCF-7 cells, with 5-methyluridine, N2,N2,7-trimethylguanosine, N6-methyl-N6-threonylcarbamoyladenosine and 3-(3-aminocarboxypropyl)-uridine showing the most significant differences. 1-ribosylimidazole-4-acetic acid, a histamine metabolite, was solely found in the supernatants of MCF-10A cells, whereas 1-ribosyl-4-carboxamido-5-aminoimidazole and S-adenosylmethionine occurred only in supernatants of MCF-7 cells. Conclusion The obtained results are discussed against the background of pathological changes in cell metabolism, resulting in new perspectives for modified nucleosides and related metabolites as possible biomedical markers for breast carcinoma in vivo.
Collapse
|
204
|
Rose D, Hackermüller J, Washietl S, Reiche K, Hertel J, Findeiß S, Stadler PF, Prohaska SJ. Computational RNomics of drosophilids. BMC Genomics 2007; 8:406. [PMID: 17996037 PMCID: PMC2216035 DOI: 10.1186/1471-2164-8-406] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2007] [Accepted: 11/08/2007] [Indexed: 11/11/2022] Open
Abstract
BACKGROUND Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic for positive selection on RNA secondary structure. The deep-sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set of comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz. RESULTS We obtain 16 000 high quality predictions among which we recover the majority of the known ncRNAs. Taking a pessimistically estimated false discovery rate of 40% into account, this implies that at least some ten thousand loci in the Drosophila genome show the hallmarks of stabilizing selection action of RNA structure, and hence are most likely functional at the RNA level. A subset of RNAz predictions overlapping with TRF1 and BRF binding sites [Isogai et al., EMBO J. 26: 79-89 (2007)], which are plausible candidates of Pol III transcripts, have been studied in more detail. Among these sequences we identify several "clusters" of ncRNA candidates with striking structural similarities. CONCLUSION The statistical evaluation of the RNAz predictions in comparison with a similar analysis of vertebrate genomes [Washietl et al., Nat. Biotech. 23: 1383-1390 (2005)] shows that qualitatively similar fractions of structured RNAs are found in introns, UTRs, and intergenic regions. The intergenic RNA structures, however, are concentrated much more closely around known protein-coding loci, suggesting that flies have significantly smaller complement of independent structured ncRNAs compared to mammals.
Collapse
Affiliation(s)
- Dominic Rose
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstraße 16-18, Leipzig, Germany, D-04107
| | - Jörg Hackermüller
- Fraunhofer Institute for Cell Therapy and Immunology, Deutscher Platz 5e, Leipzig, Germany, D-04103
- Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, Leipzig, Germany, D-04107
| | - Stefan Washietl
- Department of Theoretical Chemistry, University of Vienna, Währingerstraße 17,Wien, Austria, A-1090
| | - Kristin Reiche
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstraße 16-18, Leipzig, Germany, D-04107
| | - Jana Hertel
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstraße 16-18, Leipzig, Germany, D-04107
- Department of Theoretical Chemistry, University of Vienna, Währingerstraße 17,Wien, Austria, A-1090
| | - Sven Findeiß
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstraße 16-18, Leipzig, Germany, D-04107
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstraße 16-18, Leipzig, Germany, D-04107
- Fraunhofer Institute for Cell Therapy and Immunology, Deutscher Platz 5e, Leipzig, Germany, D-04103
- Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, Leipzig, Germany, D-04107
- Department of Theoretical Chemistry, University of Vienna, Währingerstraße 17,Wien, Austria, A-1090
- Santa Fe Institute,1399 Hyde Park Rd., Santa Fe, USA, NM 87501
| | - Sonja J Prohaska
- Biomedical Informatics, Arizona State University, Tempe, PO-Box 878809, USA, AZ 85287
| |
Collapse
|
205
|
Barnes MR, Deharo S, Grocock RJ, Brown JR, Sanseau P. The micro RNA target paradigm: a fundamental and polymorphic control layer of cellular expression. Expert Opin Biol Ther 2007; 7:1387-99. [PMID: 17727328 DOI: 10.1517/14712598.7.9.1387] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Evidence is emerging that micro RNA (miRNA) is an important and potentially polymorphic regulatory layer for silencing gene expression in vivo. Knowledge of miRNA targeting may help to elucidate the function of many human genes in common diseases, providing a powerful target validation technology. Accurate in silico prediction of miRNA targets in mRNA is a critical capability, allowing effective evaluation of the impact of variation on the creation, strengthening, weakening and destruction of miRNA binding sites. Application of such analyses identifies thousands of single-nucleotide polymorphisms, which may potentially impact miRNA regulation of mRNA. The authors believe this information may offer a real opportunity to study miRNA function at a number of levels. First, sequence-focused analysis will help to define the functional boundaries of miRNA target binding. Second, one may be able to identify miRNA target variants in mRNA with a direct role in human disease, which may be valuable therapeutic targets.
Collapse
Affiliation(s)
- Michael R Barnes
- Molecular Discovery Research Informatics, Molecular Discovery Research, GlaxoSmithKline Pharmaceuticals, New Frontiers Science Park (North), Third Avenue, Harlow, Essex, CM 19 5AW, UK.
| | | | | | | | | |
Collapse
|
206
|
|
207
|
Horesh Y, Doniger T, Michaeli S, Unger R. RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules. BMC Bioinformatics 2007; 8:366. [PMID: 17908318 PMCID: PMC2147038 DOI: 10.1186/1471-2105-8-366] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2007] [Accepted: 10/01/2007] [Indexed: 12/27/2022] Open
Abstract
Background In recent years, RNA molecules that are not translated into proteins (ncRNAs) have drawn a great deal of attention, as they were shown to be involved in many cellular functions. One of the most important computational problems regarding ncRNA is to predict the secondary structure of a molecule from its sequence. In particular, we attempted to predict the secondary structure for a set of unaligned ncRNA molecules that are taken from the same family, and thus presumably have a similar structure. Results We developed the RNAspa program, which comparatively predicts the secondary structure for a set of ncRNA molecules in linear time in the number of molecules. We observed that in a list of several hundred suboptimal minimal free energy (MFE) predictions, as provided by the RNAsubopt program of the Vienna package, it is likely that at least one suggested structure would be similar to the true, correct one. The suboptimal solutions of each molecule are represented as a layer of vertices in a graph. The shortest path in this graph is the basis for structural predictions for the molecule. We also show that RNA secondary structures can be compared very rapidly by a simple string Edit-Distance algorithm with a minimal loss of accuracy. We show that this approach allows us to more deeply explore the suboptimal structure space. Conclusion The algorithm was tested on three datasets which include several ncRNA families taken from the Rfam database. These datasets allowed for comparison of the algorithm with other methods. In these tests, RNAspa performed better than four other programs.
Collapse
Affiliation(s)
- Yair Horesh
- Department of Computer Sciences, Bar-Ilan University, Ramat-Gan 52900, Israel
| | - Tirza Doniger
- The Mina & Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan 52900, Israel
| | - Shulamit Michaeli
- The Mina & Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan 52900, Israel
| | - Ron Unger
- The Mina & Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan 52900, Israel
| |
Collapse
|
208
|
Seemann SE, Gilchrist MJ, Hofacker IL, Stadler PF, Gorodkin J. Detection of RNA structures in porcine EST data and related mammals. BMC Genomics 2007; 8:316. [PMID: 17845718 PMCID: PMC2072958 DOI: 10.1186/1471-2164-8-316] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2007] [Accepted: 09/10/2007] [Indexed: 11/18/2022] Open
Abstract
Background Non-coding RNAs (ncRNAs) are involved in a wide spectrum of regulatory functions. Within recent years, there have been increasing reports of observed polyadenylated ncRNAs and mRNA like ncRNAs in eukaryotes. To investigate this further, we examined the large data set in the Sino-Danish PigEST resource which also contains expression information distributed on 97 non-normalized cDNA libraries. Results We constructed a pipeline, EST2ncRNA, to search for known and novel ncRNAs. The pipeline utilises sequence similarity to ncRNA databases (blast), structure similarity to Rfam (RaveNnA) as well as multiple alignments to predict conserved novel putative RNA structures (RNAz). EST2ncRNA was fed with 48,000 contigs and 73,000 singletons available from the PigEST resource. Using the pipeline we identified known RNA structures in 137 contigs and single reads (conreads), and predicted high confidence RNA structures in non-protein coding regions of additional 1,262 conreads. Of these, structures in 270 conreads overlap with existing predictions in human. To sum up, the PigEST resource comprises trans-acting elements (ncRNAs) in 715 contigs and 340 singletons as well as cis-acting elements (inside UTRs) in 311 contigs and 51 singletons, of which 18 conreads contain both predictions of trans- and cis-acting elements. The predicted RNAz candidates were compared with the PigEST expression information and we identify 114 contigs with an RNAz prediction and expression in at least ten of the non-normalised cDNA libraries. We conclude that the contigs with RNAz and known predictions are in general expressed at a much lower level than protein coding transcripts. In addition, we also observe that our ncRNA candidates constitute about one to two percent of the genes expressed in the cDNA libraries. Intriguingly, the cDNA libraries from developmental (brain) tissues contain the highest amount of ncRNA candidates, about two percent. These observations are related to existing knowledge and hypotheses about the role of ncRNAs in higher organisms. Furthermore, about 80% porcine coding transcripts (of 18,600 identified) as well as less than one-third ORF-free transcripts are conserved at least in the closely related bovine genome. Approximately one percent of the coding and 10% of the remaining matches are unique between the PigEST data and cow genome. Based on the pig-cow alignments, we searched for similarities to 16 other organisms by UCSC available alignments, which resulted in a 87% coverage by the human genome for instance. Conclusion Besides recovering several of the already annotated functional RNA structures, we predicted a large number of high confidence conserved secondary structures in polyadenylated porcine transcripts. Our observations of relatively low expression levels of predicted ncRNA candidates together with the observations of higher relative amount in cDNA libraries from developmental stages are in agreement with the current paradigm of ncRNA roles in higher organisms and supports the idea of polyadenylated ncRNAs.
Collapse
Affiliation(s)
- Stefan E Seemann
- Division of Genetics and Bioinformatics, IBHV, University of Copenhagen, Grønnegårdsvej 3, DK-1870 Frederiksberg, Denmark
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Germany
| | - Michael J Gilchrist
- The Wellcome Trust/Cancer Research UK Gurdon Institute, Cambridge, CB2 1QN, UK
| | - Ivo L Hofacker
- Institute for Theoretical Chemistry and Structural Biology, University of Vienna, Austria
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Germany
- Institute for Theoretical Chemistry and Structural Biology, University of Vienna, Austria
| | - Jan Gorodkin
- Division of Genetics and Bioinformatics, IBHV, University of Copenhagen, Grønnegårdsvej 3, DK-1870 Frederiksberg, Denmark
| |
Collapse
|
209
|
Sone M, Hayashi T, Tarui H, Agata K, Takeichi M, Nakagawa S. The mRNA-like noncoding RNA Gomafu constitutes a novel nuclear domain in a subset of neurons. J Cell Sci 2007; 120:2498-506. [PMID: 17623775 DOI: 10.1242/jcs.009357] [Citation(s) in RCA: 248] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Recent transcriptome analyses have revealed that a large body of noncoding regions of mammalian genomes are actually transcribed into RNAs. Our understanding of the molecular features of these noncoding RNAs is far from complete. We have identified a novel mRNA-like noncoding gene, named Gomafu, which is expressed in a distinct set of neurons in the mouse nervous system. Interestingly, spliced mature Gomafu RNA is localized to the nucleus despite its mRNA-like characteristics, which usually act as potent export signals to the cytoplasm. Within the nucleus, Gomafu RNA is detected as numerous spots that do not colocalize with known nuclear domain markers. Gomafu RNA is extremely insoluble and remains intact after nuclear matrix preparation. Furthermore, heterokaryon assays revealed that Gomafu RNA does not shuttle between the nucleus and cytoplasm, but is retained in the nucleus after its transcription. We propose that Gomafu RNA represents a novel family of mRNA-like noncoding RNA that constitutes a cell-type-specific component of the nuclear matrix.
Collapse
Affiliation(s)
- Masamitsu Sone
- Nakagawa Initiative Research Unit, RIKEN, 2-1 Hirosawa, Wako 351-0198, Japan
| | | | | | | | | | | |
Collapse
|
210
|
Mehler MF, Mattick JS. Noncoding RNAs and RNA Editing in Brain Development, Functional Diversification, and Neurological Disease. Physiol Rev 2007; 87:799-823. [PMID: 17615389 DOI: 10.1152/physrev.00036.2006] [Citation(s) in RCA: 226] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The progressive maturation and functional plasticity of the nervous system in health and disease involve a dynamic interplay between the transcriptome and the environment. There is a growing awareness that the previously unexplored molecular and functional interface mediating these complex gene-environmental interactions, particularly in brain, may encompass a sophisticated RNA regulatory network involving the twin processes of RNA editing and multifaceted actions of numerous subclasses of non-protein-coding RNAs. The mature nervous system encompasses a wide range of cell types and interconnections. Long-term changes in the strength of synaptic connections are thought to underlie memory retrieval, formation, stabilization, and effector functions. The evolving nervous system involves numerous developmental transitions, such as neurulation, neural tube patterning, neural stem cell expansion and maintenance, lineage elaboration, differentiation, axonal path finding, and synaptogenesis. Although the molecular bases for these processes are largely unknown, RNA-based epigenetic mechanisms appear to be essential for orchestrating these precise and versatile biological phenomena and in defining the etiology of a spectrum of neurological diseases. The concerted modulation of RNA editing and the selective expression of non-protein-coding RNAs during seminal as well as continuous state transitions may comprise the plastic molecular code needed to couple the intrinsic malleability of neural network connections to evolving environmental influences to establish diverse forms of short- and long-term memory, context-specific behavioral responses, and sophisticated cognitive capacities.
Collapse
Affiliation(s)
- Mark F Mehler
- Institute for Brain Disorders and Neural Regeneration, Department of Neurology, Einstein Cancer Center, Albert Einstein College of Medicine, Bronx, New York 10461, USA.
| | | |
Collapse
|
211
|
Dominski Z. Nucleases of the metallo-beta-lactamase family and their role in DNA and RNA metabolism. Crit Rev Biochem Mol Biol 2007; 42:67-93. [PMID: 17453916 DOI: 10.1080/10409230701279118] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Proteins of the metallo-beta-lactamase family with either demonstrated or predicted nuclease activity have been identified in a number of organisms ranging from bacteria to humans and has been shown to be important constituents of cellular metabolism. Nucleases of this family are believed to utilize a zinc-dependent mechanism in catalysis and function as 5' to 3' exonucleases and or endonucleases in such processes as 3' end processing of RNA precursors, DNA repair, V(D)J recombination, and telomere maintenance. Examples of metallo-beta-lactamase nucleases include CPSF-73, a known component of the cleavage/polyadenylation machinery, which functions as the endonuclease in 3' end formation of both polyadenylated and histone mRNAs, and Artemis that opens DNA hairpins during V(D)J recombination. Mutations in two metallo-beta-lactamase nucleases have been implicated in human diseases: tRNase Z required for 3' processing of tRNA precursors has been linked to the familial form of prostate cancer, whereas inactivation of Artemis causes severe combined immunodeficiency (SCID). There is also a group of as yet uncharacterized proteins of this family in bacteria and archaea that based on sequence similarity to CPSF-73 are predicted to function as nucleases in RNA metabolism. This article reviews the cellular roles of nucleases of the metallo-beta-lactamase family and the recent advances in studying these proteins.
Collapse
Affiliation(s)
- Zbigniew Dominski
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA.
| |
Collapse
|
212
|
Benyo B, Biro JC, Benyo Z. Codes in the codons: construction of a codon/amino acid periodic table and a study of the nature of specific nucleic acid-protein interactions. CONFERENCE PROCEEDINGS : ... ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL CONFERENCE 2007; 2004:2860-3. [PMID: 17270874 DOI: 10.1109/iembs.2004.1403815] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2/sup nd/) amino acid in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
Collapse
Affiliation(s)
- B Benyo
- Dept. of Informatics, Szechenyi Istvan Univ., Gyor, Hungary
| | | | | |
Collapse
|
213
|
Washietl S, Pedersen JS, Korbel JO, Stocsits C, Gruber AR, Hackermüller J, Hertel J, Lindemeyer M, Reiche K, Tanzer A, Ucla C, Wyss C, Antonarakis SE, Denoeud F, Lagarde J, Drenkow J, Kapranov P, Gingeras TR, Guigó R, Snyder M, Gerstein MB, Reymond A, Hofacker IL, Stadler PF. Structured RNAs in the ENCODE selected regions of the human genome. Genes Dev 2007; 17:852-64. [PMID: 17568003 PMCID: PMC1891344 DOI: 10.1101/gr.5650707] [Citation(s) in RCA: 136] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2006] [Accepted: 12/12/2006] [Indexed: 12/16/2022]
Abstract
Functional RNA structures play an important role both in the context of noncoding RNA transcripts as well as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack characteristic signals in primary sequence, comparative approaches evaluating evolutionary conservation of structures are most promising. We have used three recently introduced programs based on either phylogenetic-stochastic context-free grammar (EvoFold) or energy directed folding (RNAz and AlifoldZ), yielding several thousand candidate structures (corresponding to approximately 2.7% of the ENCODE regions). EvoFold has its highest sensitivity in highly conserved and relatively AU-rich regions, while RNAz favors slightly GC-rich regions, resulting in a relatively small overlap between methods. Comparison with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3'-UTRs. While we estimate a significant false discovery rate of approximately 50%-70% many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz and EvoFold, and an additional 239 RNAz or EvoFold predictions are supported by the (more stringent) AlifoldZ algorithm. Five hundred seventy RNAz structure predictions fall into regions that show signs of selection pressure also on the sequence level (i.e., conserved elements). More than 700 predictions overlap with noncoding transcripts detected by oligonucleotide tiling arrays. One hundred seventy-five selected candidates were tested by RT-PCR in six tissues, and expression could be verified in 43 cases (24.6%).
Collapse
Affiliation(s)
- Stefan Washietl
- Institute for Theoretical Chemistry, University of Vienna, A-1090 Wien, Austria.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
214
|
Abstract
Ordered heterogeneity was introduced as a basic feature of the living state in the mid-1950s. It was later expanded to "order in the large over heterogeneity in the small" as the first principle of a theory of organisms. Several examples of ordered heterogeneity were given at the time to illustrate the principle, but many more have become apparent since then to confirm its generality. They include minimum size requirements for progressive embryological development, the errant behavior of cells liberated from tissue architecture, their sorting out to reconstitute tissues on reaggregation, and contact regulation of cell proliferation. There is increasing heterogeneity of cell growth with age, and marked heterogeneity of many characters among cells of solid epithelial tumors. Normal growth behavior is reintroduced in solitary, carcinogen-initiated epidermal cells by contact with an excess of normal epidermal cells. Contact normalization also occurs when solitary hepatocarcinoma cells are transplanted into the parenchyma of normal liver of young, but not of old, animals. The role of the plasma membrane and adhesion molecules in ordering heterogeneity is evaluated. Organizing the results in a conceptual structure helps to understand classical observations of tumor biology such as the lifetime quiescence of carcinogen-initiated epidermal cells and the marked increase of cancer incidence with age. The principle of order above heterogeneity thus provides a unifying framework for a variety of seemingly unrelated processes in normal and neoplastic development. Whereas contact between cells is required for these processes to occur, gap junctional communication is not required.
Collapse
Affiliation(s)
- Harry Rubin
- Department of Molecular and Cell Biology, Life Sciences Addition, University of California, Berkeley, California 94720, USA
| |
Collapse
|
215
|
Discovery and characterization of 91 novel transcripts expressed in cattle placenta. BMC Genomics 2007; 8:113. [PMID: 17488528 PMCID: PMC1884150 DOI: 10.1186/1471-2164-8-113] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2006] [Accepted: 05/09/2007] [Indexed: 01/22/2023] Open
Abstract
Background Among the eutherian mammals, placental architecture varies to a greater extent than any other tissue. The diversity of placental types, even within a single mammalian order suggests that genes expressed in placenta are under strong Darwinian selection. Thus, the ruminant placenta may be a rich source of genes to explore adaptive evolutionary responses in mammals. The aim of our study was to identify novel transcripts expressed in ruminant placenta, and to characterize them with respect to their expression patterns, organization of coding sequences in the genome, and potential functions. Results A combination of bioinformatics, comparative genomics and transcript profiling was used to identify and characterize 91 novel transcripts (NTs) represented in a cattle placenta cDNA library. These NTs have no significant similarity to any non-ferungulate DNA or RNA sequence. Proteins longer than 100 aa were predicted for 29 NTs, and 21 are candidate non-coding RNAs. Eighty-six NTs were found to be expressed in one or more of 18 different tissues, with 39 (42%) showing tissue-preference, including six that were expressed exclusively in placentome. The authenticity of the NTs was confirmed by their alignment to cattle genome sequence, 42 of which showed evidence of mRNA splicing. Analysis of the genomic context where NT genes reside revealed 61 to be in intergenic regions, whereas 30 are within introns of known genes. The genes encoding the NTs were found to be significantly associated with subtelomeric regions. Conclusion The 91 lineage-specific transcripts are a useful resource for studying adaptive evolutionary responses of the ruminant placenta. The presence of so many genes encoding NTs in cattle but not primates or rodents suggests that gene loss and gain are important mechanisms of genome evolution in mammals. Furthermore, the clustering of NT genes within subtelomeric regions suggests that such regions are highly dynamic and may foster the birth of novel genes. The sequencing of additional vertebrate genomes with defined phylogenetic relationships will permit the search for lineage-specific genes to take on a more evolutionary context that is required to understand their origins and functions.
Collapse
|
216
|
Kapranov P, Willingham AT, Gingeras TR. Genome-wide transcription and the implications for genomic organization. Nat Rev Genet 2007; 8:413-23. [PMID: 17486121 DOI: 10.1038/nrg2083] [Citation(s) in RCA: 546] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Recent evidence of genome-wide transcription in several species indicates that the amount of transcription that occurs cannot be entirely accounted for by current sets of genome-wide annotations. Evidence indicates that most of both strands of the human genome might be transcribed, implying extensive overlap of transcriptional units and regulatory elements. These observations suggest that genomic architecture is not colinear, but is instead interleaved and modular, and that the same genomic sequences are multifunctional: that is, used for multiple independently regulated transcripts and as regulatory regions. What are the implications and consequences of such an interleaved genomic architecture in terms of increased information content, transcriptional complexity, evolution and disease states?
Collapse
Affiliation(s)
- Philipp Kapranov
- Affymetrix, Inc., 3420 Central Expressway, Santa Clara, California 95051, USA
| | | | | |
Collapse
|
217
|
Abstract
SUMMARY
It is usually thought that the development of complex organisms is controlled by protein regulatory factors and morphogenetic signals exchanged between cells and differentiating tissues during ontogeny. However, it is now evident that the majority of all animal genomes is transcribed, apparently in a developmentally regulated manner, suggesting that these genomes largely encode RNA machines and that there may be a vast hidden layer of RNA regulatory transactions in the background. I propose that the epigenetic trajectories of differentiation and development are primarily programmed by feed-forward RNA regulatory networks and that most of the information required for multicellular development is embedded in these networks, with cell–cell signalling required to provide important positional information and to correct stochastic errors in the endogenous RNA-directed program.
Collapse
Affiliation(s)
- John S Mattick
- ARC Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, St Lucia QLD 4072, Australia.
| |
Collapse
|
218
|
Witzany G. Natural genome-editing competences of viruses. Acta Biotheor 2007; 54:235-53. [PMID: 17347785 DOI: 10.1007/s10441-006-9000-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2006] [Accepted: 12/22/2006] [Indexed: 01/28/2023]
Abstract
It is becoming increasingly evident that the driving forces of evolutionary novelty are not randomly derived chance mutations of the genetic text, but a precise genome editing by omnipresent viral agents. These competences integrate the whole toolbox of natural genetic engineering, replication, transcription, translation, genomic imprinting, genomic creativity, enzymatic inventions and all types of genetic repair patterns. Even the non-coding, repetitive DNA sequences which were interpreted as being ancient remnants of former evolutionary stages are now recognized as being of viral descent and crucial for higher-order regulatory and constitutional functions of protein structural vocabulary. In this article I argue that non-randomly derived natural genome editing can be envisioned as (a) combinatorial (syntactic), (b) context-specific (pragmatic) and (c) content-sensitive (semantic) competences of viral agents. These three-leveled biosemiotic competences could explain the emergence of complex new phenotypes in single evolutionary events. After short descriptions of the non-coding regulatory networks, major viral life strategies and pre-cellular viral life three of the major steps in evolution serve as examples: There is growing evidence that natural genome-editing competences of viruses are essential (1) for the evolution of the eukaryotic nucleus, (2) the adaptive immune system and (3) the placental mammals.
Collapse
Affiliation(s)
- Günther Witzany
- Philosophische Praxis, Vogelsangstr. 18c, 5111, Buermoos, Salzburg, Austria.
| |
Collapse
|
219
|
Abstract
Genome-wide epigenetic modification plays a pivotal role in regulating gene expression through chromatin structure and stability, tissue-specific and embryonic developmental specific gene regulation, and genomic imprinting. Mechanisms include chromatin remodeling through histone modification and DNA methylation, RNA associated gene silencing and chromosome inactivation, and genomic imprinting. These epigenetic mechanisms provide an added layer of transcriptional control of gene expression beyond those associated with variation in the sequence of the DNA. Variation in epigenetic regulation helps explain genetic diversity, but significant changes in epigenetic regulation can produce diseases. Advances in understanding epigenetic mechanisms have been accompanied by new therapeutic options and targets for treatment. This review focuses on a basic understanding of epigenetics and some of the diseases associated with epigenetic alterations.
Collapse
Affiliation(s)
- Darrel Waggoner
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA.
| |
Collapse
|
220
|
Carninci P, Hayashizaki Y. Noncoding RNA transcription beyond annotated genes. Curr Opin Genet Dev 2007; 17:139-44. [PMID: 17317145 DOI: 10.1016/j.gde.2007.02.008] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2006] [Accepted: 02/12/2007] [Indexed: 11/20/2022]
Abstract
Recent analyses based on high-throughput transcriptome data have revealed that the fraction of the genome that is transcribed largely exceeds the fraction encoding protein. Transcription of unconventional genes into noncoding RNAs is widespread and, in mammals, these RNAs comprise at least half the total number of RNAs transcribed by RNA polymerase II. Although the function of the majority of noncoding RNAs has yet to be discovered, many of them are transcribed from both strands of the genome, and evidence points towards a regulatory function for many noncoding RNAs in mammalian cells.
Collapse
Affiliation(s)
- Piero Carninci
- Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan.
| | | |
Collapse
|
221
|
Zhang Z, Pang AWC, Gerstein M. Comparative analysis of genome tiling array data reveals many novel primate-specific functional RNAs in human. BMC Evol Biol 2007; 7 Suppl 1:S14. [PMID: 17288572 PMCID: PMC1796608 DOI: 10.1186/1471-2148-7-s1-s14] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Background Widespread transcription activities in the human genome were recently observed in high-resolution tiling array experiments, which revealed many novel transcripts that are outside of the boundaries of known protein or RNA genes. Termed as "TARs" (Transcriptionally Active Regions), these novel transcribed regions represent "dark matter" in the genome, and their origin and functionality need to be explained. Many of these transcripts are thought to code for novel proteins or non-protein-coding RNAs. We have applied an integrated bioinformatics approach to investigate the properties of these TARs, including cross-species conservation, and the ability to form stable secondary structures. The goal of this study is to identify a list of potential candidate sequences that are likely to code for functional non-protein-coding RNAs. We are particularly interested in the discovery of those functional RNA candidates that are primate-specific, i.e. those that do not have homologs in the mouse or dog genomes but in rhesus. Results Using sequence conservation and the probability of forming stable secondary structures, we have identified ~300 possible candidates for primate-specific noncoding RNAs. We are currently in the process of sequencing the orthologous regions of these candidate sequences in several other primate species. We will then be able to apply a "phylogenetic shadowing" approach to analyze the functionality of these ncRNA candidates. Conclusion The existence of potential primate-specific functional transcripts has demonstrated the limitation of previous genome comparison studies, which put too much emphasis on conservation between human and rodents. It also argues for the necessity of sequencing additional primate species to gain a better and more comprehensive understanding of the human genome.
Collapse
Affiliation(s)
- Zhaolei Zhang
- Banting & Best Department of Medical Research, Donnelly CCBR, University of Toronto, Toronto, ON M5S 3E1, Canada.
| | | | | |
Collapse
|
222
|
Prasanth KV, Spector DL. Eukaryotic regulatory RNAs: an answer to the 'genome complexity' conundrum. Genes Dev 2007; 21:11-42. [PMID: 17210785 DOI: 10.1101/gad.1484207] [Citation(s) in RCA: 301] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
A large portion of the eukaryotic genome is transcribed as noncoding RNAs (ncRNAs). While once thought of primarily as "junk," recent studies indicate that a large number of these RNAs play central roles in regulating gene expression at multiple levels. The increasing diversity of ncRNAs identified in the eukaryotic genome suggests a critical nexus between the regulatory potential of ncRNAs and the complexity of genome organization. We provide an overview of recent advances in the identification and function of eukaryotic ncRNAs and the roles played by these RNAs in chromatin organization, gene expression, and disease etiology.
Collapse
|
223
|
Schmidt AL, Anderson LM. Repetitive DNA elements as mediators of genomic change in response to environmental cues. Biol Rev Camb Philos Soc 2007. [DOI: 10.1111/j.1469-185x.2006.tb00217.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
224
|
Panzitt K, Tschernatsch MMO, Guelly C, Moustafa T, Stradner M, Strohmaier HM, Buck CR, Denk H, Schroeder R, Trauner M, Zatloukal K. Characterization of HULC, a novel gene with striking up-regulation in hepatocellular carcinoma, as noncoding RNA. Gastroenterology 2007; 132:330-42. [PMID: 17241883 DOI: 10.1053/j.gastro.2006.08.026] [Citation(s) in RCA: 624] [Impact Index Per Article: 34.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/09/2005] [Accepted: 07/20/2006] [Indexed: 12/11/2022]
Abstract
BACKGROUND & AIMS Recent studies have highlighted the role of noncoding RNAs (ncRNAs) in carcinogenesis, and suggested that this class of genes might be used as biomarkers in cancer. We searched the human genome for novel genes including ncRNAs related to hepatocellular carcinoma (HCC). METHODS An HCC-specific gene library was generated and screened for deregulated genes with 46 HCCs, 4 focal nodular hyperplasias, and 7 cirrhoses utilizing cDNA arrays. Sequencing of library clones identified a novel ncRNA as the most up-regulated gene in HCC. This gene was also cloned from different monkeys and characterized by quantitative RT-PCR, Northern blot analysis and in situ hybridization. Structural and functional studies included comparative sequence and protein expression analyses, quantitative RT-PCR of polysomal preparations, and siRNA-mediated knockdown experiments. RESULTS The most up-regulated gene in HCC named highly up-regulated in liver cancer (HULC) was characterized as a novel mRNA-like ncRNA. HULC RNA is spliced and polyadenlyated, and resembles the mammalian LTR transposon 1A. It does not contain substantial open reading frames, and no native translation product was detected. HULC is present in the cytoplasm, where it copurifies with ribosomes. siRNA-mediated knockdown of HULC RNA in 2 HCC cell lines altered the expression of several genes, 5 of which were known to be affected in HCC, suggesting a role for HULC in post-transcriptional modulation of gene expression. CONCLUSIONS HULC is the first ncRNA with highly specific up-regulation in HCC. Because HULC was detected in blood of HCC patients, a potential use as novel biomarker can be envisaged.
Collapse
Affiliation(s)
- Katrin Panzitt
- Institute of Pathology, Medical University of Graz, Auenbruggerplatz 25, A-8036 Graz, Austria
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
225
|
Huang CF, Kaur J, Maguitman A, Rocha LM. Agent-based model of genotype editing. EVOLUTIONARY COMPUTATION 2007; 15:253-89. [PMID: 17705779 DOI: 10.1162/evco.2007.15.3.253] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Evolutionary algorithms rarely deal with ontogenetic, non-inherited alteration of genetic information because they are based on a direct genotype-phenotype mapping. In contrast, several processes have been discovered in nature which alter genetic information encoded in DNA before it is translated into amino-acid chains. Ontogenetically altered genetic information is not inherited but extensively used in regulation and development of phenotypes, giving organisms the ability to, in a sense, re-program their genotypes according to environmental cues. An example of post-transcriptional alteration of gene-encoding sequences is the process of RNA Editing. Here we introduce a novel Agent-based model of genotype editing and a computational study of its evolutionary performance in static and dynamic environments. This model builds on our previous Genetic Algorithm with Editing, but presents a fundamentally novel architecture in which coding and non-coding genetic components are allowed to co-evolve. Our goals are: (1) to study the role of RNA Editing regulation in the evolutionary process, (2) to understand how genotype editing leads to a different, and novel evolutionary search algorithm, and (3) the conditions under which genotype editing improves the optimization performance of traditional evolutionary algorithms. We show that genotype editing allows evolving agents to perform better in several classes of fitness functions, both in static and dynamic environments. We also present evidence that the indirect genotype/phenotype mapping resulting from genotype editing leads to a better exploration/exploitation compromise of the search process. Therefore, we show that our biologically-inspired model of genotype editing can be used to both facilitate understanding of the evolutionary role of RNA regulation based on genotype editing in biology, and advance the current state of research in Evolutionary Computation.
Collapse
|
226
|
Abstract
The bovine genome sequence in ‘draft’ form will be complete in 2007. The availability of the sequence and very large numbers of single nucleotide polymorphisms will have profound effects on livestock production. The dairy industry is well positioned to capture the benefits of this enormous and enabling resource because of its comprehensive databases containing phenotypic and pedigree data for large numbers of animals, intense utilisation of genetics in breeding programs and efficient management of reproductive performance. The bovine genome sequence will assist in the development of novel products, especially value-added products, and markedly enhance the rate of genetic gain in the Australian dairy population. The immediate challenge facing the industry is the integration of new technological capabilities into existing breeding programs and production systems.
Collapse
|
227
|
Taft RJ, Pheasant M, Mattick JS. The relationship between non-protein-coding DNA and eukaryotic complexity. Bioessays 2007; 29:288-99. [PMID: 17295292 DOI: 10.1002/bies.20544] [Citation(s) in RCA: 417] [Impact Index Per Article: 23.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
There are two intriguing paradoxes in molecular biology--the inconsistent relationship between organismal complexity and (1) cellular DNA content and (2) the number of protein-coding genes--referred to as the C-value and G-value paradoxes, respectively. The C-value paradox may be largely explained by varying ploidy. The G-value paradox is more problematic, as the extent of protein coding sequence remains relatively static over a wide range of developmental complexity. We show by analysis of sequenced genomes that the relative amount of non-protein-coding sequence increases consistently with complexity. We also show that the distribution of introns in complex organisms is non-random. Genes composed of large amounts of intronic sequence are significantly overrepresented amongst genes that are highly expressed in the nervous system, and amongst genes downregulated in embryonic stem cells and cancers. We suggest that the informational paradox in complex organisms may be explained by the expansion of cis-acting regulatory elements and genes specifying trans-acting non-protein-coding RNAs.
Collapse
Affiliation(s)
- Ryan J Taft
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, St Lucia, Australia
| | | | | |
Collapse
|
228
|
Khanam T, Rozhdestvensky TS, Bundman M, Galiveti CR, Handel S, Sukonina V, Jordan U, Brosius J, Skryabin BV. Two primate-specific small non-protein-coding RNAs in transgenic mice: neuronal expression, subcellular localization and binding partners. Nucleic Acids Res 2006; 35:529-39. [PMID: 17175535 PMCID: PMC1802616 DOI: 10.1093/nar/gkl1082] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
In a rare occasion a single chromosomal locus was targeted twice by independent Alu-related retroposon insertions, and in both cases supported neuronal expression of the respective inserted genes encoding small non-protein coding RNAs (npcRNAs): BC200 RNA in anthropoid primates and G22 RNA in the Lorisoidea branch of prosimians. To avoid primate experimentation, we generated transgenic mice to study neuronal expression and protein binding partners for BC200 and G22 npcRNAs. The BC200 gene, with sufficient upstream flanking sequences, is expressed in transgenic mouse brain areas comparable to those in human brain, and G22 gene, with upstream flanks, has a similar expression pattern. However, when all upstream regions of the G22 gene were removed, expression was completely abolished, despite the presence of intact internal RNA polymerase III promoter elements. Transgenic BC200 RNA is transported into neuronal dendrites as it is in human brain. G22 RNA, almost twice as large as BC200 RNA, has a similar subcellular localization. Both transgenically expressed npcRNAs formed RNP complexes with poly(A) binding protein and the heterodimer SRP9/14, as does BC200 RNA in human. These observations strongly support the possibility that the independently exapted npcRNAs have similar functions, perhaps in translational regulation of dendritic protein biosynthesis in neurons of the respective primates.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Jürgen Brosius
- To whom correspondence should be addressed. Tel: +49 251 8358511; Fax: +49 251 8358512;
| | | |
Collapse
|
229
|
Abstract
For a long time, molecular evolutionary biologists have been focused on DNA and proteins, whereas RNA has lived in the shadow of its famous chemical cousins as a mere intermediary. Although this perspective has begun to change since genome-wide transcriptional profiling was successfully extended to evolutionary biology, it still echoes in evolutionary literature. In this mini-review, new developments of RNA biochemistry and transcriptomics are brought to the attention of evolutionary biologists. In particular, the unexpected abundance and functional significance of noncoding RNAs is briefly reviewed. Noncoding RNAs control a remarkable range of biological pathways and processes, all with obvious fitness consequences, such as initiation of translation, mRNA abundance, transposon jumping, chromosome architecture, stem cell maintenance, development of brain and muscles, insulin secretion, cancerogenesis and plant resistance to viral infections.
Collapse
Affiliation(s)
- P Michalak
- Department of Biology, The University of Texas at Arlington, Arlington, TX 76010, USA.
| |
Collapse
|
230
|
Pang KC, Stephen S, Dinger ME, Engström PG, Lenhard B, Mattick JS. RNAdb 2.0--an expanded database of mammalian non-coding RNAs. Nucleic Acids Res 2006; 35:D178-82. [PMID: 17145715 PMCID: PMC1751534 DOI: 10.1093/nar/gkl926] [Citation(s) in RCA: 133] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
RNAdb is a comprehensive database of mammalian non-protein-coding RNAs (ncRNAs). There is increasing recognition that ncRNAs play important regulatory roles in multicellular organisms, and there is an expanding rate of discovery of novel ncRNAs as well as an increasing allocation of function. In this update to RNAdb, we provide nucleotide sequences and annotations for tens of thousands of non-housekeeping ncRNAs, including a wide range of mammalian microRNAs, small nucleolar RNAs and larger mRNA-like ncRNAs. Some of these have documented functions and/or expression patterns, but the majority remain of unclear significance, and include PIWI-interacting RNAs, ncRNAs identified from the latest rounds of large-scale cDNA sequencing projects, putative antisense transcripts, as well as ncRNAs predicted on the basis of structural features and alignments. Improvements to the database comprise not only new and updated ncRNA datasets, but also provision of microarray-based expression data and closer interface with more specialized ncRNA resources such as miRBase and snoRNA-LBME-db. To access RNAdb, visit http://research.imb.uq.edu.au/RNAdb.
Collapse
Affiliation(s)
- Ken C. Pang
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of QueenslandBrisbane, Queensland 4072, Australia
- T cell Laboratory, Ludwig Institute for Cancer Research, Melbourne Centre for Clinical Sciences, Austin Hospital, HeidelbergVictoria 3084, Australia
| | - Stuart Stephen
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of QueenslandBrisbane, Queensland 4072, Australia
| | - Marcel E. Dinger
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of QueenslandBrisbane, Queensland 4072, Australia
| | - Pär G. Engström
- Computational Biology Unit, Bergen Center for Computational Science, University of BergenBergen, Norway
- Programme for Genomics and Bioinformatics, Department of Cell and Molecular Biology, Karolinska InstitutetStockholm, Sweden
| | - Boris Lenhard
- Computational Biology Unit, Bergen Center for Computational Science, University of BergenBergen, Norway
| | - John S. Mattick
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of QueenslandBrisbane, Queensland 4072, Australia
- To whom correspondence should be addressed. Tel: + 61 7 3346 2079; Fax: +1 61 7 3346 2111;
| |
Collapse
|
231
|
Abstract
Current knowledge about the variety and complexity of the processes that allow regulated gene expression in living organisms calls for a new understanding of genes. A 'postgenomic' understanding of genes as entities constituted during genome expression is outlined and illustrated with specific examples that formed part of a survey research instrument developed by two of the authors for an ongoing empirical study of conceptual change in contemporary biology.
Collapse
Affiliation(s)
- Karola C Stotz
- Cognitive Science Program, Indiana University Bloomington, Bloomington, IN 47406-7512, USA.
| | | | | |
Collapse
|
232
|
Ingraham SE, Lynch RA, Surti U, Rutter JL, Buckler AJ, Khan SA, Menon AG, Lepont P. Identification and characterization of novel human transcripts embedded within HMGA2 in t(12;14)(q15;q24.1) uterine leiomyoma. Mutat Res 2006; 602:43-53. [PMID: 17045619 DOI: 10.1016/j.mrfmmm.2006.07.007] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2006] [Revised: 07/28/2006] [Accepted: 07/31/2006] [Indexed: 11/22/2022]
Abstract
The high mobility group A2 protein (HMGA2) has been implicated in the pathogenesis of mesenchymal tumors such as leiomyoma, lipoma and hamartoma. HMGA2 was pinpointed by mapping the breakpoints in the chromosomal translocations in 12q15, especially the t(12;14) that is commonly seen in uterine leiomyoma. It is generally assumed that altered expression of HMGA2 is an early event in the pathway to tumor formation. Here, we show evidence that three novel transcripts, A15, B6 and D12 are located within the HMGA2 gene itself and are transcribed from the opposite strand. These embedded transcripts are expressed at 6-20-fold higher levels in tumors compared to matched myometrium from the same patients. We estimate that the domain of increased expression extends 500kb on chromosome 12q15, and encompasses the majority of t(12;14) translocation breakpoints. However, a corresponding domain of consistently altered expression is not seen on chromosome 14 or outside of the chromosome 12 multiple aberration region. These data suggest that t(12;14) breakpoints contribute to the pathogenesis of uterine leiomyoma by interrupting a complex regulation of HMGA2 and other genes embedded within and around it. We also discovered a novel laminin receptor gene, transcribed from the opposite strand, within the promoter region of HMGA2. Although the roles for these embedded transcripts are still unknown, preliminary data suggest that they are members of the family of non-coding RNA and that they may play an important role in the pathology of uterine leiomyoma.
Collapse
Affiliation(s)
- Susan E Ingraham
- Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati, Cincinnati, OH 45267-0524, USA
| | | | | | | | | | | | | | | |
Collapse
|
233
|
Abstract
The knowledge about classes of non-coding RNAs (ncRNAs) is growing very fast and it is mainly the structure which is the common characteristic property shared by members of the same class. For correct characterization of such classes it is therefore of great importance to analyse the structural features in great detail. In this manuscript I present RNAlishapes which combines various secondary structure analysis methods, such as suboptimal folding and shape abstraction, with a comparative approach known as RNA alignment folding. RNAlishapes makes use of an extended thermodynamic model and covariance scoring, which allows to reward covariation of paired bases. Applying the algorithm to a set of bacterial trp-operon leaders using shape abstraction it was able to identify the two alternating conformations of this attenuator. Besides providing in-depth analysis methods for aligned RNAs, the tool also shows a fairly well prediction accuracy. Therefore, RNAlishapes provides the community with a powerful tool for structural analysis of classes of RNAs and is also a reasonable method for consensus structure prediction based on sequence alignments. RNAlishapes is available for online use and download at .
Collapse
Affiliation(s)
- Björn Voss
- Experimental Bioinformatics, Institute of Biology II, Freiburg University, Schänzlestrasse 1, 79104 Freiburg, Germany.
| |
Collapse
|
234
|
Abstract
Eukaryotic genomes display segmental patterns of variation in various properties, including GC content and degree of evolutionary conservation. DNA segmentation algorithms are aimed at identifying statistically significant boundaries between such segments. Such algorithms may provide a means of discovering new classes of functional elements in eukaryotic genomes. This paper presents a model and an algorithm for Bayesian DNA segmentation and considers the feasibility of using it to segment whole eukaryotic genomes. The algorithm is tested on a range of simulated and real DNA sequences, and the following conclusions are drawn. Firstly, the algorithm correctly identifies non-segmented sequence, and can thus be used to reject the null hypothesis of uniformity in the property of interest. Secondly, estimates of the number and locations of change-points produced by the algorithm are robust to variations in algorithm parameters and initial starting conditions and correspond to real features in the data. Thirdly, the algorithm is successfully used to segment human chromosome 1 according to GC content, thus demonstrating the feasibility of Bayesian segmentation of eukaryotic genomes. The software described in this paper is available from the author's website (www.uq.edu.au/ approximately uqjkeith/) or upon request to the author.
Collapse
Affiliation(s)
- Jonathan M Keith
- Department of Mathematics, University of Queensland, Brisbane, Australia.
| |
Collapse
|
235
|
Abstract
The past few years have brought about a fundamental change in our understanding and definition of the RNA world and its role in the functional and regulatory architecture of the cell. The discovery of small RNAs that regulate many aspects of differentiation and development have joined the already known non-coding RNAs that are involved in chromosome dosage compensation, imprinting, and other functions to become key players in regulating the flow of genetic information. It is also evident that there are tens or even hundreds of thousands of other non-coding RNAs that are transcribed from the mammalian genome, as well as many other yet-to-be-discovered small regulatory RNAs. In the recent symposium RNA: Networks & Imaging held in Heidelberg, the dual roles of RNA as a messenger and a regulator in the flow of genetic information were discussed and new molecular genetic and imaging methods to study RNA presented.
Collapse
Affiliation(s)
- Marc Kenzelmann
- Department of Cellular and Molecular Pathology, German Cancer Research Center, Heidelberg, Germany.
| | | | | |
Collapse
|
236
|
Carninci P. Tagging mammalian transcription complexity. Trends Genet 2006; 22:501-10. [PMID: 16859803 DOI: 10.1016/j.tig.2006.07.003] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2006] [Revised: 05/24/2006] [Accepted: 07/04/2006] [Indexed: 11/18/2022]
Abstract
The nature of the 'transcriptome' is more complex than first realized. Although CAGE, various tagging technologies and tiling arrays show that most of the mammalian genome is transcribed, a large proportion of transcripts do not encode proteins and are either poorly polyadenylated, involved in sense-antisense pairs or never leave the nucleus. In this article, I review the various techniques and data sets that are currently used to measure gene transcription and the evidence that reveals the true extent of transcription in mammalian genomes. The next few years will see efforts to identify novel transcripts systematically and decipher their function. A deeper understanding of transcriptional complexity might even lead us to redefine what we mean by the term 'gene'.
Collapse
Affiliation(s)
- Piero Carninci
- Genome Science Laboratory, Discovery and Research Institute, RIKEN Wako Institute, 2-1 Hirosawa, Wako, Saitama 351-0198 Japan.
| |
Collapse
|
237
|
Gustincich S, Sandelin A, Plessy C, Katayama S, Simone R, Lazarevic D, Hayashizaki Y, Carninci P. The complexity of the mammalian transcriptome. J Physiol 2006; 575:321-32. [PMID: 16857706 PMCID: PMC1819450 DOI: 10.1113/jphysiol.2006.115568] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
A comprehensive understanding of protein and regulatory networks is strictly dependent on the complete description of the transcriptome of cells. After the determination of the genome sequence of several mammalian species, gene identification is based on in silico predictions followed by evidence of transcription. Conservative estimates suggest that there are about 20,000 protein-encoding genes in the mammalian genome. In the last few years the combination of full-length cDNA cloning, cap-analysis gene expression (CAGE) tag sequencing and tiling arrays experiments have unveiled unexpected additional complexities in the transcriptome. Here we describe the current view of the mammalian transcriptome focusing on transcripts diversity, the growing non-coding RNA world, the organization of transcriptional units in the genome and promoter structures. In-depth analysis of the brain transcriptome has been challenging due to the cellular complexity of this organ. Here we present a computational analysis of CAGE data from different regions of the central nervous system, suggesting distinctive mechanisms of brain-specific transcription.
Collapse
Affiliation(s)
- Stefano Gustincich
- Sector of Neurobiology, International School for Advanced Studies (ISAS)-SISSA, AREA Science Park, SS 14, Km 163,5, Basovizza, 34012 Trieste, Italy.
| | | | | | | | | | | | | | | |
Collapse
|
238
|
Missal K, Zhu X, Rose D, Deng W, Skogerbø G, Chen R, Stadler PF. Prediction of structured non-coding RNAs in the genomes of the nematodesCaenorhabditis elegans andCaenorhabditis briggsae. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2006; 306:379-92. [PMID: 16425273 DOI: 10.1002/jez.b.21086] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
We present a survey for non-coding RNAs and other structured RNA motifs in the genomes of Caenorhabditis elegans and Caenorhabditis briggsae using the RNAz program. This approach explicitly evaluates comparative sequence information to detect stabilizing selection acting on RNA secondary structure. We detect 3,672 structured RNA motifs, of which only 678 are known non-translated RNAs (ncRNAs) or clear homologs of known C. elegans ncRNAs. Most of these signals are located in introns or at a distance from known protein-coding genes. With an estimated false positive rate of about 50% and a sensitivity on the order of 50%, we estimate that the nematode genomes contain between 3,000 and 4,000 RNAs with evolutionary conserved secondary structures. Only a small fraction of these belongs to the known RNA classes, including tRNAs, snoRNAs, snRNAs, or microRNAs. A relatively small class of ncRNA candidates is associated with previously observed RNA-specific upstream elements.
Collapse
Affiliation(s)
- Kristin Missal
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstrasse 16 18, D 04107 Leipzig, Germany.
| | | | | | | | | | | | | |
Collapse
|
239
|
Abstract
Increasing evidence suggests that the development and function of the nervous system is heavily dependent on RNA editing and the intricate spatiotemporal expression of a wide repertoire of non-coding RNAs, including micro RNAs, small nucleolar RNAs and longer non-coding RNAs. Non-coding RNAs may provide the key to understanding the multi-tiered links between neural development, nervous system function, and neurological diseases.
Collapse
Affiliation(s)
- Mark F Mehler
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, St Lucia, QLD 4072, Australia
| | | |
Collapse
|
240
|
Witzany G. Serial Endosymbiotic Theory (set): the biosemiotic update. Acta Biotheor 2006; 54:103-17. [PMID: 16988903 DOI: 10.1007/s10441-006-7831-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2005] [Accepted: 03/07/2006] [Indexed: 10/24/2022]
Abstract
The Serial Endosymbiotic Theory explains the origin of nucleated eukaryotic cells by a merging of archaebacterial and eubacterial cells. The paradigmatic change is that the driving force behind evolution is not ramification but merging. Lynn Margulis describes the symbiogenetic processes in the language of mechanistic biology in such terms as "merging", "fusion", and "incorporation". Biosemiotics argues that all cell-cell interactions are (rule-governed) sign-mediated interactions, i.e., communication processes. As the description of plant communication demonstrates, the biosemiotic approach is not limited to the level of molecular biology, but is also helpful in examining all sign-mediated interactions between organisms on the phenotypic level. If biosemiotics also uses the notions of "language" and "communication" to describe non-human sign-mediated interactions, then the underlying scientific justification of such usage should be critically considered. Therefore, I summarize the history of this discussion held between 1920 and 1980 and present its result, the pragmatic turn.
Collapse
Affiliation(s)
- Günther Witzany
- telos - Philosophische Praxis, Vogelsangstr. 18c, A-5111-Buermoos/Salzburg, Austria.
| |
Collapse
|
241
|
Costain WJ, Rasquinha I, Graber T, Luebbert C, Preston E, Slinn J, Xie X, MacManus JP. Cerebral ischemia induces neuronal expression of novel VL30 mouse retrotransposons bound to polyribosomes. Brain Res 2006; 1094:24-37. [PMID: 16730676 DOI: 10.1016/j.brainres.2006.03.120] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2005] [Revised: 03/14/2006] [Accepted: 03/23/2006] [Indexed: 01/27/2023]
Abstract
Mammalian genomes are burdened with a large heterogeneous group of endogenous replication defective retroviruses (retrotransposons). Previously, we identified a transcript resembling a virus-like 30S (VL30) retrotransposon increasing in mouse brain following transient cerebral ischemia. Paradoxically, this non-coding RNA was found bound to polyribosomes. Further analysis revealed that multiple retrotransposon species (BVL-1-like and mVL30-1-like) were bound to polyribosomes and induced by ischemia. These VL30 transcripts remained associated with polyribosomes in the presence of 0.5 M KCl, indicating that VL30 mRNA was tightly associated with ribosomal subunits. Furthermore, the profile of BVL-1 distribution on polyribosomal profiles was distinct from those of translated and translationally repressed mRNA. Consistent with expectations, 5.0 kb VL30 transcripts were detected in ischemic brain with a temporal pattern of expression that was distinct from c-fos. Expression of VL30 was localized in neurons using a combination of in situ hybridization and immunocytochemistry. 3'-RACE-PCR experiments yielded two unique sequences (VL30x-1 and VL30x-2) that were homologous to known VL30 genes. Phylogenetic analysis of VL30 promoter sequence (U3 region) resulted in the identification of two large VL30 subgroups. VL30x-1 and VL30x-2 were closely related and classified in a group that was distinct from the well-characterized VL30 genes BVL-1 and mVL30-1. The promoter regions of VL30x-1 and VL30x-2 did not possess the consensus sequences for either hypoxia or anoxia response elements, suggesting an alternative mechanism for induction. This is the first report that demonstrates ischemia-induced, neuronal expression of unique VL30 retrotransposons in mouse brain.
Collapse
Affiliation(s)
- Willard J Costain
- Institute for Biological Sciences M54, National Research Council, Montreal Road Laboratories, Ottawa, ON, Canada K1A 0R6.
| | | | | | | | | | | | | | | |
Collapse
|
242
|
Blanco E, Rojas R, Haeger P, Cuevas R, Perez C, Munita R, Quiroz G, Andrés ME, Forray MI, Gysling K. Intron retention as an alternative splice variant of the rat urocortin 1 gene. Neuroscience 2006; 140:1245-52. [PMID: 16650605 DOI: 10.1016/j.neuroscience.2006.03.031] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2006] [Revised: 02/28/2006] [Accepted: 03/07/2006] [Indexed: 11/26/2022]
Abstract
Urocortin 1, highly conserved metazoan gene of the corticotropin-releasing hormone family, is a simple gene structured in two exons and the corresponding intron. The urocortin 1 prepropeptide is entirely coded in the second exon. Preliminary non-isotopic in situ hybridization experiments with an oligonucleotide complementary to an intron sequence of the urocortin 1 gene showed a significant cytoplasmic-like staining, suggesting the occurrence of an intron-retained urocortin 1 transcript. This observation prompted us to study whether the urocortin 1 gene presents alternative splicing by intron retention event. Confocal fluorescent in situ hybridization for urocortin 1 RNA and the use of the specific DNA dye TOPRO-3 allowed us to show significant expression of the intron-retained urocortin 1 transcript that did not colocalize with TOPRO-3 staining indicating a cytoplasmic localization for the intron-retained urocortin 1 transcript. The natural occurrence of a polyadenylated intron-retained urocortin 1 RNA was further documented by reverse transcriptase polymerase chain reaction (PCR), primed with oligo(dT), of total RNA extracted from three brain regions, a midbrain region containing the Edinger-Westphal nucleus, cerebellum and prefrontal cortex. In the three brain regions studied, it was possible to amplify both intron-less as well as intron-retained urocortin 1 transcripts. The use of PCR primers that simultaneously amplify both urocortin 1 transcripts allowed us to show that the expression of both urocortin 1 transcripts differs among the brain regions analyzed, suggesting a tissue specific regulation of this alternative splicing. In silico analysis of the five known mammalian urocortin 1 genomic sequences showed high conservation of the urocortin 1 intron sequence. Further studies should investigate the regulation of this intron retention event and its consequence for the functionality of the urocortin 1 gene.
Collapse
Affiliation(s)
- E Blanco
- Department of Cellular and Molecular Biology, Faculty of Biological Sciences, Catholic University of Chile, Alameda 340, 833-1010, Santiago, Chile
| | | | | | | | | | | | | | | | | | | |
Collapse
|
243
|
Babendure JR, Babendure JL, Ding JH, Tsien RY. Control of mammalian translation by mRNA structure near caps. RNA (NEW YORK, N.Y.) 2006; 12:851-61. [PMID: 16540693 PMCID: PMC1440912 DOI: 10.1261/rna.2309906] [Citation(s) in RCA: 219] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
The scanning model of RNA translation proposes that highly stable secondary structures within mRNAs can inhibit translation, while structures of lower thermal stability also affect translation if close enough to the 5' methyl G cap. However, only fragmentary information is available about the dependence of translation efficiency in live mammalian cells on the thermodynamic stability, location, and GC content of RNA structures in the 5'-untranslated region. We devised a two-color fluorescence assay for translation efficiency in single live cells and compared a wide range of hairpins with predicted thermal stabilities ranging from -10 to -50 kcal/mol and 5' G cap-to-hairpin distances of 1-46 bases. Translation efficiency decreased abruptly as hairpin stabilities increased from deltaG = -25 to -35 kcal/mol. Shifting a hairpin as little as nine bases relative to the 5' cap could modulate translation more than 50-fold. Increasing GC content diminished translation efficiency when predicted thermal stability and cap-to-hairpin distances were held constant. We additionally found naturally occurring 5'-untranslated regions affected translation differently in live cells compared with translation in in vitro lysates. Our study will assist scientists in designing experiments that deliberately modulate mammalian translation with designed 5' UTRs.
Collapse
Affiliation(s)
- Jeremy R Babendure
- Department of Pharmacology, University of California, San Diego, La Jolla 92093-0647, USA
| | | | | | | |
Collapse
|
244
|
Abstract
The rapid expansion of methods for measuring biological data ranging from DNA sequence variations to mRNA expression and protein abundance presents the opportunity to utilize multiple types of information jointly in the study of human health and disease. Organisms are complex systems that integrate inputs at myriad levels to arrive at an observable phenotype. Therefore, it is essential that questions concerning the etiology of phenotypes as complex as common human diseases take the systemic nature of biology into account, and integrate the information provided by each data type in a manner analogous to the operation of the body itself. While limited in scope, the initial forays into the joint analysis of multiple data types have yielded interesting results that would not have been reached had only one type of data been considered. These early successes, along with the aforementioned theoretical appeal of data integration, provide impetus for the development of methods for the parallel, high-throughput analysis of multiple data types. The idea that the integrated analysis of multiple data types will improve the identification of biomarkers of clinical endpoints, such as disease susceptibility, is presented as a working hypothesis.
Collapse
Affiliation(s)
- David M Reif
- Center for Human Genetics Research, Vanderbilt University Medical School, 519 Light Hall, Nashville, TN 37232-0700, USA.
| | | | | |
Collapse
|
245
|
Abstract
The term non-coding RNA (ncRNA) is commonly employed for RNA that does not encode a protein, but this does not mean that such RNAs do not contain information nor have function. Although it has been generally assumed that most genetic information is transacted by proteins, recent evidence suggests that the majority of the genomes of mammals and other complex organisms is in fact transcribed into ncRNAs, many of which are alternatively spliced and/or processed into smaller products. These ncRNAs include microRNAs and snoRNAs (many if not most of which remain to be identified), as well as likely other classes of yet-to-be-discovered small regulatory RNAs, and tens of thousands of longer transcripts (including complex patterns of interlacing and overlapping sense and antisense transcripts), most of whose functions are unknown. These RNAs (including those derived from introns) appear to comprise a hidden layer of internal signals that control various levels of gene expression in physiology and development, including chromatin architecture/epigenetic memory, transcription, RNA splicing, editing, translation and turnover. RNA regulatory networks may determine most of our complex characteristics, play a significant role in disease and constitute an unexplored world of genetic variation both within and between species.
Collapse
Affiliation(s)
- John S Mattick
- Australian Research Council Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, St Lucia, QLD 4072, Australia.
| | | |
Collapse
|
246
|
Ginger MR, Shore AN, Contreras A, Rijnkels M, Miller J, Gonzalez-Rimbau MF, Rosen JM. A noncoding RNA is a potential marker of cell fate during mammary gland development. Proc Natl Acad Sci U S A 2006; 103:5781-6. [PMID: 16574773 PMCID: PMC1420634 DOI: 10.1073/pnas.0600745103] [Citation(s) in RCA: 146] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2005] [Indexed: 12/26/2022] Open
Abstract
PINC is a large, alternatively spliced, developmentally regulated, noncoding RNA expressed in the regressed terminal ductal lobular unit-like structures of the parous mammary gland. Previous studies have shown that this population of cells possesses not only progenitor-like qualities (the ability to proliferate and repopulate a mammary gland) and the ability to survive developmentally programmed cell death but also the inhibition of carcinogen-induced proliferation. Here we report that PINC expression is temporally and spatially regulated in response to developmental stimuli in vivo and that PINC RNA is localized to distinct foci in either the nucleus or the cytoplasm in a cell-cycle-specific manner. Loss-of-function experiments suggest that PINC performs dual roles in cell survival and regulation of cell-cycle progression, suggesting that PINC may contribute to the developmentally mediated changes previously observed in the terminal ductal lobular unit-like structures of the parous gland. This is one of the first reports describing the functional properties of a large, developmentally regulated, mammalian, noncoding RNA.
Collapse
Affiliation(s)
| | - Amy N. Shore
- Program in Developmental Biology, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030; and
| | | | - Monique Rijnkels
- U.S. Department of Agriculture/Agricultural Research Services Children’s Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, 1100 Bates Street, Houston, TX 77030
| | | | | | | |
Collapse
|
247
|
|
248
|
Bernhart SH, Tafer H, Mückstein U, Flamm C, Stadler PF, Hofacker IL. Partition function and base pairing probabilities of RNA heterodimers. Algorithms Mol Biol 2006; 1:3. [PMID: 16722605 PMCID: PMC1459172 DOI: 10.1186/1748-7188-1-3] [Citation(s) in RCA: 200] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2006] [Accepted: 03/16/2006] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND RNA has been recognized as a key player in cellular regulation in recent years. In many cases, non-coding RNAs exert their function by binding to other nucleic acids, as in the case of microRNAs and snoRNAs. The specificity of these interactions derives from the stability of inter-molecular base pairing. The accurate computational treatment of RNA-RNA binding therefore lies at the heart of target prediction algorithms. METHODS The standard dynamic programming algorithms for computing secondary structures of linear single-stranded RNA molecules are extended to the co-folding of two interacting RNAs. RESULTS We present a program, RNAcofold, that computes the hybridization energy and base pairing pattern of a pair of interacting RNA molecules. In contrast to earlier approaches, complex internal structures in both RNAs are fully taken into account. RNAcofold supports the calculation of the minimum energy structure and of a complete set of suboptimal structures in an energy band above the ground state. Furthermore, it provides an extension of McCaskill's partition function algorithm to compute base pairing probabilities, realistic interaction energies, and equilibrium concentrations of duplex structures.
Collapse
Affiliation(s)
- Stephan H Bernhart
- Theoretical Biochemistry Group, Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, Vienna, Austria
| | - Hakim Tafer
- Theoretical Biochemistry Group, Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, Vienna, Austria
| | - Ulrike Mückstein
- Theoretical Biochemistry Group, Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, Vienna, Austria
| | - Christoph Flamm
- Theoretical Biochemistry Group, Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, Vienna, Austria
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstrasse 16–18, D-04170 Leipzig, Germany
| | - Peter F Stadler
- Theoretical Biochemistry Group, Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, Vienna, Austria
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstrasse 16–18, D-04170 Leipzig, Germany
- The Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, New Mexico
| | - Ivo L Hofacker
- Theoretical Biochemistry Group, Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, Vienna, Austria
| |
Collapse
|
249
|
Sevignani C, Calin GA, Siracusa LD, Croce CM. Mammalian microRNAs: a small world for fine-tuning gene expression. Mamm Genome 2006; 17:189-202. [PMID: 16518686 PMCID: PMC2679635 DOI: 10.1007/s00335-005-0066-3] [Citation(s) in RCA: 264] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2005] [Accepted: 11/30/2005] [Indexed: 12/19/2022]
Abstract
The basis of eukaryotic complexity is an intricate genetic architecture where parallel systems are involved in tuning gene expression, via RNA-DNA, RNA-RNA, RNA-protein, and DNA-protein interactions. In higher organisms, about 97% of the transcriptional output is represented by noncoding RNA (ncRNA) encompassing not only rRNA, tRNA, introns, 5′ and 3′ untranslated regions, transposable elements, and intergenic regions, but also a large, rapidly emerging family named microRNAs. MicroRNAs are short 20-22-nucleotide RNA molecules that have been shown to regulate the expression of other genes in a variety of eukaryotic systems. MicroRNAs are formed from larger transcripts that fold to produce hairpin structures and serve as substrates for the cytoplasmic Dicer, a member of the RNase III enzyme family. A recent analysis of the genomic location of human microRNA genes suggested that 50% of microRNA genes are located in cancer-associated genomic regions or in fragile sites. This review focuses on the possible implications of microRNAs in post-transcriptional gene regulation in mammalian diseases, with particular focus on cancer. We argue that developing mouse models for deleted and/or overexpressed microRNAs will be of invaluable interest to decipher the regulatory networks where microRNAs are involved.
Collapse
Affiliation(s)
- Cinzia Sevignani
- Department of Microbiology and Immunology, Kimmel Cancer Center, Jefferson Medical College, Philadelphia, Pennsylvania 19107 USA
| | - George A. Calin
- Department of Molecular Virology, Immunology and Medical Genetics and Comprehensive Cancer Center, Ohio State University, Columbus Ohio, 43210 USA
| | - Linda D. Siracusa
- Department of Microbiology and Immunology, Kimmel Cancer Center, Jefferson Medical College, Philadelphia, Pennsylvania 19107 USA
| | - Carlo M. Croce
- Department of Molecular Virology, Immunology and Medical Genetics and Comprehensive Cancer Center, Ohio State University, Columbus Ohio, 43210 USA
| |
Collapse
|
250
|
Hertel J, Lindemeyer M, Missal K, Fried C, Tanzer A, Flamm C, Hofacker IL, Stadler PF, The Students of Bioinformatics Computer Labs 2004 and 2005. The expansion of the metazoan microRNA repertoire. BMC Genomics 2006; 7:25. [PMID: 16480513 PMCID: PMC1388199 DOI: 10.1186/1471-2164-7-25] [Citation(s) in RCA: 258] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2005] [Accepted: 02/15/2006] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND MicroRNAs have been identified as crucial regulators in both animals and plants. Here we report on a comprehensive comparative study of all known miRNA families in animals. We expand the MicroRNA Registry 6.0 by more than 1000 new homologs of miRNA precursors whose expression has been verified in at least one species. Using this uniform data basis we analyze their evolutionary history in terms of individual gene phylogenies and in terms of preservation of genomic nearness across species. This allows us to reliably identify microRNA clusters that are derived from a common transcript. RESULTS We identify three episodes of microRNA innovation that correspond to major developmental innovations: A class of about 20 miRNAs is common to protostomes and deuterostomes and might be related to the advent of bilaterians. A second large wave of innovations maps to the branch leading to the vertebrates. The third significant outburst of miRNA innovation coincides with placental (eutherian) mammals. In addition, we observe the expected expansion of the microRNA inventory due to genome duplications in early vertebrates and in an ancestral teleost. The non-local duplications in the vertebrate ancestor are predated by local (tandem) duplications leading to the formation of about a dozen ancient microRNA clusters. CONCLUSION Our results suggest that microRNA innovation is an ongoing process. Major expansions of the metazoan miRNA repertoire coincide with the advent of bilaterians, vertebrates, and (placental) mammals.
Collapse
Affiliation(s)
- Jana Hertel
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
| | - Manuela Lindemeyer
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
| | - Kristin Missal
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
| | - Claudia Fried
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
| | - Andrea Tanzer
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
- Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, A-1090 Wien, Austria
| | - Christoph Flamm
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
- Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, A-1090 Wien, Austria
| | - Ivo L Hofacker
- Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, A-1090 Wien, Austria
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
- Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, A-1090 Wien, Austria
- The Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe NM 87501
| | | |
Collapse
|