1
|
Zhu H, Ding Q. Lower expression level of two RAGE alternative splicing isoforms in Alzheimer’s disease. Neurosci Lett 2015; 597:66-70. [DOI: 10.1016/j.neulet.2015.04.032] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2015] [Revised: 04/13/2015] [Accepted: 04/18/2015] [Indexed: 11/26/2022]
|
2
|
Large-scale whole-genome sequencing of the Icelandic population. Nat Genet 2015; 47:435-44. [PMID: 25807286 DOI: 10.1038/ng.3247] [Citation(s) in RCA: 564] [Impact Index Per Article: 56.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2014] [Accepted: 02/13/2015] [Indexed: 11/09/2022]
Abstract
Here we describe the insights gained from sequencing the whole genomes of 2,636 Icelanders to a median depth of 20×. We found 20 million SNPs and 1.5 million insertions-deletions (indels). We describe the density and frequency spectra of sequence variants in relation to their functional annotation, gene position, pathway and conservation score. We demonstrate an excess of homozygosity and rare protein-coding variants in Iceland. We imputed these variants into 104,220 individuals down to a minor allele frequency of 0.1% and found a recessive frameshift mutation in MYL4 that causes early-onset atrial fibrillation, several mutations in ABCB4 that increase risk of liver diseases and an intronic variant in GNAS associating with increased thyroid-stimulating hormone levels when maternally inherited. These data provide a study design that can be used to determine how variation in the sequence of the human genome gives rise to human diversity.
Collapse
|
3
|
Alternative splicing of mutually exclusive exons—A review. Biosystems 2013; 114:31-8. [DOI: 10.1016/j.biosystems.2013.07.003] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2013] [Accepted: 07/03/2013] [Indexed: 12/16/2022]
|
4
|
Huang H, Yu S, Liu H, Sun X. Nucleosome organization in sequences of alternative events in human genome. Biosystems 2012; 109:214-9. [DOI: 10.1016/j.biosystems.2012.05.011] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2011] [Revised: 05/25/2012] [Accepted: 05/28/2012] [Indexed: 12/01/2022]
|
5
|
Pillmann H, Hatje K, Odronitz F, Hammesfahr B, Kollmar M. Predicting mutually exclusive spliced exons based on exon length, splice site and reading frame conservation, and exon sequence homology. BMC Bioinformatics 2011; 12:270. [PMID: 21718515 PMCID: PMC3228551 DOI: 10.1186/1471-2105-12-270] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2011] [Accepted: 06/30/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Alternative splicing of pre-mature RNA is an important process eukaryotes utilize to increase their repertoire of different protein products. Several types of different alternative splice forms exist including exon skipping, differential splicing of exons at their 3'- or 5'-end, intron retention, and mutually exclusive splicing. The latter term is used for clusters of internal exons that are spliced in a mutually exclusive manner. RESULTS We have implemented an extension to the WebScipio software to search for mutually exclusive exons. Here, the search is based on the precondition that mutually exclusive exons encode regions of the same structural part of the protein product. This precondition provides restrictions to the search for candidate exons concerning their length, splice site conservation and reading frame preservation, and overall homology. Mutually exclusive exons that are not homologous and not of about the same length will not be found. Using the new algorithm, mutually exclusive exons in several example genes, a dynein heavy chain, a muscle myosin heavy chain, and Dscam were correctly identified. In addition, the algorithm was applied to the whole Drosophila melanogaster X chromosome and the results were compared to the Flybase annotation and an ab initio prediction. Clusters of mutually exclusive exons might be subsequent to each other and might encode dozens of exons. CONCLUSIONS This is the first implementation of an automatic search for mutually exclusive exons in eukaryotes. Exons are predicted and reconstructed in the same run providing the complete gene structure for the protein query of interest. WebScipio offers high quality gene structure figures with the clusters of mutually exclusive exons colour-coded, and several analysis tools for further manual inspection. The genome scale analysis of all genes of the Drosophila melanogaster X chromosome showed that WebScipio is able to find all but two of the 28 annotated mutually exclusive spliced exons and predicts 39 new candidate exons. Thus, WebScipio should be able to identify mutually exclusive spliced exons in any query sequence from any species with a very high probability. WebScipio is freely available to academics at http://www.webscipio.org.
Collapse
Affiliation(s)
- Holger Pillmann
- Abteilung NMR basierte Strukturbiologie, Max-Planck-Institut für Biophysikalische Chemie, Am Fassberg 11, D-37077 Göttingen, Germany
| | | | | | | | | |
Collapse
|
6
|
Ferreira EN, Rangel MCR, Galante PF, de Souza JE, Molina GC, de Souza SJ, Carraro DM. Alternative splicing enriched cDNA libraries identify breast cancer-associated transcripts. BMC Genomics 2010; 11 Suppl 5:S4. [PMID: 21210970 PMCID: PMC3045797 DOI: 10.1186/1471-2164-11-s5-s4] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Background Alternative splicing (AS) is a central mechanism in the generation of genomic complexity and is a major contributor to transcriptome and proteome diversity. Alterations of the splicing process can lead to deregulation of crucial cellular processes and have been associated with a large spectrum of human diseases. Cancer-associated transcripts are potential molecular markers and may contribute to the development of more accurate diagnostic and prognostic methods and also serve as therapeutic targets. Alternative splicing-enriched cDNA libraries have been used to explore the variability generated by alternative splicing. In this study, by combining the use of trapping heteroduplexes and RNA amplification, we developed a powerful approach that enables transcriptome-wide exploration of the AS repertoire for identifying AS variants associated with breast tumor cells modulated by ERBB2 (HER-2/neu) oncogene expression. Results The human breast cell line (C5.2) and a pool of 5 ERBB2 over-expressing breast tumor samples were used independently for the construction of two AS-enriched libraries. In total, 2,048 partial cDNA sequences were obtained, revealing 214 alternative splicing sequence-enriched tags (ASSETs). A subset with 79 multiple exon ASSETs was compared to public databases and reported 138 different AS events. A high success rate of RT-PCR validation (94.5%) was obtained, and 2 novel AS events were identified. The influence of ERBB2-mediated expression on AS regulation was evaluated by capillary electrophoresis and probe-ligation approaches in two mammary cell lines (Hb4a and C5.2) expressing different levels of ERBB2. The relative expression balance between AS variants from 3 genes was differentially modulated by ERBB2 in this model system. Conclusions In this study, we presented a method for exploring AS from any RNA source in a transcriptome-wide format, which can be directly easily adapted to next generation sequencers. We identified AS transcripts that were differently modulated by ERBB2-mediated expression and that can be tested as molecular markers for breast cancer. Such a methodology will be useful for completely deciphering the cancer cell transcriptome diversity resulting from AS and for finding more precise molecular markers.
Collapse
Affiliation(s)
- Elisa N Ferreira
- Laboratory of Genomics and Molecular Biology, Hospital A.C. Camargo, Fundação Antonio Prudente, São Paulo, 01509-900, Brazil
| | | | | | | | | | | | | |
Collapse
|
7
|
Mouse chromosome 17 candidate modifier genes for thrombosis. Mamm Genome 2010; 21:337-49. [PMID: 20700597 PMCID: PMC2923722 DOI: 10.1007/s00335-010-9274-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2009] [Accepted: 06/14/2010] [Indexed: 12/14/2022]
Abstract
Two overlapping quantitative trait loci (QTLs) for clot stability, Hmtb8 and Hmtb9, were identified on mouse chromosome 17 in an F2 intercross derived from C57BL/6J (B6) and B6-Chr17(A/J) (B6-Chr17) mouse strains. The intervals were in synteny with a QTL for thrombotic susceptibility on chromosome 18 in a human study, and there were 23 homologs between mouse and human. The objective of this study was to determine whether any of these genes in the syntenic region are likely candidates as modifiers for clot stability. Seven genes, Twsg1, Zfp161, Dlgap1, Ralbp1, Myom1, Rab31, and Emilin2, of the 23 genes with single nucleotide polymorphisms (SNPs) in the mRNA-UTR had differential expression in B6 and A/J mice. Dlgap1, Ralbp1, Myom1, and Emilin2 also had nonsynonymous SNPs. In addition, two other genes had nonsynonymous SNPs, Lama1 and Ndc80. Of these nine candidate genes, Emilin2 was selected for further analysis since other EMILIN (Elastin Microfibril Interface Located Protein) proteins have known functions in vascular structure and coagulation. Differences were found between B6 and A/J mice in vessel wall architecture and EMILIN2 protein in plasma, carotid vessel wall, and thrombi formed after ferric chloride injury. In B6-Chr17(A/J) mice both clot stability and Emilin2 mRNA expression were higher compared to those in B6 and A/J mice, suggesting the exposure of epistatic interactions. Although other homologous genes in the QTL region cannot be ruled out as causative genes, further investigation of Emilin2 as a candidate gene for thrombosis susceptibility is warranted.
Collapse
|
8
|
Otto TD, Wilinski D, Assefa S, Keane TM, Sarry LR, Böhme U, Lemieux J, Barrell B, Pain A, Berriman M, Newbold C, Llinás M. New insights into the blood-stage transcriptome of Plasmodium falciparum using RNA-Seq. Mol Microbiol 2010; 76:12-24. [PMID: 20141604 PMCID: PMC2859250 DOI: 10.1111/j.1365-2958.2009.07026.x] [Citation(s) in RCA: 302] [Impact Index Per Article: 20.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Recent advances in high-throughput sequencing present a new opportunity to deeply probe an organism's transcriptome. In this study, we used Illumina-based massively parallel sequencing to gain new insight into the transcriptome (RNA-Seq) of the human malaria parasite, Plasmodium falciparum. Using data collected at seven time points during the intraerythrocytic developmental cycle, we (i) detect novel gene transcripts; (ii) correct hundreds of gene models; (iii) propose alternative splicing events; and (iv) predict 5' and 3' untranslated regions. Approximately 70% of the unique sequencing reads map to previously annotated protein-coding genes. The RNA-Seq results greatly improve existing annotation of the P. falciparum genome with over 10% of gene models modified. Our data confirm 75% of predicted splice sites and identify 202 new splice sites, including 84 previously uncharacterized alternative splicing events. We also discovered 107 novel transcripts and expression of 38 pseudogenes, with many demonstrating differential expression across the developmental time series. Our RNA-Seq results correlate well with DNA microarray analysis performed in parallel on the same samples, and provide improved resolution over the microarray-based method. These data reveal new features of the P. falciparum transcriptional landscape and significantly advance our understanding of the parasite's red blood cell-stage transcriptome.
Collapse
Affiliation(s)
- Thomas D Otto
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Sammeth M. Complete alternative splicing events are bubbles in splicing graphs. J Comput Biol 2009; 16:1117-40. [PMID: 19689216 DOI: 10.1089/cmb.2009.0108] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Eukaryotic splicing structures are known to involve a high degree of alternative forms derived from a premature transcript by alternative splicing (AS). With the advent of new sequencing technologies, evidence for new splice forms becomes increasingly available-bit by bit revealing that the true splicing diversity of "AS events" often comprises more than two alternatives and therefore cannot be sufficiently described by pairwise comparisons as conducted in analyzes hitherto. Especially, I emphasize on "complete" AS events which include all hitherto known variants of a splicing variation. Challenges emerge from the richness of data (millions of transcripts) and artifacts introduced during the technical process of obtaining transcript sequences ("noise")-especially when dealing with single-read sequences known as expressed sequence tags (ESTs). Herein, I describe a novel method to efficiently predict AS events in different resolutions ("dimensions") from transcript annotations that allows for combination of fragmented EST data with full-length cDNAs and can cope with large datasets containing noise. At the doorstep of many new splice forms becoming available by novel high-throughput sequencing technologies, the presented method helps to dynamically update AS databases. Applying this method to estimate the real complexity of alternative splicing, I found in human and murine annotations thousands of novel AS events that either have been disregarded or mischaracterized in earlier works. The growth of evidence for such events suggests that the number still keeps climbing. When considering complete events, the majority of exons that are observed as "mutually exclusive" in pairwise comparisons in fact involves at least one other alternative splice form that disagrees with their mutual exclusion. Similar observations also hold for the alternative skipping of two subsequent exons. Results suggest that the systematical analysis of complete AS events on large scale provides subtle insights in the mechanisms that drive (alternative) splicing.
Collapse
Affiliation(s)
- Michael Sammeth
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) , Barcelona, Spain.
| |
Collapse
|
10
|
Won J, Gifford E, Smith RS, Yi H, Ferreira PA, Hicks WL, Li T, Naggert JK, Nishina PM. RPGRIP1 is essential for normal rod photoreceptor outer segment elaboration and morphogenesis. Hum Mol Genet 2009; 18:4329-39. [PMID: 19679561 DOI: 10.1093/hmg/ddp385] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The function of the retinitis pigmentosa GTPase regulator interacting protein 1 (RPGRIP1) gene is currently not known. However, mutations within the gene lead to Leber Congenital Amaurosis and autosomal recessive retinitis pigmentosa in human patients. In a previously described knockout mouse model of the long splice variant of Rpgrip1, herein referred to as Rpgrip1(tm1Tili) mice, mislocalization of key outer segment proteins and dysmorphogenesis of outer segment discs preceded subsequent photoreceptor degeneration. In this report, we describe a new mouse model carrying a splice acceptor site mutation in Rpgrip1, herein referred to as Rpgrip1(nmf247) that is phenotypically distinct from Rpgrip1(tm1Tili) mice. Photoreceptor degeneration in homozygous Rpgrip1(nmf247) mice is earlier in onset and more severe when compared with Rpgrip1(tm1Tili) mice. Also, ultrastructural studies reveal that whereas Rpgrip1(nmf247) mutants have a normal structure and number of connecting cilia, unlike Rpgrip1(tm1Tili) mice, they do not elaborate rod outer segments (OS). Therefore, in addition to its role in OS disc morphogenesis, RPGRIP1 is essential for rod OS formation. Our study indicates the absence of multiple Rpgrip1 isoforms in Rpgrip1(nmf247) mice, suggesting different isoforms may play different roles in photoreceptors and underscores the importance of considering splice variants when generating targeted null mutations.
Collapse
Affiliation(s)
- Jungyeon Won
- The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Alternative splicing of transcription factors' genes: beyond the increase of proteome diversity. Comp Funct Genomics 2009:905894. [PMID: 19609452 PMCID: PMC2709715 DOI: 10.1155/2009/905894] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2008] [Revised: 04/06/2009] [Accepted: 05/18/2009] [Indexed: 11/29/2022] Open
Abstract
Functional modification of transcription regulators may lead to developmental changes and phenotypical differences between species. In this work, we study the influence of alternative splicing on transcription factors in human and mouse. Our results show that the impact of alternative splicing on transcription factors is similar in both species, meaning that the ways to increase variability should also be similar. However, when looking at the expression patterns of transcription factors, we observe that they tend to diverge regardless of the role of alternative splicing. Finally, we hypothesise that transcription regulation of alternatively spliced transcription factors could play an important role in the phenotypical differences between species, without discarding other phenomena or functional families.
Collapse
|
12
|
A general definition and nomenclature for alternative splicing events. PLoS Comput Biol 2008; 4:e1000147. [PMID: 18688268 PMCID: PMC2467475 DOI: 10.1371/journal.pcbi.1000147] [Citation(s) in RCA: 181] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2007] [Accepted: 07/01/2008] [Indexed: 11/19/2022] Open
Abstract
Understanding the molecular mechanisms responsible for the regulation of the transcriptome present in eukaryotic cells is one of the most challenging tasks in the postgenomic era. In this regard, alternative splicing (AS) is a key phenomenon contributing to the production of different mature transcripts from the same primary RNA sequence. As a plethora of different transcript forms is available in databases, a first step to uncover the biology that drives AS is to identify the different types of reflected splicing variation. In this work, we present a general definition of the AS event along with a notation system that involves the relative positions of the splice sites. This nomenclature univocally and dynamically assigns a specific “AS code” to every possible pattern of splicing variation. On the basis of this definition and the corresponding codes, we have developed a computational tool (AStalavista) that automatically characterizes the complete landscape of AS events in a given transcript annotation of a genome, thus providing a platform to investigate the transcriptome diversity across genes, chromosomes, and species. Our analysis reveals that a substantial part—in human more than a quarter—of the observed splicing variations are ignored in common classification pipelines. We have used AStalavista to investigate and to compare the AS landscape of different reference annotation sets in human and in other metazoan species and found that proportions of AS events change substantially depending on the annotation protocol, species-specific attributes, and coding constraints acting on the transcripts. The AStalavista system therefore provides a general framework to conduct specific studies investigating the occurrence, impact, and regulation of AS. The genome sequence is said to be an organism's blueprint, a set of instructions driving the organism's biology. The unfolding of these instructions—the so-called genes—is initiated by the transcription of DNA into RNA molecules, which subsequently are processed before they can take their functional role. During this processing step, initially identical RNA molecules may result in different products through a process known as alternative splicing (AS). AS therefore allows for widening the diversity from the limited repertoire of genes, and it is often postulated as an explanation for the apparent paradox that complex and simple organisms resemble in their number of genes; it characterizes species, individuals, and developmental and cellular conditions. Comparing the differences of AS products between cells may help to reveal the broad molecular basis underlying phenotypic differences—for instance, between a cancer and a normal cell. An obstacle for such comparisons has been that, so far, no paradigm existed to delineate each single quantum of AS, so-called AS events. Here, we describe a possibility of exhaustively decomposing AS complements into qualitatively different groups of events and a nomenclature to unequivocally denote them. This typological catalogue of AS events along with their observed frequencies represent the AS landscape, and we propose a procedure to automatically identify such landscapes. We use it to describe the human AS landscape and to investigate how it has changed throughout evolution.
Collapse
|
13
|
Oltean S, Febbo PG, Garcia-Blanco MA. Dunning rat prostate adenocarcinomas and alternative splicing reporters: powerful tools to study epithelial plasticity in prostate tumors in vivo. Clin Exp Metastasis 2008; 25:611-9. [PMID: 18523850 PMCID: PMC2471395 DOI: 10.1007/s10585-008-9186-y] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2008] [Accepted: 05/21/2008] [Indexed: 02/06/2023]
Abstract
Using alternative splicing reporters we have previously observed mesenchymal epithelial transitions in Dunning AT3 rat prostate tumors. We demonstrate here that the Dunning DT and AT3 cells, which express epithelial and mesenchymal markers, respectively, represent an excellent model to study epithelial transitions since these cells recapitulate gene expression profiles observed during human prostate cancer progression. In this manuscript we also present the development of two new tools to study the epithelial transitions by imaging alternative splicing decisions: a bichromatic fluorescence reporter to evaluate epithelial transitions in culture and in vivo, and a luciferase reporter to visualize the distribution of mesenchymal epithelial transitions in vivo.
Collapse
Affiliation(s)
- Sebastian Oltean
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Research Drive, Box 3053 (424 CARL), Durham, NC 27710 USA
- Center for RNA Biology, Duke University Medical Center, Durham, NC 27710 USA
| | - Phillip G. Febbo
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Research Drive, Box 3053 (424 CARL), Durham, NC 27710 USA
- Institute of Genome Science and Policy, Duke University Medical Center, Durham, NC 27710 USA
- Department of Medicine, Duke University Medical Center, Durham, NC 27710 USA
| | - Mariano A. Garcia-Blanco
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Research Drive, Box 3053 (424 CARL), Durham, NC 27710 USA
- Center for RNA Biology, Duke University Medical Center, Durham, NC 27710 USA
- Department of Medicine, Duke University Medical Center, Durham, NC 27710 USA
| |
Collapse
|
14
|
Abstract
Spliceosomal introns, a hallmark of eukaryotic gene organization, were an unexpected discovery. After three decades, crucial issues such as when and how introns first appeared in evolution remain unsettled. An issue yet to be answered is how intron positions arise de novo. Phylogenetic investigations concur that intron positions continue to emerge, at least in some lineages. Yet genomic scans for the sources of introns occupying new positions have been fruitless. Two alternative solutions to this paradox are: (i) formation of new intron positions halted before the recent past and (ii) it continues to occur, but through processes different from those generally assumed. One process generally dismissed is intron sliding--the relocation of a preexisting intron over short distances--because of supposed associated deleterious effects. The puzzle of intron gain arises owing to a pervasive operational definition of introns, which sees them as precisely demarcated segments of the genome separated from the neighboring nonintronic DNA by unmovable limits. Intron homology is defined as position homology. Recent studies of pre-mRNA processing indicate that this assumption needs to be revised. We incorporate recent advances on the evolutionarily frequent process of alternative splicing, by which exons of primary transcripts are spliced in different patterns, into a new model of intron sliding that accounts for the diversity of intron positions. We posit that intron positional diversity is driven by two overlapping processes: (i) background process of continuous relocation of preexisting introns by sliding and (ii) spurts of extensive gain/loss of new intron sequences.
Collapse
|
15
|
Multifactorial interplay controls the splicing profile of Alu-derived exons. Mol Cell Biol 2008; 28:3513-25. [PMID: 18332115 DOI: 10.1128/mcb.02279-07] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Exonization of Alu elements creates primate-specific genomic diversity. Here we combine bioinformatic and experimental methodologies to reconstruct the molecular changes leading to exon selection. Our analyses revealed an intricate network involved in Alu exonization. A typical Alu element contains multiple sites with the potential to serve as 5' splice sites (5'ss). First, we demonstrated the role of 5'ss strength in controlling exonization events. Second, we found that a cryptic 5'ss enhances the selection of a more upstream site and demonstrate that this is mediated by binding of U1 snRNA to the cryptic splice site, challenging the traditional role attributed to U1 snRNA of binding the 5'ss only. Third, we used a simple algorithm to identify specific sequences that determine splice site selection within specific Alu exons. Finally, by inserting identical exons within different sequences, we demonstrated the importance of flanking genomic sequences in determining whether an Alu exon will undergo exonization. Overall, our results demonstrate the complex interplay between at least four interacting layers that affect Alu exonization. These results shed light on the mechanism through which Alu elements enrich the primate transcriptome and allow a better understanding of the exonization process in general.
Collapse
|
16
|
Laskowski RA, Thornton JM. Understanding the molecular machinery of genetics through 3D structures. Nat Rev Genet 2008; 9:141-51. [PMID: 18160966 DOI: 10.1038/nrg2273] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Detailed knowledge of the three-dimensional structures of biological molecules has had an enormous impact on all areas of biological science, including genetics, as structure can reveal the fine details of how molecules perform their biological functions. Here we consider how changes in protein sequence affect the corresponding 3D structure, and describe how structural information about proteins, DNA and chromatin has shed light on gene regulatory mechanisms and the storage and transmission of epigenetic information. Finally, we describe how structure determination is benefiting from the high-throughput technologies of the worldwide structural genomics projects.
Collapse
Affiliation(s)
- Roman A Laskowski
- European Bioinformatics Institute, European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | | |
Collapse
|
17
|
Ben-Dov C, Hartmann B, Lundgren J, Valcárcel J. Genome-wide analysis of alternative pre-mRNA splicing. J Biol Chem 2007; 283:1229-33. [PMID: 18024428 DOI: 10.1074/jbc.r700033200] [Citation(s) in RCA: 109] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Alternative splicing of mRNA precursors allows the synthesis of multiple mRNAs from a single primary transcript, significantly expanding the information content and regulatory possibilities of higher eukaryotic genomes. High-throughput enabling technologies, particularly large-scale sequencing and splicing-sensitive microarrays, are providing unprecedented opportunities to address key questions in this field. The picture emerging from these pioneering studies is that alternative splicing affects most human genes and a significant fraction of the genes in other multicellular organisms, with the potential to greatly influence the evolution of complex genomes. A combinatorial code of regulatory signals and factors can deploy physiologically coherent programs of alternative splicing that are distinct from those regulated at other steps of gene expression. Pre-mRNA splicing and its regulation play important roles in human pathologies, and genome-wide analyses in this area are paving the way for improved diagnostic tools and for the identification of novel and more specific pharmaceutical targets.
Collapse
Affiliation(s)
- Claudia Ben-Dov
- Centre de Regulació Genòmica, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | | | | |
Collapse
|
18
|
Marinescu V, Loomis PA, Ehmann S, Beales M, Potashkin JA. Regulation of retention of FosB intron 4 by PTB. PLoS One 2007; 2:e828. [PMID: 17786200 PMCID: PMC1952174 DOI: 10.1371/journal.pone.0000828] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2007] [Accepted: 08/14/2007] [Indexed: 11/19/2022] Open
Abstract
One effect of stressors such as chronic drug administration is that sequence within the terminal exon of the transcription factor FosB is recognized as intronic and removed by alternative splicing. This results in an open-reading-frame shift that produces a translation stop codon and ultimately a truncated protein, termed DeltaFosB. In vitro splicing assays with control and mutated transcripts generated from a fosB mini-gene construct indicated a CU-rich sequence at the 3' end of intron 4 (I4) plays an important role in regulating fosB pre-mRNA splicing due to its binding of polypyrimidine tract binding protein (PTB). PTB binding to this sequence is dependent upon phosphorylation by protein kinase A and is blocked if the CU-rich sequence is mutated to a U-rich region. When this mutated fosB minigene is expressed in HeLa cells, the splicing efficiency of its product is increased compared to wild type. Moreover, transient transfection of PTB-1 in HeLa cells decreased the splicing efficiency of a wild type fosB minigene transcript. Depletion of PTB from nuclear extracts facilitated U2AF65 binding to wild type sequence in vitro, suggesting these proteins function in a dynamic equilibrium to modulate fosB pre-mRNA alternative splicing. These results demonstrate for the first time that phosphorylated PTB promotes intron retention and thereby silences the splicing of fosB I4.
Collapse
Affiliation(s)
- Victor Marinescu
- Department of Cellular and Molecular Pharmacology, The Chicago Medical School, Rosalind Franklin University of Medicine and Science, North Chicago, Illinois, United States of America
| | - Patricia A. Loomis
- Department of Cellular and Molecular Pharmacology, The Chicago Medical School, Rosalind Franklin University of Medicine and Science, North Chicago, Illinois, United States of America
| | - Svetlana Ehmann
- Department of Cellular and Molecular Pharmacology, The Chicago Medical School, Rosalind Franklin University of Medicine and Science, North Chicago, Illinois, United States of America
| | - Mitchell Beales
- Department of Cellular and Molecular Pharmacology, The Chicago Medical School, Rosalind Franklin University of Medicine and Science, North Chicago, Illinois, United States of America
| | - Judith A. Potashkin
- Department of Cellular and Molecular Pharmacology, The Chicago Medical School, Rosalind Franklin University of Medicine and Science, North Chicago, Illinois, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
19
|
A procedure for identifying homologous alternative splicing events. BMC Bioinformatics 2007; 8:260. [PMID: 17640387 PMCID: PMC1950890 DOI: 10.1186/1471-2105-8-260] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2007] [Accepted: 07/19/2007] [Indexed: 01/11/2023] Open
Abstract
Background The study of the functional role of alternative splice isoforms of a gene is a very active area of research in biology. The difficulty of the experimental approach (in particular, in its high-throughput version) leaves ample room for the development of bioinformatics tools that can provide a useful first picture of the problem. Among the possible approaches, one of the simplest is to follow classical protein function annotation protocols and annotate target alternative splice events with the information available from conserved events in other species. However, the application of this protocol requires a procedure capable of recognising such events. Here we present a simple but accurate method developed for this purpose. Results We have developed a method for identifying homologous, or equivalent, alternative splicing events, based on the combined use of neural networks and sequence searches. The procedure comprises four steps: (i) BLAST search for homologues of the two isoforms defining the target alternative splicing event; (ii) construction of all possible candidate events; (iii) scoring of the latter with a series of neural networks; and (iv) filtering of the results. When tested in a set of 473 manually annotated pairs of homologous events, our method showed a good performance, with an accuracy of 0.99, a precision of 0.98 and a sensitivity of 0.93. When no candidates were available, the specificity of our method varied between 0.81 and 0.91. Conclusion The method described in this article allows the identification of homologous alternative splicing events, with a good success rate, indicating that such method could be used for the development of functional annotation of alternative splice isoforms.
Collapse
|
20
|
Ferreira EN, Galante PAF, Carraro DM, de Souza SJ. Alternative splicing: a bioinformatics perspective. MOLECULAR BIOSYSTEMS 2007; 3:473-7. [PMID: 17579772 DOI: 10.1039/b702485c] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The degree of diversity at the transcriptome and proteome levels generated by alternative splicing is astonishing. In this review, we discuss several issues related to alternative splicing with a special emphasis on identification strategies based on bioinformatics.
Collapse
|
21
|
Liu F, Tøstesen E, Sundet JK, Jenssen TK, Bock C, Jerstad GI, Thilly WG, Hovig E. The human genomic melting map. PLoS Comput Biol 2007; 3:e93. [PMID: 17511513 PMCID: PMC1868775 DOI: 10.1371/journal.pcbi.0030093] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2006] [Accepted: 04/11/2007] [Indexed: 11/19/2022] Open
Abstract
In a living cell, the antiparallel double-stranded helix of DNA is a dynamically changing structure. The structure relates to interactions between and within the DNA strands, and the array of other macromolecules that constitutes functional chromatin. It is only through its changing conformations that DNA can organize and structure a large number of cellular functions. In particular, DNA must locally uncoil, or melt, and become single-stranded for DNA replication, repair, recombination, and transcription to occur. It has previously been shown that this melting occurs cooperatively, whereby several base pairs act in concert to generate melting bubbles, and in this way constitute a domain that behaves as a unit with respect to local DNA single-strandedness. We have applied a melting map calculation to the complete human genome, which provides information about the propensities of forming local bubbles determined from the whole sequence, and present a first report on its basic features, the extent of cooperativity, and correlations to various physical and biological features of the human genome. Globally, the melting map covaries very strongly with GC content. Most importantly, however, cooperativity of DNA denaturation causes this correlation to be weaker at resolutions fewer than 500 bps. This is also the resolution level at which most structural and biological processes occur, signifying the importance of the informational content inherent in the genomic melting map. The human DNA melting map may be further explored at http://meltmap.uio.no.
Collapse
Affiliation(s)
- Fang Liu
- Department of Tumor Biology, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
- PubGene AS, Vinderen, Oslo, Norway
| | - Eivind Tøstesen
- Department of Tumor Biology, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
| | | | | | - Christoph Bock
- Max-Planck-Institut für Informatik, Saarbrücken, Germany
| | - Geir Ivar Jerstad
- Department of Tumor Biology, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
| | - William G Thilly
- Biological Engineering Division, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Eivind Hovig
- Department of Tumor Biology, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
- Institute of Informatics, University of Oslo, Norway
- Medical Informatics, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
| |
Collapse
|
22
|
Leparc GG, Mitra RD. Non-EST-based prediction of novel alternatively spliced cassette exons with cell signaling function in Caenorhabditis elegans and human. Nucleic Acids Res 2007; 35:3192-202. [PMID: 17452356 PMCID: PMC1904267 DOI: 10.1093/nar/gkm187] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
To better understand the complex role that alternative splicing plays in intracellular signaling, it is important to catalog the numerous splice variants involved in signal transduction. Therefore, we developed PASE (Prediction of Alternative Signaling Exons), a computational tool to identify novel alternative cassette exons that code for kinase phosphorylation or signaling protein-binding sites. We first applied PASE to the Caenorhabditis elegans genome. In this organism, our algorithm had an overall specificity of ≥76.4%, including 33 novel cassette exons that we experimentally verified. We then used PASE to analyze the human genome and made 804 predictions, of which 308 were found as alternative exons in the transcript database. We experimentally tested 384 of the remaining unobserved predictions and discovered 26 novel human exons for a total specificity of ≥41.5% in human. By using a test set of known alternatively spliced signaling exons, we determined that the sensitivity of PASE is ∼70%. GO term analysis revealed that our exon predictions were found in the introns of known signal transduction genes more often than expected by chance, indicating PASE enriches for splice variants that function in signaling pathways. Overall, PASE was able to uncover 59 novel alternative cassette exons in C. elegans and humans through a genome-wide ab initio prediction method that enriches for exons involved in signaling.
Collapse
Affiliation(s)
| | - Robi David Mitra
- *To whom correspondence should be addressed. Tel: +1-314-362-2751; Fax: +1-314-362-2156;
| |
Collapse
|
23
|
Technologies for the Global Discovery and Analysis of Alternative Splicing. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2007; 623:64-84. [DOI: 10.1007/978-0-387-77374-2_5] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
|