1
|
Castanheira LE, Rodrigues RS, Boldrini-França J, Fonseca FP, Henrique-Silva F, Homsi-Brandeburgo MI, Rodrigues VM. Molecular cloning of a hyaluronidase from Bothrops pauloensis venom gland. J Venom Anim Toxins Incl Trop Dis 2014; 20:25. [PMID: 24987408 PMCID: PMC4077683 DOI: 10.1186/1678-9199-20-25] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2014] [Accepted: 05/30/2014] [Indexed: 12/03/2022] Open
Abstract
Background Hyaluronate is one of the major components of extracellular matrix from vertebrates whose breakdown is catalyzed by the enzyme hyaluronidase. These enzymes are widely described in snake venoms, in which they facilitate the spreading of the main toxins in the victim’s body during the envenoming. Snake venoms also present some variants (hyaluronidases-like substances) that are probably originated by alternative splicing, even though their relevance in envenomation is still under investigation. Hyaluronidases-like proteins have not yet been purified from any snake venom, but the cDNA that encodes these toxins was already identified in snake venom glands by transcriptomic analysis. Herein, we report the cloning and in silico analysis of the first hyaluronidase-like proteins from a Brazilian snake venom. Methods The cDNA sequence of hyaluronidase was cloned from the transcriptome of Bothrops pauloensis venom glands. This sequence was submitted to multiple alignment with other related sequences by ClustalW. A phylogenetic analysis was performed using MEGA 4 software by the neighbor joining (NJ) method. Results The cDNA from Bothrops pauloensis venom gland that corresponds to hyaluronidase comprises 1175 bp and codifies a protein containing 194 amino acid residues. The sequence, denominated BpHyase, was identified as hyaluronidase-like since it shows high sequence identities (above 83%) with other described snake venom hyaluronidase-like sequences. Hyaluronidases-like proteins are thought to be products of alternative splicing implicated in deletions of central amino acids, including the catalytic residues. Structure-based sequence alignment of BpHyase to human hyaluronidase hHyal-1 demonstrates a loss of some key secondary structures. The phylogenetic analysis indicates an independent evolution of BpHyal when compared to other hyaluronidases. However, these toxins might share a common ancestor, thus suggesting a broad hyaluronidase-like distribution among venomous snakes. Conclusions This work is the first report of a cDNA sequence of hyaluronidase from Brazilian snake venoms. Moreover, the in silico analysis of its deduced amino acid sequence opens new perspectives about the biological function of hyaluronidases-like proteins and may direct further studies comprising their isolation and/or recombinant production, as well as their structural and functional characterization.
Collapse
Affiliation(s)
- Letícia Eulalio Castanheira
- Instituto de Genética e Bioquímica, Universidade Federal de Uberlândia, Uberlândia, MG CEP 384000-902, Brasil ; National Institute of Sciences and Technology on Nanobiopharmaceutics (INCT), Federal University of Minas Gerais (UFMG), Belo Horizonte, Minas Gerais State, Brazil
| | - Renata Santos Rodrigues
- Instituto de Genética e Bioquímica, Universidade Federal de Uberlândia, Uberlândia, MG CEP 384000-902, Brasil ; National Institute of Sciences and Technology on Nanobiopharmaceutics (INCT), Federal University of Minas Gerais (UFMG), Belo Horizonte, Minas Gerais State, Brazil
| | - Johara Boldrini-França
- Department of Physics and Chemistry, School of Pharmaceutical Sciences, University of São Paulo (USP), Ribeirão Preto, São Paulo State, Brazil
| | - Fernando Pp Fonseca
- Department of Genetics and Evolution, Federal University of São Carlos (UFSCar), São Carlos, São Paulo State, Brazil
| | - Flávio Henrique-Silva
- Department of Genetics and Evolution, Federal University of São Carlos (UFSCar), São Carlos, São Paulo State, Brazil
| | - Maria I Homsi-Brandeburgo
- Instituto de Genética e Bioquímica, Universidade Federal de Uberlândia, Uberlândia, MG CEP 384000-902, Brasil
| | - Veridiana M Rodrigues
- Instituto de Genética e Bioquímica, Universidade Federal de Uberlândia, Uberlândia, MG CEP 384000-902, Brasil ; National Institute of Sciences and Technology on Nanobiopharmaceutics (INCT), Federal University of Minas Gerais (UFMG), Belo Horizonte, Minas Gerais State, Brazil
| |
Collapse
|
2
|
Talavera D, Robertson DL, Lovell SC. Alternative splicing and protein interaction data sets. Nat Biotechnol 2013; 31:292-3. [PMID: 23563420 DOI: 10.1038/nbt.2540] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
3
|
Morata J, Béjar S, Talavera D, Riera C, Lois S, de Xaxars GM, de la Cruz X. The relationship between gene isoform multiplicity, number of exons and protein divergence. PLoS One 2013; 8:e72742. [PMID: 24023641 PMCID: PMC3758341 DOI: 10.1371/journal.pone.0072742] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2013] [Accepted: 07/14/2013] [Indexed: 11/18/2022] Open
Abstract
At present we know that phenotypic differences between organisms arise from a variety of sources, like protein sequence divergence, regulatory sequence divergence, alternative splicing, etc. However, we do not have yet a complete view of how these sources are related. Here we address this problem, studying the relationship between protein divergence and the ability of genes to express multiple isoforms. We used three genome-wide datasets of human-mouse orthologs to study the relationship between isoform multiplicity co-occurrence between orthologs (the fact that two orthologs have more than one isoform) and protein divergence. In all cases our results showed that there was a monotonic dependence between these two properties. We could explain this relationship in terms of a more fundamental one, between exon number of the largest isoform and protein divergence. We found that this last relationship was present, although with variations, in other species (chimpanzee, cow, rat, chicken, zebrafish and fruit fly). In summary, we have identified a relationship between protein divergence and isoform multiplicity co-occurrence and explained its origin in terms of a simple gene-level property. Finally, we discuss the biological implications of these findings for our understanding of inter-species phenotypic differences.
Collapse
Affiliation(s)
- Jordi Morata
- Department of Structural Biology, Institut de Biologia Molecular de Barcelona (IBMB)-Consejo Superior de Investigaciones Científicas (CSIC), Barcelona, Spain
| | - Santi Béjar
- Department of Structural Biology, Institut de Biologia Molecular de Barcelona (IBMB)-Consejo Superior de Investigaciones Científicas (CSIC), Barcelona, Spain
| | - David Talavera
- Faculty of Life Sciences, Manchester University, Manchester, United Kingdom
| | - Casandra Riera
- Laboratory of Translational Bioinformatics in Neuroscience, Vall d'Hebron Institute of Research (VHIR), Barcelona, Spain
| | - Sergio Lois
- Laboratory of Translational Bioinformatics in Neuroscience, Vall d'Hebron Institute of Research (VHIR), Barcelona, Spain
| | - Gemma Mas de Xaxars
- Laboratori de Botànica, Facultat de Farmàcia, Universitat de Barcelona, Barcelona, Spain
| | - Xavier de la Cruz
- Department of Structural Biology, Institut de Biologia Molecular de Barcelona (IBMB)-Consejo Superior de Investigaciones Científicas (CSIC), Barcelona, Spain
- Laboratory of Translational Bioinformatics in Neuroscience, Vall d'Hebron Institute of Research (VHIR), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
- * E-mail:
| |
Collapse
|
4
|
Barbany M, Morata J, Meyer T, Lois S, Orozco M, de la Cruz X. Characterization of the impact of alternative splicing on protein dynamics: the cases of glutathione S-transferase and ectodysplasin-A isoforms. Proteins 2012; 80:2235-49. [PMID: 22576332 DOI: 10.1002/prot.24112] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Revised: 04/24/2012] [Accepted: 05/02/2012] [Indexed: 12/31/2022]
Abstract
Recent studies have shown how alternative splicing (AS), the process by which eukaryotic genes express more than one product, affects protein sequence and structure. However, little information is available on the impact of AS on protein dynamics, a property fundamental for protein function. In this work, we have addressed this issue using molecular dynamics simulations of the isoforms of two model proteins: glutathione S-transferase and ectodysplasin-A. We have found that AS does not have a noticeable impact on global or local structure fluctuations. We have also found that, quite interestingly, AS has a significant effect on the coupling between key structural elements such as surface cavities. Our results provide the first atom-level view of the impact of AS on protein dynamics, as far as we know. They can contribute to refine our present view of the relationship between AS and protein disorder and, more importantly, they reveal how AS may modify structural dynamic couplings in proteins.
Collapse
|
5
|
Su Z, Li R, Song X, Liu G, Li Y, Chang X, Li C, Huang D. Identification of a novel isoform of DHRS4 protein with a nuclear localization signal. Gene 2012; 494:161-7. [PMID: 22227495 DOI: 10.1016/j.gene.2011.12.033] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2011] [Revised: 11/30/2011] [Accepted: 12/16/2011] [Indexed: 02/05/2023]
Abstract
The DHRS4 gene encodes an NADP(H)-dependent retinol dehydrogenase/reductase (NRDR) and plays an important role in regulating the synthesis of retinoic acid. In the present study, we identified a novel splice RNA variant, designated NRDRA2, of the human DHRS4 gene by RT-PCR, 3' RACE, and 5' RACE. NRDRA2 mRNA lacked exons 4 and 6, and had a shift in the reading frame when compared to DHRS4 mRNA, resulting in loss of the peroxisomal targeting signal of NRDR and gain of a nuclear localization signal in the predicted NRDRA2 protein. Endogenous NRDRA2 protein was identified in the human cervical carcinoma cell line HeLa by immunoprecipitation and mass spectrometric assay. A green fluorescent protein reporter assay showed that NRDRA2 protein mainly localized to the nuclei, confirming the sequence at its C-terminus as a legitimate nuclear localization signal sequence. This study identifies the alternative transcript variant NRDRA2 encoding a subcellular nuclear localized NRDRA2 protein.
Collapse
Affiliation(s)
- Zhongjing Su
- Department of Cell Biology, Shantou University Medical College, Shantou Guangdong, China
| | | | | | | | | | | | | | | |
Collapse
|
6
|
The Epithelial Sodium Channel alpha subunit (alpha ENaC) alternatively spliced form "b" in Dahl rats: What's next? Int Arch Med 2010; 3:14. [PMID: 20604958 PMCID: PMC2909934 DOI: 10.1186/1755-7682-3-14] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2010] [Accepted: 07/06/2010] [Indexed: 11/27/2022] Open
Abstract
Background The amiloride-sensitive Epithelial Sodium Channel (ENaC) is critical in maintaining Na+ balance, extracellular fluid volume and long term blood pressure control. ENaC is composed of three main subunits α, β, & γ. While α ENaC is critical for channel functionality, β & γ ENaC maximize channel function. To date, there are four alternatively spliced forms of the α subunit of ENaC (α ENaC-a, -b, -c, & -d) that have been published in rats, in addition to the major α ENaC transcript. While α ENaC-a, -c & -d transcripts are low abundance transcripts compared to full-length α ENaC, α ENaC-b is a higher abundance and salt-sensitive transcript compared to full-length α ENaC. Presentation of the hypothesis α ENaC-b protein, which is preferentially produced in Dahl R rats, to a greater extent on high salt diet, exerts a dominant negative effect on full-length α ENaC subunit by physically binding to and trapping full-length α ENaC subunit in the endoplasmic reticulum, and finally accelerating full-length α ENaC proteolytic degradation in a dose-dependent manner. Testing the hypothesis 1) To examine the mRNA and protein abundance of α ENaC-b relative to α ENaC full-length in kidney, lung, and taste tissues of Dahl rats. 2) To compare the expression (mRNA and protein) of α ENaC-b in kidneys of Dahl S and R rats on regular and high salt diet. 3) To examine the putative binding of α ENaC-b proteins to full-length α ENaC in vitro and to determine the impact of such binding on full-length α ENaC expression in vitro. Implications of the hypothesis Our studies will be the first to demonstrate the over-expression of salt-sensitive α ENaC-b spliced form in kidney tissues of Dahl R rats at the expense of full-length α ENaC. The current proposal will provide highly novel insights into the putative mechanisms leading to ENaC hypoactivity in high-salt-fed Dahl R rats. Finally, findings from the present proposal will uncover a new mechanism by which alternative splicing may control the regulation of ENaC expression/function.
Collapse
|
7
|
Evolutionary characteristics of exons expressed at different abundance levels in mammals. Sci Bull (Beijing) 2009. [DOI: 10.1007/s11434-009-0535-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
8
|
Shehata MF. Regulation of the epithelial sodium channel [ENaC] in kidneys of salt-sensitive Dahl rats: insights on alternative splicing. Int Arch Med 2009; 2:28. [PMID: 19785774 PMCID: PMC2761857 DOI: 10.1186/1755-7682-2-28] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2009] [Accepted: 09/29/2009] [Indexed: 11/10/2022] Open
Abstract
The epithelial sodium channel [ENaC] is critical for the maintenance of sodium balance, extracellular fluid volume and long term blood pressure control. Monogenic disorders causing ENaC hyperactivity have led to a severe form of hereditary hypertension in humans, known as Liddle's syndrome. Similarly, in animal models, ENaC hyperactivity has been well documented in kidneys of salt-sensitive [S] Dahl rats [a genetic model of salt-sensitive hypertension] versus their normotensive control [Dahl salt-resistant [R] rats]. The purpose of the present review is to highlight the differential regulation of ENaC in kidneys of Dahl S versus R rats. A systematic overview of the putative role of alternative splicing of the main alpha subunit of ENaC [alpha ENaC] in modulating ENaC expression in kidneys of Dahl rats will be discussed. Finally, a better understanding of the meaningful contribution of ENaC in the pathogenesis of salt-sensitive hypertension will be achieved upon completion of this review.
Collapse
Affiliation(s)
- Marlene F Shehata
- Department of Cellular and Molecular Medicine, Faculty of Medicine, University of Ottawa, Ottawa, ON, Canada.
| |
Collapse
|
9
|
Alternative splicing of transcription factors' genes: beyond the increase of proteome diversity. Comp Funct Genomics 2009:905894. [PMID: 19609452 PMCID: PMC2709715 DOI: 10.1155/2009/905894] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2008] [Revised: 04/06/2009] [Accepted: 05/18/2009] [Indexed: 11/29/2022] Open
Abstract
Functional modification of transcription regulators may lead to developmental changes and phenotypical differences between species. In this work, we study the influence of alternative splicing on transcription factors in human and mouse. Our results show that the impact of alternative splicing on transcription factors is similar in both species, meaning that the ways to increase variability should also be similar. However, when looking at the expression patterns of transcription factors, we observe that they tend to diverge regardless of the role of alternative splicing. Finally, we hypothesise that transcription regulation of alternatively spliced transcription factors could play an important role in the phenotypical differences between species, without discarding other phenomena or functional families.
Collapse
|
10
|
Shehata MF. A Novel Mechanism in Regulating the Alpha-Subunit of the Epithelial Sodium Channel (α ENaC) by the Alternatively Spliced Form α ENaC-b. BIOCHEMISTRY INSIGHTS 2009. [DOI: 10.4137/bci.s880] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Introduction In Dahl rats’ kidney cortex, the alternatively spliced form of the epithelial sodium channel α subunit (α ENaC-b) is the most abundant mRNA transcript (32+/-3 fold > α ENaC-wt) as was investigated by quantitative RT-PCR analysis. α ENaC-b mRNA levels were significantly higher in Dahl R versus S rats, and were further augmented by high salt diet. Objectives In the present study, we described the molecular cloning and searched for a possible role of α ENaC-b by testing its potential expression in COS7 cells as well as its impact on α ENaC-wt expression levels when co-expressed in COS7 cells in a dose-dependent manner. Methods Using RT-PCR strategy, the full-length wildtype α ENaC transcript and the alternatively spliced form α ENaC-b were amplified, sequenced, cloned, subcloned into PCMV-sport6 expression vector, expressed and co-expressed into COS7 cells in a dose-dependent manner. A combination of denaturing and native western blotting techniques was employed to examine the expression of α ENaC-b in vitro, and to determine if an interaction between α ENaC-b and α ENaC-wt occurs in vitro, and finally to demonstrate if degradation of α ENaC-wt protein does occur. Results α ENaC-b is translated in COS7 cells. Co-expression of α ENaC-b together with α ENaC-wt reduced α ENaC-wt levels in a dose-dependent manner. α ENaC-wt and α ENaC-b appear to form a complex that enhances the degradation of α ENaC-wt. Conclusions Western blots suggest a novel mechanism in α ENaC regulation whereby α ENaC-b exerts a dominant negative effect on α ENaC-wt expression. This is potentially by sequestering α ENaC-wt, enhancing its proteolytic degradation, and possibly explaining the mechanism of salt-resistance in Dahl R rats.
Collapse
Affiliation(s)
- Marlene F. Shehata
- Department of Cellular and Molecular Medicine, Faculty of Medicine, University of Ottawa, Ottawa ON, Canada
| |
Collapse
|
11
|
Comparative analysis indicates that alternative splicing in plants has a limited role in functional expansion of the proteome. BMC Genomics 2009; 10:154. [PMID: 19358722 PMCID: PMC2674458 DOI: 10.1186/1471-2164-10-154] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2008] [Accepted: 04/09/2009] [Indexed: 11/23/2022] Open
Abstract
Background Alternative splicing (AS) is a widespread phenomenon in higher eukaryotes but the extent to which it leads to functional protein isoforms and to proteome expansion at large is still a matter of debate. In contrast to animal species, for which AS has been studied extensively at the protein and functional level, protein-centered studies of AS in plant species are scarce. Here we investigate the functional impact of AS in dicot and monocot plant species using a comparative approach. Results Detailed comparison of AS events in alternative spliced orthologs from the dicot Arabidopsis thaliana and the monocot Oryza sativa (rice) revealed that the vast majority of AS events in both species do not result from functional conservation. Transcript isoforms that are putative targets for the nonsense-mediated decay (NMD) pathway are as likely to contain conserved AS events as isoforms that are translated into proteins. Similar results were obtained when the same comparison was performed between the two more closely related monocot species rice and Zea mays (maize). Genome-wide computational analysis of functional protein domains encoded in alternatively and constitutively spliced genes revealed that only the RNA recognition motif (RRM) is overrepresented in alternatively spliced genes in all species analyzed. In contrast, three domain types were overrepresented in constitutively spliced genes. AS events were found to be less frequent within than outside predicted protein domains and no domain type was found to be enriched with AS introns. Analysis of AS events that result in the removal of complete protein domains revealed that only a small number of domain types is spliced-out in all species analyzed. Finally, in a substantial fraction of cases where a domain is completely removed, this domain appeared to be a unit of a tandem repeat. Conclusion The results from the ortholog comparisons suggest that the ability of a gene to produce more than one functional protein through AS does not persist during evolution. Cross-species comparison of the results of the protein-domain oriented analyses indicates little correspondence between the analyzed species. Based on the premise that functional genetic features are most likely to be conserved during evolution, we conclude that AS has only a limited role in functional expansion of the proteome in plants.
Collapse
|
12
|
Li X, Liang J, Yu H, Su B, Xiao C, Shang Y, Wang W. Functional consequences of new exon acquisition in mammalian chromodomain Y-like (CDYL) genes. Trends Genet 2007; 23:427-31. [PMID: 17573145 DOI: 10.1016/j.tig.2007.06.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2006] [Revised: 02/16/2007] [Accepted: 06/04/2007] [Indexed: 10/23/2022]
Abstract
The origin of new exons is an important mechanism for proteome diversity. Here, we report the recurrent origination of new exons in mammalian chromodomain Y-like (CDYL) genes and the functional consequences associated with the acquisition of the new exons. Driven by positive selection, the newly evolved longer peptide exhibits weaker transcription repression activity and attenuates the repression activity of the old form, suggesting that the acquisition of the new exons is functionally significant.
Collapse
Affiliation(s)
- Xin Li
- CAS-Max Planck Junior Research Group, Key Laboratory of Cellular and Molecular Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences (CAS), Kunming, Yunnan 650223, China
| | | | | | | | | | | | | |
Collapse
|
13
|
The functional modulation of epigenetic regulators by alternative splicing. BMC Genomics 2007; 8:252. [PMID: 17651478 PMCID: PMC1949830 DOI: 10.1186/1471-2164-8-252] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2007] [Accepted: 07/25/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Epigenetic regulators (histone acetyltransferases, methyltransferases, chromatin-remodelling enzymes, etc) play a fundamental role in the control of gene expression by modifying the local state of chromatin. However, due to their recent discovery, little is yet known about their own regulation. This paper addresses this point, focusing on alternative splicing regulation, a mechanism already known to play an important role in other protein families, e.g. transcription factors, membrane receptors, etc. RESULTS To this end, we compiled the data available on the presence/absence of alternative splicing for a set of 160 different epigenetic regulators, taking advantage of the relatively large amount of unexplored data on alternative splicing available in public databases. We found that 49 % (70 % in human) of these genes express more than one transcript. We then studied their alternative splicing patterns, focusing on those changes affecting the enzyme's domain composition. In general, we found that these sequence changes correspond to different mechanisms, either repressing the enzyme's function (e.g. by creating dominant-negative inhibitors of the functional isoform) or creating isoforms with new functions. CONCLUSION We conclude that alternative splicing of epigenetic regulators can be an important tool for the function modulation of these enzymes. Considering that the latter control the transcriptional state of large sets of genes, we propose that epigenetic regulation of gene expression is itself strongly regulated by alternative splicing.
Collapse
|
14
|
A procedure for identifying homologous alternative splicing events. BMC Bioinformatics 2007; 8:260. [PMID: 17640387 PMCID: PMC1950890 DOI: 10.1186/1471-2105-8-260] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2007] [Accepted: 07/19/2007] [Indexed: 01/11/2023] Open
Abstract
Background The study of the functional role of alternative splice isoforms of a gene is a very active area of research in biology. The difficulty of the experimental approach (in particular, in its high-throughput version) leaves ample room for the development of bioinformatics tools that can provide a useful first picture of the problem. Among the possible approaches, one of the simplest is to follow classical protein function annotation protocols and annotate target alternative splice events with the information available from conserved events in other species. However, the application of this protocol requires a procedure capable of recognising such events. Here we present a simple but accurate method developed for this purpose. Results We have developed a method for identifying homologous, or equivalent, alternative splicing events, based on the combined use of neural networks and sequence searches. The procedure comprises four steps: (i) BLAST search for homologues of the two isoforms defining the target alternative splicing event; (ii) construction of all possible candidate events; (iii) scoring of the latter with a series of neural networks; and (iv) filtering of the results. When tested in a set of 473 manually annotated pairs of homologous events, our method showed a good performance, with an accuracy of 0.99, a precision of 0.98 and a sensitivity of 0.93. When no candidates were available, the specificity of our method varied between 0.81 and 0.91. Conclusion The method described in this article allows the identification of homologous alternative splicing events, with a good success rate, indicating that such method could be used for the development of functional annotation of alternative splice isoforms.
Collapse
|
15
|
Talavera D, Vogel C, Orozco M, Teichmann SA, de la Cruz X. The (in)dependence of alternative splicing and gene duplication. PLoS Comput Biol 2007; 3:e33. [PMID: 17335345 PMCID: PMC1808492 DOI: 10.1371/journal.pcbi.0030033] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2006] [Accepted: 01/04/2007] [Indexed: 11/18/2022] Open
Abstract
Alternative splicing (AS) and gene duplication (GD) both are processes that diversify the protein repertoire. Recent examples have shown that sequence changes introduced by AS may be comparable to those introduced by GD. In addition, the two processes are inversely correlated at the genomic scale: large gene families are depleted in splice variants and vice versa. All together, these data strongly suggest that both phenomena result in interchangeability between their effects. Here, we tested the extent to which this applies with respect to various protein characteristics. The amounts of AS and GD per gene are anticorrelated even when accounting for different gene functions or degrees of sequence divergence. In contrast, the two processes appear to be independent in their influence on variation in mRNA expression. Further, we conducted a detailed comparison of the effect of sequence changes in both alternative splice variants and gene duplicates on protein structure, in particular the size, location, and types of sequence substitutions and insertions/deletions. We find that, in general, alternative splicing affects protein sequence and structure in a more drastic way than gene duplication and subsequent divergence. Our results reveal an interesting paradox between the anticorrelation of AS and GD at the genomic level, and their impact at the protein level, which shows little or no equivalence in terms of effects on protein sequence, structure, and function. We discuss possible explanations that relate to the order of appearance of AS and GD in a gene family, and to the selection pressure imposed by the environment.
Collapse
Affiliation(s)
- David Talavera
- Molecular Modeling and Bioinformatics Unit, Parc Científic de Barcelona, Barcelona, Spain
- Protein Structure and Modelling Node, Instituto Nacional de Bioinfomática, Genoma España, Parc Científic de Barcelona, Barcelona, Spain
| | - Christine Vogel
- Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom
- Institute for Cellular and Molecular Biology, University of Texas Austin, Austin, Texas, United States of America
| | - Modesto Orozco
- Molecular Modeling and Bioinformatics Unit, Parc Científic de Barcelona, Barcelona, Spain
- Protein Structure and Modelling Node, Instituto Nacional de Bioinfomática, Genoma España, Parc Científic de Barcelona, Barcelona, Spain
- Departament de Bioquímica i Biologia Molecular, Universitat de Barcelona, Barcelona, Spain
- Institut per a la Recerca i Estudis Avançats (IRCA), Barcelona, Spain
| | - Sarah A Teichmann
- Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom
| | - Xavier de la Cruz
- Molecular Modeling and Bioinformatics Unit, Parc Científic de Barcelona, Barcelona, Spain
- Protein Structure and Modelling Node, Instituto Nacional de Bioinfomática, Genoma España, Parc Científic de Barcelona, Barcelona, Spain
- Computational Biology Program, Barcelona Supercomputing Center, Barcelona, Spain
| |
Collapse
|
16
|
Flicek P, Brent MR. Using several pair-wise informant sequences for de novo prediction of alternatively spliced transcripts. Genome Biol 2006; 7 Suppl 1:S8.1-9. [PMID: 16925842 PMCID: PMC1810557 DOI: 10.1186/gb-2006-7-s1-s8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Background As part of the ENCODE Genome Annotation Assessment Project (EGASP), we developed the MARS extension to the Twinscan algorithm. MARS is designed to find human alternatively spliced transcripts that are conserved in only one or a limited number of extant species. MARS is able to use an arbitrary number of informant sequences and predicts a number of alternative transcripts at each gene locus. Results MARS uses the mouse, rat, dog, opossum, chicken, and frog genome sequences as pairwise informant sources for Twinscan and combines the resulting transcript predictions into genes based on coding (CDS) region overlap. Based on the EGASP assessment, MARS is one of the more accurate dual-genome prediction programs. Compared to the GENCODE annotation, we find that predictive sensitivity increases, while specificity decreases, as more informant species are used. MARS correctly predicts alternatively spliced transcripts for 11 of the 236 multi-exon GENCODE genes that are alternatively spliced in the coding region of their transcripts. For these genes a total of 24 correct transcripts are predicted. Conclusion The MARS algorithm is able to predict alternatively spliced transcripts without the use of expressed sequence information, although the number of loci in which multiple predicted transcripts match multiple alternatively spliced transcripts in the GENCODE annotation is relatively small.
Collapse
Affiliation(s)
- Paul Flicek
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| | | |
Collapse
|
17
|
Xing Y, Lee C. Evidence of functional selection pressure for alternative splicing events that accelerate evolution of protein subsequences. Proc Natl Acad Sci U S A 2005; 102:13526-31. [PMID: 16157889 PMCID: PMC1224612 DOI: 10.1073/pnas.0501213102] [Citation(s) in RCA: 111] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Recently, it was proposed that alternative splicing may act as a mechanism for opening accelerated paths of evolution, by reducing negative selection pressure, but there has been little evidence so far that this mechanism could produce adaptive benefit. Here, we use metrics of very different types of selection pressures [e.g., against amino acid mutations (Ka/Ks), against mutations at synonymous sites (Ks), and for protein reading-frame preservation] to address this question by genomewide analyses of human, chimpanzee, mouse, and rat. These data show that alternative splicing relaxes Ka/Ks selection pressure up to 7-fold, but intriguingly this effect is accompanied by a strong increase in selection pressure against synonymous mutations, which propagates into the adjacent intron, and correlates strongly with the alternative splicing level observed for each exon. These effects are highly local to the alternatively spliced exon. Comparisons of these four genomes consistently show an increase in the density of amino acid mutations (Ka) in alternatively spliced exons and a decrease in the density of synonymous mutations (Ks). This selection pressure against synonymous mutations in alternatively spliced exons was accompanied in all four genomes by a striking increase in selection pressure for protein reading-frame preservation, and both increased markedly with increasing evolutionary age. Restricting our analysis to a subset of exons with strong evidence for biologically functional alternative splicing produced identical results. Thus alternative splicing apparently can create evolutionary "hotspots" within a protein sequence, and these events have evidently been selected for during mammalian evolution.
Collapse
Affiliation(s)
- Yi Xing
- Molecular Biology Institute, Center for Genomics and Proteomics, Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | | |
Collapse
|