1
|
Luthra I, Jensen C, Chen XE, Salaudeen AL, Rafi AM, de Boer CG. Regulatory activity is the default DNA state in eukaryotes. Nat Struct Mol Biol 2024; 31:559-567. [PMID: 38448573 DOI: 10.1038/s41594-024-01235-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 01/29/2024] [Indexed: 03/08/2024]
Abstract
Genomes encode for genes and non-coding DNA, both capable of transcriptional activity. However, unlike canonical genes, many transcripts from non-coding DNA have limited evidence of conservation or function. Here, to determine how much biological noise is expected from non-genic sequences, we quantify the regulatory activity of evolutionarily naive DNA using RNA-seq in yeast and computational predictions in humans. In yeast, more than 99% of naive DNA bases were transcribed. Unlike the evolved transcriptome, naive transcripts frequently overlapped with opposite sense transcripts, suggesting selection favored coherent gene structures in the yeast genome. In humans, regulation-associated chromatin activity is predicted to be common in naive dinucleotide-content-matched randomized DNA. Here, naive and evolved DNA have similar co-occurrence and cell-type specificity of chromatin marks, challenging these as indicators of selection. However, in both yeast and humans, extreme high activities were rare in naive DNA, suggesting they result from selection. Overall, basal regulatory activity seems to be the default, which selection can hone to evolve a function or, if detrimental, repress.
Collapse
Affiliation(s)
- Ishika Luthra
- School of Biomedical Engineering, University of British Columbia, Vancouver, British Columbia, Canada
| | - Cassandra Jensen
- School of Biomedical Engineering, University of British Columbia, Vancouver, British Columbia, Canada
| | - Xinyi E Chen
- School of Biomedical Engineering, University of British Columbia, Vancouver, British Columbia, Canada
| | - Asfar Lathif Salaudeen
- School of Biomedical Engineering, University of British Columbia, Vancouver, British Columbia, Canada
| | - Abdul Muntakim Rafi
- School of Biomedical Engineering, University of British Columbia, Vancouver, British Columbia, Canada
| | - Carl G de Boer
- School of Biomedical Engineering, University of British Columbia, Vancouver, British Columbia, Canada.
| |
Collapse
|
2
|
Sarkar I, Tisa LS, Gtari M, Sen A. Biosynthetic energy cost of potentially highly expressed proteins vary with niche in selected actinobacteria. J Basic Microbiol 2017; 58:154-161. [PMID: 29144540 DOI: 10.1002/jobm.201700350] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2017] [Revised: 10/27/2017] [Accepted: 11/05/2017] [Indexed: 11/08/2022]
Abstract
Amino acid and protein biosynthesis requires a number of high energy phosphate bonds and includes a dual energy cost for the synthesis of chemical intermediates during the fueling reactions and the conversion of precursor molecules to final products. One popular hypothesis is that the proteins encoded by putative highly expressed genes (hence called PHXPs) generally utilize low energy consuming amino acids to reduce the biosynthetic cost of the essential proteins. In our study, we found that this idea was not supported in the case of actinobacteria. With the actinobacteria, the energy costs of PHXPs varied in relation to their niche. Free-living, including aquatic, soil and extremophilic, and plant-associated actinobacteria were found to use energetically expensive amino acids in their PHXPs. An exception occurred with some animal-host-associated actinobacteria that used energy efficient amino acids. One explanation for these results may be due to the diverse metabolic patterns exhibited by actinobacteria under varied niches influenced by nutritional availability and physical environment.
Collapse
Affiliation(s)
- Indrani Sarkar
- NBU Bioinformatics Facility, Department of Botany, University of North Bengal, Siliguri, India
| | - Louis S Tisa
- Department of Molecular, Cellular and Biomedical Sciences, University of New Hampshire, Durham, New Hampshire
| | - Maher Gtari
- Laboratoire Microorganismes et Biomolécules Actives, Université de Tunis Elmanar (FST), Université de Carthage (INSAT), Tunis, Tunisia
| | - Arnab Sen
- NBU Bioinformatics Facility, Department of Botany, University of North Bengal, Siliguri, India
| |
Collapse
|
3
|
Architecture and Distribution of Introns in Core Genes of Four Fusarium Species. G3-GENES GENOMES GENETICS 2017; 7:3809-3820. [PMID: 28993438 PMCID: PMC5677156 DOI: 10.1534/g3.117.300344] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Removal of introns from transcribed RNA represents a crucial step during the production of mRNA in eukaryotes. Available whole-genome sequences and expressed sequence tags (ESTs) have increased our knowledge of this process and revealed various commonalities among eukaryotes. However, certain aspects of intron structure and diversity are taxon-specific, which can complicate the accuracy of in silico gene prediction methods. Using core genes, we evaluated the distribution and architecture of Fusarium circinatum spliceosomal introns, and linked these characteristics to the accuracy of the predicted gene models of the genome of this fungus. We also evaluated intron distribution and architecture in F. verticillioides, F. oxysporum, and F. graminearum, and made comparisons with F. circinatum. Results indicated that F. circinatum and the three other Fusarium species have canonical 5′ and 3′ splice sites, but with subtle differences that are apparently not shared with those of other fungal genera. The polypyrimidine tract of Fusarium introns was also found to be highly divergent among species and genes. Furthermore, the conserved adenosine nucleoside required during the first step of splicing is contained within unique branch site motifs in certain Fusarium introns. Data generated here show that introns of F. circinatum, as well as F. verticillioides, F. oxysporum, and F. graminearum, are characterized by a number of unique features such as the CTHAH and ACCAT motifs of the branch site. Incorporation of such information into genome annotation software will undoubtedly improve the accuracy of gene prediction methods used for Fusarium species and related fungi.
Collapse
|
4
|
Ma JE, Lang QQ, Qiu FF, Zhang L, Li XG, Luo W, Wang J, Wang X, Lin XR, Liu WS, Nie QH, Zhang XQ. Negative Glucocorticoid Response-Like Element from the First Intron of the Chicken Growth Hormone Gene Represses Gene Expression in the Rat Pituitary Tumor Cell Line. Int J Mol Sci 2016; 17:ijms17111863. [PMID: 27834851 PMCID: PMC5133863 DOI: 10.3390/ijms17111863] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2016] [Revised: 10/19/2016] [Accepted: 10/25/2016] [Indexed: 12/29/2022] Open
Abstract
The effects of introns, especially the first intron, on the regulation of gene expression remains unclear. Therefore, the objective of the present study was to investigate the transcriptional regulatory function of intron 1 on the chicken growth hormone (cGH) gene in the rat pituitary tumor cell line (GH4-C1). Transient transfection using first-intron-inserted cGH complete coding sequences (CDSs) and non-intron-inserted cGH CDS plasmids, quantitative RT-PCR (qRT-PCR) and western blot assays were used to detect the expression of cGH. The reporter gene assay was also used to investigate the effect of a series of fragments in the first intron of cGH on gene expression in GH4-C1. All of the results revealed that a 200-bp fragment located in the +485/+684 region of intron 1 was essential for repressing the expression of cGH. Further informatics analysis showed that there was a cluster of 13 transcriptional factor binding sites (TFBSs) in the +485/+684 region of the cGH intron 1. Disruption of a glucocorticoid response-like element (the 19-nucleotide sequence 5'-AGGCTTGACAGTGACCTCC-3') containing a T-box motif (TGACCT) located within this DNA fragment increased the expression of the reporter gene in GH4-C1. In addition, an electrophoretic mobility shift assay (EMSA) revealed a glucocorticoid receptor (GR) protein of rat binding to the glucocorticoid response-like element. Together, these results indicate that there is a negative glucocorticoid response-like element (nGRE) located in the +591/+609 region within the first intron of cGH, which is essential for the down-regulation of cGH expression.
Collapse
Affiliation(s)
- Jing-E Ma
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Qian-Qian Lang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Feng-Fang Qiu
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Li Zhang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Xiang-Guang Li
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
| | - Wen Luo
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Juan Wang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Xing Wang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Xi-Ran Lin
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Wen-Sheng Liu
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Qing-Hua Nie
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Xi-Quan Zhang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China.
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China.
- Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| |
Collapse
|
5
|
Palazzo AF, Lee ES. Non-coding RNA: what is functional and what is junk? Front Genet 2015; 6:2. [PMID: 25674102 PMCID: PMC4306305 DOI: 10.3389/fgene.2015.00002] [Citation(s) in RCA: 557] [Impact Index Per Article: 55.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2014] [Accepted: 01/06/2015] [Indexed: 12/12/2022] Open
Abstract
The genomes of large multicellular eukaryotes are mostly comprised of non-protein coding DNA. Although there has been much agreement that a small fraction of these genomes has important biological functions, there has been much debate as to whether the rest contributes to development and/or homeostasis. Much of the speculation has centered on the genomic regions that are transcribed into RNA at some low level. Unfortunately these RNAs have been arbitrarily assigned various names, such as “intergenic RNA,” “long non-coding RNAs” etc., which have led to some confusion in the field. Many researchers believe that these transcripts represent a vast, unchartered world of functional non-coding RNAs (ncRNAs), simply because they exist. However, there are reasons to question this Panglossian view because it ignores our current understanding of how evolution shapes eukaryotic genomes and how the gene expression machinery works in eukaryotic cells. Although there are undoubtedly many more functional ncRNAs yet to be discovered and characterized, it is also likely that many of these transcripts are simply junk. Here, we discuss how to determine whether any given ncRNA has a function. Importantly, we advocate that in the absence of any such data, the appropriate null hypothesis is that the RNA in question is junk.
Collapse
Affiliation(s)
| | - Eliza S Lee
- Department of Biochemistry, University of Toronto Toronto, ON, Canada
| |
Collapse
|
6
|
Jiang L, Li XN, Niu DK. Higher frequency of intron loss from the promoter proximally paused genes of Drosophila melanogaster. Fly (Austin) 2014; 8:120-5. [PMID: 25483256 DOI: 10.4161/fly.29489] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Although intron losses have been widely reported, it is not clear whether they are neutral and therefore random or driven by positive selection. Intron transcription and splicing are time-consuming and can delay the expression of its host gene. For genes that must be activated quickly to respond to physiological or stress signals, intron delay may be deleterious. Promoter proximally paused (PPP) genes are a group of rapidly expressed genes. To respond quickly to activation signals, they generally initiate transcription competently but stall after synthesizing a short RNA. In this study, performed in Drosophila melanogaster, the PPP genes were found to have a significantly higher rate of intron loss than control genes. However, further analysis did not find more significant shrinkage of intron size in PPP genes. Referring to previous studies on the rates of transcription and splicing and to the time saved by deletion of the introns from mouse gene Hes7, it is here suggested that transcription delay is comparable to splicing delay only when the intron is 28.5 kb or larger, which is greater in size than 95% of vertebrate introns, 99.5% of Drosophila introns, and all the annotated introns of Saccharomyces cerevisiae and Arabidopsis thaliana. Delays in intron splicing are probably a selective force, promoting intron loss from quickly expressed genes. In other genes, it may have been an exaptation during the emergency of developmental clocks.
Collapse
Affiliation(s)
- Li Jiang
- a MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development; College of Life Sciences; Beijing Normal University; Beijing, PR China
| | | | | |
Collapse
|
7
|
Yang YF, Zhu T, Niu DK. Association of intron loss with high mutation rate in Arabidopsis: implications for genome size evolution. Genome Biol Evol 2013; 5:723-33. [PMID: 23516254 PMCID: PMC4104619 DOI: 10.1093/gbe/evt043] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Despite the prevalence of intron losses during eukaryotic evolution, the selective forces acting on them have not been extensively explored. Arabidopsis thaliana lost half of its genome and experienced an elevated rate of intron loss after diverging from A. lyrata. The selective force for genome reduction was suggested to have driven the intron loss. However, the evolutionary mechanism of genome reduction is still a matter of debate. In this study, we found that intron-lost genes have high synonymous substitution rates. Assuming that differences in mutability among different introns are conserved among closely related species, we used the nucleotide substitution rate between orthologous introns in other species as the proxy of the mutation rate of Arabidopsis introns, either lost or extant. The lost introns were found to have higher mutation rates than extant introns. At the genome-wide level, A. thaliana has a higher mutation rate than A. lyrata, which correlates with the higher rate of intron loss and rapid genome reduction of A. thaliana. Our results indicate that selection to minimize mutational hazards might be the selective force for intron loss, and possibly also for genome reduction, in the evolution of A. thaliana. Small genome size and lower genome-wide intron density were widely reported to be correlated with phenotypic features, such as high metabolic rates and rapid growth. We argue that the mutational-hazard hypothesis is compatible with these correlations, by suggesting that selection for rapid growth might indirectly increase mutational hazards.
Collapse
Affiliation(s)
- Yu-Fei Yang
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, China
| | | | | |
Collapse
|
8
|
Frequency of intron loss correlates with processed pseudogene abundance: a novel strategy to test the reverse transcriptase model of intron loss. BMC Biol 2013; 11:23. [PMID: 23497167 PMCID: PMC3652778 DOI: 10.1186/1741-7007-11-23] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2013] [Accepted: 03/05/2013] [Indexed: 11/23/2022] Open
Abstract
Background Although intron loss in evolution has been described, the mechanism involved is still unclear. Three models have been proposed, the reverse transcriptase (RT) model, genomic deletion model and double-strand-break repair model. The RT model, also termed mRNA-mediated intron loss, suggests that cDNA molecules reverse transcribed from spliced mRNA recombine with genomic DNA causing intron loss. Many studies have attempted to test this model based on its predictions, such as simultaneous loss of adjacent introns, 3'-side bias of intron loss, and germline expression of intron-lost genes. Evidence either supporting or opposing the model has been reported. The mechanism of intron loss proposed in the RT model shares the process of reverse transcription with the formation of processed pseudogenes. If the RT model is correct, genes that have produced more processed pseudogenes are more likely to undergo intron loss. Results In the present study, we observed that the frequency of intron loss is correlated with processed pseudogene abundance by analyzing a new dataset of intron loss obtained in mice and rats. Furthermore, we found that mRNA molecules of intron-lost genes are mostly translated on free cytoplasmic ribosomes, a feature shared by mRNA molecules of the parental genes of processed pseudogenes and long interspersed elements. This feature is likely convenient for intron-lost gene mRNA molecules to be reverse transcribed. Analyses of adjacent intron loss, 3'-side bias of intron loss, and germline expression of intron-lost genes also support the RT model. Conclusions Compared with previous evidence, the correlation between the abundance of processed pseudogenes and intron loss frequency more directly supports the RT model of intron loss. Exploring such a correlation is a new strategy to test the RT model in organisms with abundant processed pseudogenes.
Collapse
|
9
|
Choi SS, Hannenhalli S. Three independent determinants of protein evolutionary rate. J Mol Evol 2013; 76:98-111. [PMID: 23400388 DOI: 10.1007/s00239-013-9543-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2012] [Accepted: 01/16/2013] [Indexed: 12/15/2022]
Abstract
One of the most widely accepted ideas related to the evolutionary rates of proteins is that functionally important residues or regions evolve slower than other regions, a reasonable outcome of which should be a slower evolutionary rate of the proteins with a higher density of functionally important sites. Oddly, the role of functional importance, mainly measured by essentiality, in determining evolutionary rate has been challenged in recent studies. Several variables other than protein essentiality, such as expression level, gene compactness, protein-protein interactions, etc., have been suggested to affect protein evolutionary rate. In the present review, we try to refine the concept of functional importance of a gene, and consider three factors-functional importance, expression level, and gene compactness, as independent determinants of evolutionary rate of a protein, based not only on their known correlation with evolutionary rate but also on a reasonable mechanistic model. We suggest a framework based on these mechanistic models to correctly interpret the correlations between evolutionary rates and the various variables as well as the interrelationships among the variables.
Collapse
Affiliation(s)
- Sun Shim Choi
- Department of Medical Biotechnology, College of Biomedical Science, and Institute of Bioscience & Biotechnology, Kangwon National University, Chuncheon, South Korea.
| | | |
Collapse
|
10
|
Niu DK, Jiang L. Can ENCODE tell us how much junk DNA we carry in our genome? Biochem Biophys Res Commun 2012; 430:1340-3. [PMID: 23268340 DOI: 10.1016/j.bbrc.2012.12.074] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2012] [Accepted: 12/12/2012] [Indexed: 11/26/2022]
Abstract
One of the large, unsolved problems in human genetics is the proportion of functional sequences in genomes. Recently, the encyclopedia of DNA elements consortium revealed that the majority of the genome is biochemically active, which were described as biochemical functions. This has been used as evidence to pronounce the death of the junk DNA concept. In evolutionary biology, junk DNAs are sequences whose gain or loss does not seriously affect fitness of the host organism. In the human genome, a large amount of biochemical activity should be to repress the sequences so as to avoid their harmful expression. The biochemical activity is very different from functionality in the light of evolution. The single nucleotide polymorphism sites associated with disease and other phenotypes may be functional, but their abundance in the active genome regions is not reliable evidence of functionality. Because of sequence-independent functions, the proportion of functional regions would be underestimated when sequence constraints are used alone. Knockout may be the most effective means of distinguishing functional sequences from junk DNA.
Collapse
Affiliation(s)
- Deng-Ke Niu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing 100875, China.
| | | |
Collapse
|
11
|
Niu DK, Yang YF. Why eukaryotic cells use introns to enhance gene expression: splicing reduces transcription-associated mutagenesis by inhibiting topoisomerase I cutting activity. Biol Direct 2011; 6:24. [PMID: 21592350 PMCID: PMC3118952 DOI: 10.1186/1745-6150-6-24] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2011] [Accepted: 05/18/2011] [Indexed: 11/10/2022] Open
Abstract
Background The costs and benefits of spliceosomal introns in eukaryotes have not been established. One recognized effect of intron splicing is its known enhancement of gene expression. However, the mechanism regulating such splicing-mediated expression enhancement has not been defined. Previous studies have shown that intron splicing is a time-consuming process, indicating that splicing may not reduce the time required for transcription and processing of spliced pre-mRNA molecules; rather, it might facilitate the later rounds of transcription. Because the densities of active RNA polymerase II on most genes are less than one molecule per gene, direct interactions between the splicing apparatus and transcriptional complexes (from the later rounds of transcription) are infrequent, and thus unlikely to account for splicing-mediated gene expression enhancement. Presentation of the hypothesis The serine/arginine-rich protein SF2/ASF can inhibit the DNA topoisomerase I activity that removes negative supercoiling of DNA generated by transcription. Consequently, splicing could make genes more receptive to RNA polymerase II during the later rounds of transcription, and thus affect the frequency of gene transcription. Compared with the transcriptional enhancement mediated by strong promoters, intron-containing genes experience a lower frequency of cut-and-paste processes. The cleavage and religation activity of DNA strands by DNA topoisomerase I was recently shown to account for transcription-associated mutagenesis. Therefore, intron-mediated enhancement of gene expression could reduce transcription-associated genome instability. Testing the hypothesis Experimentally test whether transcription-associated mutagenesis is lower in intron-containing genes than in intronless genes. Use bioinformatic analysis to check whether exons flanking lost introns have higher frequencies of short deletions. Implications of the hypothesis The mechanism of intron-mediated enhancement proposed here may also explain the positive correlation observed between intron size and gene expression levels in unicellular organisms, and the greater number of intron containing genes in higher organisms. Reviewers This article was reviewed by Dr Arcady Mushegian, Dr Igor B Rogozin (nominated by Dr I King Jordan) and Dr Alexey S Kondrashov. For the full reviews, please go to the Reviewer's Reports section.
Collapse
Affiliation(s)
- Deng-Ke Niu
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing 100875, China.
| | | |
Collapse
|
12
|
Spliceosomal intron size expansion in domesticated grapevine (Vitis vinifera). BMC Res Notes 2011; 4:52. [PMID: 21385391 PMCID: PMC3058033 DOI: 10.1186/1756-0500-4-52] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2011] [Accepted: 03/08/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Spliceosomal introns are important components of eukaryotic genes as their structure, sizes and contents reflect the architecture of gene and genomes. Intron size, determined by both neutral evolution, repetitive elements activities and potential functional constraints, varies significantly in eukaryotes, suggesting unique dynamics and evolution in different lineages of eukaryotic organisms. However, the evolution of intron size, is rarely studied. To investigate intron size dynamics in flowering plants, in particular domesticated grapevines, a survey of intron size and content in wine grape (Vitis vinifera Pinot Noir) genes was conducted by assembling and mapping the transcriptome of V. vinifera genes from ESTs to characterize and analyze spliceosomal introns. RESULTS Uncommonly large size of spliceosomal intron was observed in V. vinifera genome, otherwise inconsistent with overall genome size dynamics when comparing Arabidopsis, Populus and Vitis. In domesticated grapevine, intron size is generally not related to gene function. The composition of enlarged introns in grapevines indicated extensive transposable element (TE) activity within intronic regions. TEs comprise about 80% of the expanded intron space and in particular, recent LTR retrotransposon insertions are enriched in these intronic regions, suggesting an intron size expansion in the lineage leading to domesticated grapevine, instead of size contractions in Arabidopsis and Populus. Comparative analysis of selected intronic regions in V. vinifera cultivars and wild grapevine species revealed that accelerated TE activity was associated with grapevine domestication, and in some cases with the development of specific cultivars. CONCLUSIONS In this study, we showed intron size expansion driven by TE activities in domesticated grapevines, likely a result of long-term vegetative propagation and intensive human care, which simultaneously promote TE proliferation and repress TE removal mechanisms such as recombination. The intron size expansion observed in domesticated grapevines provided an example of rapid plant genome evolution in response to artificial selection and propagation, and may shed light on the important genomic changes during domestication. In addition, the transcriptome approach used to gather intron size data significantly improved annotations of the V. vinifera genome.
Collapse
|
13
|
Rao YS, Wang ZF, Chai XW, Wu GZ, Zhou M, Nie QH, Zhang XQ. Selection for the compactness of highly expressed genes in Gallus gallus. Biol Direct 2010; 5:35. [PMID: 20465857 PMCID: PMC2883972 DOI: 10.1186/1745-6150-5-35] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2009] [Accepted: 05/14/2010] [Indexed: 11/10/2022] Open
Abstract
Background Coding sequence (CDS) length, gene size, and intron length vary within a genome and among genomes. Previous studies in diverse organisms, including human, D. Melanogaster, C. elegans, S. cerevisiae, and Arabidopsis thaliana, indicated that there are negative relationships between expression level and gene size, CDS length as well as intron length. Different models such as selection for economy model, genomic design model, and mutational bias hypotheses have been proposed to explain such observation. The debate of which model is a superior one to explain the observation has not been settled down. The chicken (Gallus gallus) is an important model organism that bridges the evolutionary gap between mammals and other vertebrates. As D. Melanogaster, chicken has a larger effective population size, selection for chicken genome is expected to be more effective in increasing protein synthesis efficiency. Therefore, in this study the chicken was used as a model organism to elucidate the interaction between gene features and expression pattern upon selection pressure. Results Based on different technologies, we gathered expression data for nuclear protein coding, single-splicing genes from Gallus gallus genome and compared them with gene parameters. We found that gene size, CDS length, first intron length, average intron length, and total intron length are negatively correlated with expression level and expression breadth significantly. The tissue specificity is positively correlated with the first intron length but negatively correlated with the average intron length, and not correlated with the CDS length and protein domain numbers. Comparison analyses showed that ubiquitously expressed genes and narrowly expressed genes with the similar expression levels do not differ in compactness. Our data provided evidence that the genomic design model can not, at least in part, explain our observations. We grouped all somatic-tissue-specific genes (n = 1105), and compared the first intron length and the average intron length between highly expressed genes (top 5% expressed genes) and weakly expressed genes (bottom 5% expressed genes). We found that the first intron length and the average intron length in highly expressed genes are not different from that in weakly expressed genes. We also made a comparison between ubiquitously expressed genes and narrowly expressed somatic genes with similar expression levels. Our data demonstrated that ubiquitously expressed genes are less compact than narrowly expressed genes with the similar expression levels. Obviously, these observations can not be explained by mutational bias hypotheses either. We also found that the significant trend between genes' compactness and expression level could not be affected by local mutational biases. We argued that the selection of economy model is most likely one to explain the relationship between gene expression and gene characteristics in chicken genome. Conclusion Natural selection appears to favor the compactness of highly expressed genes in chicken genome. This observation can be explained by the selection of economy model. Reviewers This article was reviewed by Dr. Gavin Huttley, Dr. Liran Carmel (nominated by Dr. Eugene V. Koonin) and Dr. Araxi Urrutia (nominated by Dr. Laurence D. Hurst).
Collapse
Affiliation(s)
- You S Rao
- Department of Biological Technology, Jiangxi Educational Institute, Nanchang, Jiangxi, China
| | | | | | | | | | | | | |
Collapse
|
14
|
Costa JH, de Melo DF, Gouveia Z, Cardoso HG, Peixe A, Arnholdt-Schmitt B. The alternative oxidase family of Vitis vinifera reveals an attractive model to study the importance of genomic design. PHYSIOLOGIA PLANTARUM 2009; 137:553-65. [PMID: 19682279 DOI: 10.1111/j.1399-3054.2009.01267.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
'Genomic design' refers to the structural organization of gene sequences. Recently, the role of intron sequences for gene regulation is being better understood. Further, introns possess high rates of polymorphism that are considered as the major source for speciation. In molecular breeding, the length of gene-specific introns is recognized as a tool to discriminate genotypes with diverse traits of agronomic interest. 'Economy selection' and 'time-economy selection' have been proposed as models for explaining why highly expressed genes typically contain small introns. However, in contrast to these theories, plant-specific selection reveals that highly expressed genes contain introns that are large. In the presented research, 'wet'Aox gene identification from grapevine is advanced by a bioinformatics approach to study the species-specific organization of Aox gene structures in relation to available expressed sequence tag (EST) data. Two Aox1 and one Aox2 gene sequences have been identified in Vitis vinifera using grapevine cultivars from Portugal and Germany. Searching the complete genome sequence data of two grapevine cultivars confirmed that V. vinifera alternative oxidase (Aox) is encoded by a small multigene family composed of Aox1a, Aox1b and Aox2. An analysis of EST distribution revealed high expression of the VvAox2 gene. A relationship between the atypical long primary transcript of VvAox2 (in comparison to other plant Aox genes) and its expression level is suggested. V. vinifera Aox genes contain four exons interrupted by three introns except for Aox1a which contains an additional intron in the 3'-UTR. The lengths of primary Aox transcripts were estimated for each gene in two V. vinifera varieties: PN40024 and Pinot Noir. In both varieties, Aox1a and Aox1b contained small introns that corresponded to primary transcript lengths ranging from 1501 to 1810 bp. The Aox2 of PN40024 (12 329 bp) was longer than that from Pinot Noir (7279 bp) because of selection against a transposable-element insertion that is 5028 bp in size. An EST database basic local alignment search tool (BLAST) search of GenBank revealed the following ESTs percentages for each gene: Aox1a (26.2%), Aox1b (11.9%) and Aox2 (61.9%). Aox1a was expressed in fruits and roots, Aox1b expression was confined to flowers and Aox2 was ubiquitously expressed. These data for V. vinifera show that atypically long Aox intron lengths are related to high levels of gene expression. Furthermore, it is shown for the first time that two grapevine cultivars can be distinguished by Aox intron length polymorphism.
Collapse
Affiliation(s)
- José Hélio Costa
- Department of Biochemistry and Molecular Biology, Federal University of Ceará, PO Box 6029, 60455-900, Fortaleza, Ceará, Brazil
| | | | | | | | | | | |
Collapse
|
15
|
Yang H. In plants, expression breadth and expression level distinctly and non-linearly correlate with gene structure. Biol Direct 2009; 4:45; discussion 45. [PMID: 19930585 PMCID: PMC2794262 DOI: 10.1186/1745-6150-4-45] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2009] [Accepted: 11/21/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Compactness of highly/broadly expressed genes in human has been explained as selection for efficiency, regional mutation biases or genomic design. However, highly expressed genes in flowering plants were shown to be less compact than lowly expressed ones. On the other hand, opposite facts have also been documented that pollen-expressed Arabidopsis genes tend to contain shorter introns and highly expressed moss genes are compact. This issue is important because it provides a chance to compare the selectionism and the neutralism views about genome evolution. Furthermore, this issue also helps to understand the fates of introns, from the angle of gene expression. RESULTS In this study, I used expression data covering more tissues and employ new analytical methods to reexamine the correlations between gene expression and gene structure for two flowering plants, Arabidopsis thaliana and Oryza sativa. It is shown that, different aspects of expression pattern correlate with different parts of gene sequences in distinct ways. In detail, expression level is significantly negatively correlated with gene size, especially the size of non-coding regions, whereas expression breadth correlates with non-coding structural parameters positively and with coding region parameters negatively. Furthermore, the relationships between expression level and structural parameters seem to be non-linear, with the extremes of structural parameters possibly scale as power-laws or logrithmic functions of expression levels. CONCLUSION In plants, highly expressed genes are compact, especially in the non-coding regions. Broadly expressed genes tend to contain longer non-coding sequences, which may be necessary for complex regulations. In combination with previous studies about other plants and about animals, some common scenarios about the correlation between gene expression and gene structure begin to emerge. Based on the functional relationships between extreme values of structural characteristics and expression level, an effort was made to evaluate the relative effectiveness of the energy-cost hypothesis and the time-cost hypothesis.
Collapse
Affiliation(s)
- Hangxing Yang
- T-Life Research Center, Department of Physics, Fudan University, Shanghai, PR China.
| |
Collapse
|
16
|
Niu DK. Exon definition as a potential negative force against intron losses in evolution. Biol Direct 2008; 3:46. [PMID: 19014515 PMCID: PMC2614967 DOI: 10.1186/1745-6150-3-46] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2008] [Accepted: 11/13/2008] [Indexed: 12/03/2022] Open
Abstract
Background Previous studies have indicated that the wide variation in intron density (the number of introns per gene) among different eukaryotes largely reflects varying degrees of intron loss during evolution. The most popular model, which suggests that organisms lose introns through a mechanism in which reverse-transcribed cDNA recombines with the genomic DNA, concerns only one mutational force. Hypothesis Using exons as the units of splicing-site recognition, exon definition constrains the length of exons. An intron-loss event results in fusion of flanking exons and thus a larger exon. The large size of the newborn exon may cause splicing errors, i.e., exon skipping, if the splicing of pre-mRNAs is initiated by exon definition. By contrast, if the splicing of pre-mRNAs is initiated by intron definition, intron loss does not matter. Exon definition may thus be a selective force against intron loss. An organism with a high frequency of exon definition is expected to experience a low rate of intron loss throughout evolution and have a high density of spliceosomal introns. Conclusion The majority of spliceosomal introns in vertebrates may be maintained during evolution not because of potential functions, but because of their splicing mechanism (i.e., exon definition). Further research is required to determine whether exon definition is a negative force in maintaining the high intron density of vertebrates. Reviewers This article was reviewed by Dr. Scott W. Roy (nominated by Dr. John Logsdon), Dr. Eugene V. Koonin, and Dr. Igor B. Rogozin (nominated by Dr. Mikhail Gelfand). For the full reviews, please go to the Reviewers' comments section.
Collapse
Affiliation(s)
- Deng-Ke Niu
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, PR China.
| |
Collapse
|