1
|
Dey P, Mattick JS. High frequency of intron retention and clustered H3K4me3-marked nucleosomes in short first introns of human long non-coding RNAs. Epigenetics Chromatin 2021; 14:45. [PMID: 34579770 PMCID: PMC8477579 DOI: 10.1186/s13072-021-00419-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Accepted: 08/27/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND It is established that protein-coding exons are preferentially localized in nucleosomes. To examine whether the same is true for non-coding exons, we analysed nucleosome occupancy in and adjacent to internal exons in genes encoding long non-coding RNAs (lncRNAs) in human CD4+ T cells and K562 cells. RESULTS We confirmed that internal exons in lncRNAs are preferentially associated with nucleosomes, but also observed an elevated signal from H3K4me3-marked nucleosomes in the sequences upstream of these exons. Examination of 200 genomic lncRNA loci chosen at random across all chromosomes showed that high-density regions of H3K4me3-marked nucleosomes, which we term 'slabs', are associated with genomic regions exhibiting intron retention. These retained introns occur in over 50% of lncRNAs examined and are mostly first introns with an average length of just 354 bp, compared to the average length of all human introns of 6355 and 7987 bp in mRNAs and lncRNAs, respectively. Removal of short introns from the dataset abrogated the high upstream H3K4me3 signal, confirming that the association of slabs and short lncRNA introns with intron retention holds genome-wide. The high upstream H3K4me3 signal is also associated with alternatively spliced exons, known to be prominent in lncRNAs. This phenomenon was not observed with mRNAs. CONCLUSIONS There is widespread intron retention and clustered H3K4me3-marked nucleosomes in short first introns of human long non-coding RNAs, which raises intriguing questions about the relationship of IR to lncRNA function and chromatin organization.
Collapse
Affiliation(s)
- Pinki Dey
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, 2052, Sydney, Australia
| | - John S Mattick
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, 2052, Sydney, Australia.
| |
Collapse
|
2
|
Hurst LD, Batada NN. Depletion of somatic mutations in splicing-associated sequences in cancer genomes. Genome Biol 2017; 18:213. [PMID: 29115978 PMCID: PMC5678748 DOI: 10.1186/s13059-017-1337-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Accepted: 10/12/2017] [Indexed: 01/01/2023] Open
Abstract
Background An important goal of cancer genomics is to identify systematically cancer-causing mutations. A common approach is to identify sites with high ratios of non-synonymous to synonymous mutations; however, if synonymous mutations are under purifying selection, this methodology leads to identification of false-positive mutations. Here, using synonymous somatic mutations (SSMs) identified in over 4000 tumours across 15 different cancer types, we sought to test this assumption by focusing on coding regions required for splicing. Results Exon flanks, which are enriched for sequences required for splicing fidelity, have ~ 17% lower SSM density compared to exonic cores, even after excluding canonical splice sites. While it is impossible to eliminate a mutation bias of unknown cause, multiple lines of evidence support a purifying selection model above a mutational bias explanation. The flank/core difference is not explained by skewed nucleotide content, replication timing, nucleosome occupancy or deficiency in mismatch repair. The depletion is not seen in tumour suppressors, consistent with their role in positive tumour selection, but is otherwise observed in cancer-associated and non-cancer genes, both essential and non-essential. Consistent with a role in splicing modulation, exonic splice enhancers have a lower SSM density before and after controlling for nucleotide composition; moreover, flanks at the 5’ end of the exons have significantly lower SSM density than at the 3’ end. Conclusions These results suggest that the observable mutational spectrum of cancer genomes is not simply a product of various mutational processes and positive selection, but might also be shaped by negative selection. Electronic supplementary material The online version of this article (doi:10.1186/s13059-017-1337-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Laurence D Hurst
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| | - Nizar N Batada
- Institute for Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK.
| |
Collapse
|
3
|
Horlbeck MA, Witkowsky LB, Guglielmi B, Replogle JM, Gilbert LA, Villalta JE, Torigoe SE, Tjian R, Weissman JS. Nucleosomes impede Cas9 access to DNA in vivo and in vitro. eLife 2016; 5:e12677. [PMID: 26987018 PMCID: PMC4861601 DOI: 10.7554/elife.12677] [Citation(s) in RCA: 282] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Accepted: 03/16/2016] [Indexed: 12/23/2022] Open
Abstract
The prokaryotic CRISPR (clustered regularly interspaced palindromic repeats)-associated protein, Cas9, has been widely adopted as a tool for editing, imaging, and regulating eukaryotic genomes. However, our understanding of how to select single-guide RNAs (sgRNAs) that mediate efficient Cas9 activity is incomplete, as we lack insight into how chromatin impacts Cas9 targeting. To address this gap, we analyzed large-scale genetic screens performed in human cell lines using either nuclease-active or nuclease-dead Cas9 (dCas9). We observed that highly active sgRNAs for Cas9 and dCas9 were found almost exclusively in regions of low nucleosome occupancy. In vitro experiments demonstrated that nucleosomes in fact directly impede Cas9 binding and cleavage, while chromatin remodeling can restore Cas9 access. Our results reveal a critical role of eukaryotic chromatin in dictating the targeting specificity of this transplanted bacterial enzyme, and provide rules for selecting Cas9 target sites distinct from and complementary to those based on sequence properties.
Collapse
Affiliation(s)
- Max A Horlbeck
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, United States
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, San Francisco, United States
- Howard Hughes Medical Institute, University of California, San Francisco, San Francisco, United States
- Center for RNA Systems Biology, University of California, San Francisco, San Francisco, United States
| | - Lea B Witkowsky
- Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, United States
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States
- CIRM Center of Excellence, University of California, Berkeley, Berkeley, United States
| | - Benjamin Guglielmi
- Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, United States
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States
- CIRM Center of Excellence, University of California, Berkeley, Berkeley, United States
| | - Joseph M Replogle
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, United States
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, San Francisco, United States
- Howard Hughes Medical Institute, University of California, San Francisco, San Francisco, United States
- Center for RNA Systems Biology, University of California, San Francisco, San Francisco, United States
| | - Luke A Gilbert
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, United States
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, San Francisco, United States
- Howard Hughes Medical Institute, University of California, San Francisco, San Francisco, United States
- Center for RNA Systems Biology, University of California, San Francisco, San Francisco, United States
| | - Jacqueline E Villalta
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, United States
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, San Francisco, United States
- Howard Hughes Medical Institute, University of California, San Francisco, San Francisco, United States
- Center for RNA Systems Biology, University of California, San Francisco, San Francisco, United States
| | - Sharon E Torigoe
- Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, United States
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States
- CIRM Center of Excellence, University of California, Berkeley, Berkeley, United States
| | - Robert Tjian
- Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, United States
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States
- CIRM Center of Excellence, University of California, Berkeley, Berkeley, United States
| | - Jonathan S Weissman
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, United States
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, San Francisco, United States
- Howard Hughes Medical Institute, University of California, San Francisco, San Francisco, United States
- Center for RNA Systems Biology, University of California, San Francisco, San Francisco, United States
| |
Collapse
|
4
|
DNA-Encoded Chromatin Structural Intron Boundary Signals Identify Conserved Genes with Common Function. Int J Genomics 2015; 2015:167578. [PMID: 25861617 PMCID: PMC4377520 DOI: 10.1155/2015/167578] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Accepted: 02/15/2015] [Indexed: 12/14/2022] Open
Abstract
The regulation of metazoan gene expression occurs in part by pre-mRNA splicing into mature RNAs. Signals affecting the efficiency and specificity with which introns are removed have not been completely elucidated. Splicing likely occurs cotranscriptionally, with chromatin structure playing a key regulatory role. We calculated DNA encoded nucleosome occupancy likelihood (NOL) scores at the boundaries between introns and exons across five metazoan species. We found that (i) NOL scores reveal a sequence-based feature at the introns on both sides of the intron-exon boundary; (ii) this feature is not part of any recognizable consensus sequence; (iii) this feature is conserved throughout metazoa; (iv) this feature is enriched in genes sharing similar functions: ATPase activity, ATP binding, helicase activity, and motor activity; (v) genes with these functions exhibit different genomic characteristics;
(vi) in vivo nucleosome positioning data confirm ontological enrichment at this feature; and (vii) genes with this feature exhibit unique dinucleotide distributions at the intron-exon boundary. The NOL scores point toward a physical property of DNA that may play a role in the mechanism of pre-mRNA splicing. These results provide a foundation for identification of a new set of regulatory DNA elements involved in splicing regulation.
Collapse
|
5
|
Trifonov EN, Volkovich Z, Frenkel ZM. Multiple levels of meaning in DNA sequences, and one more. Ann N Y Acad Sci 2012; 1267:35-8. [PMID: 22954214 DOI: 10.1111/j.1749-6632.2012.06589.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
If we define a genetic code as a widespread DNA sequence pattern that carries a message with an impact on biology, then there are multiple genetic codes. Sequences involved in these codes overlap and, thus, both interact with and constrain each other, such as for the triplet code, the intron-splicing code, the code for amphipathic alpha helices, and the chromatin code. Nucleosomes preferentially are located at the ends of exons, thus protecting splice junctions, with the N9 positions of guanines of the GT and AG junctions oriented toward the histones. Analysis of protein-coding sequences reveals numerous traces of tandem repeats, apparently formed by triplet expansion, which in effect is a genome inflation ``code''. Our data are consistent with the hypothesis that expansion of simple tandem repetition of certain aggressive triplets has been a characteristic of life from its emergence. Such expanding triplets appear to be the major factor underlying observed codon usage biases.
Collapse
Affiliation(s)
- Edward N Trifonov
- Genome Diversity Center, Institute of Evolution, University of Haifa, Mount Carmel, Haifa, Israel.
| | | | | |
Collapse
|
6
|
High resolution positioning of intron ends on the nucleosomes. Gene 2011; 489:6-10. [DOI: 10.1016/j.gene.2011.08.022] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2011] [Revised: 08/20/2011] [Accepted: 08/26/2011] [Indexed: 01/23/2023]
|
7
|
On parameters of the human genome. J Theor Biol 2011; 288:92-104. [DOI: 10.1016/j.jtbi.2011.07.021] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2011] [Revised: 06/28/2011] [Accepted: 07/21/2011] [Indexed: 02/06/2023]
|
8
|
Frenkel ZM, Bettecken T, Trifonov EN. Nucleosome DNA sequence structure of isochores. BMC Genomics 2011; 12:203. [PMID: 21510861 PMCID: PMC3097165 DOI: 10.1186/1471-2164-12-203] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2010] [Accepted: 04/21/2011] [Indexed: 12/03/2022] Open
Abstract
Background Significant differences in G+C content between different isochore types suggest that the nucleosome positioning patterns in DNA of the isochores should be different as well. Results Extraction of the patterns from the isochore DNA sequences by Shannon N-gram extension reveals that while the general motif YRRRRRYYYYYR is characteristic for all isochore types, the dominant positioning patterns of the isochores vary between TAAAAATTTTTA and CGGGGGCCCCCG due to the large differences in G+C composition. This is observed in human, mouse and chicken isochores, demonstrating that the variations of the positioning patterns are largely G+C dependent rather than species-specific. The species-specificity of nucleosome positioning patterns is revealed by dinucleotide periodicity analyses in isochore sequences. While human sequences are showing CG periodicity, chicken isochores display AG (CT) periodicity. Mouse isochores show very weak CG periodicity only. Conclusions Nucleosome positioning pattern as revealed by Shannon N-gram extension is strongly dependent on G+C content and different in different isochores. Species-specificity of the pattern is subtle. It is reflected in the choice of preferentially periodical dinucleotides.
Collapse
Affiliation(s)
- Zakharia M Frenkel
- Genome Diversity Center, Institute of Evolution, University of Haifa, Mount Carmel, Haifa 31905, Israel
| | | | | |
Collapse
|
9
|
Spies N, Nielsen CB, Padgett RA, Burge CB. Biased chromatin signatures around polyadenylation sites and exons. Mol Cell 2009; 36:245-54. [PMID: 19854133 DOI: 10.1016/j.molcel.2009.10.008] [Citation(s) in RCA: 304] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2009] [Revised: 10/06/2009] [Accepted: 10/08/2009] [Indexed: 12/29/2022]
Abstract
Core RNA-processing reactions in eukaryotic cells occur cotranscriptionally in a chromatin context, but the relationship between chromatin structure and pre-mRNA processing is poorly understood. We observed strong nucleosome depletion around human polyadenylation sites (PAS) and nucleosome enrichment just downstream of PAS. In genes with multiple alternative PAS, higher downstream nucleosome affinity was associated with higher PAS usage, independently of known PAS motifs that function at the RNA level. Conversely, exons were associated with distinct peaks in nucleosome density. Exons flanked by long introns or weak splice sites exhibited stronger nucleosome enrichment, and incorporation of nucleosome density data improved splicing simulation accuracy. Certain histone modifications, including H3K36me3 and H3K27me2, were specifically enriched on exons, suggesting active marking of exon locations at the chromatin level. Together, these findings provide evidence for extensive functional connections between chromatin structure and RNA processing.
Collapse
Affiliation(s)
- Noah Spies
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | | | | | | |
Collapse
|
10
|
Nucleosome positioning as a determinant of exon recognition. Nat Struct Mol Biol 2009; 16:996-1001. [PMID: 19684599 DOI: 10.1038/nsmb.1658] [Citation(s) in RCA: 346] [Impact Index Per Article: 21.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2009] [Accepted: 07/21/2009] [Indexed: 11/09/2022]
Abstract
Chromatin structure influences transcription, but its role in subsequent RNA processing is unclear. Here we present analyses of high-throughput data that imply a relationship between nucleosome positioning and exon definition. First, we have found stable nucleosome occupancy within human and Caenorhabditis elegans exons that is stronger in exons with weak splice sites. Conversely, we have found that pseudoexons--intronic sequences that are not included in mRNAs but are flanked by strong splice sites--show nucleosome depletion. Second, the ratio between nucleosome occupancy within and upstream from the exons correlates with exon-inclusion levels. Third, nucleosomes are positioned central to exons rather than proximal to splice sites. These exonic nucleosomal patterns are also observed in non-expressed genes, suggesting that nucleosome marking of exons exists in the absence of transcription. Our analysis provides a framework that contributes to the understanding of splicing on the basis of chromatin architecture.
Collapse
|
11
|
Gabdank I, Barash D, Trifonov EN. Nucleosome DNA Bendability Matrix(C. elegans). J Biomol Struct Dyn 2009; 26:403-11. [DOI: 10.1080/07391102.2009.10507255] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
12
|
Lifanov AP, Vlasov PK, Makeev VY, Esipova NG. Regular location of exon starts in collagen I and VII genes with periods comparable to nucleosome repeat lengths. Biophysics (Nagoya-shi) 2008. [DOI: 10.1134/s0006350908030147] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
13
|
Lazarev D, Manley JL. Concurrent splicing and transcription are not sufficient to enhance splicing efficiency. RNA (NEW YORK, N.Y.) 2007; 13:1546-57. [PMID: 17630325 PMCID: PMC1950766 DOI: 10.1261/rna.595907] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
The concept of a tight integration of transcription and splicing of mRNA precursors has been supported with increasing evidence in recent years. However, the mechanism and functional consequences of this integration remain largely unknown. We have examined how these processes impact upon one another when they occur together in HeLa nuclear extract. While both processes do in fact occur in parallel reactions in the extracts, we found no evidence that one process affects the other, under a variety of conditions tested. For example, neither the kinetics nor efficiency of splicing is significantly enhanced by de novo RNA polymerase II-mediated transcription, relative to that of presynthesized RNA added exogenously to the extract. Our results indicate that the act of transcription by RNA polymerase II in vitro is not sufficient to enhance splicing of the newly made RNA.
Collapse
Affiliation(s)
- Denis Lazarev
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | | |
Collapse
|
14
|
Abstract
Research into the origins of introns is at a critical juncture in the resolution of theories on the evolution of early life (which came first, RNA or DNA?), the identity of LUCA (the last universal common ancestor, was it prokaryotic- or eukaryotic-like?), and the significance of noncoding nucleotide variation. One early notion was that introns would have evolved as a component of an efficient mechanism for the origin of genes. But alternative theories emerged as well. From the debate between the "introns-early" and "introns-late" theories came the proposal that introns arose before the origin of genetically encoded proteins and DNA, and the more recent "introns-first" theory, which postulates the presence of introns at that early evolutionary stage from a reconstruction of the "RNA world." Here we review seminal and recent ideas about intron origins. Recent discoveries about the patterns and causes of intron evolution make this one of the most hotly debated and exciting topics in molecular evolutionary biology today.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
15
|
Orlov YL, Levitskii VG, Smirnova OG, Podkolodnaya OA, Khlebodarova TM, Kolchanov NA. Statistical analysis of DNA sequences containing nucleosome positioning sites. Biophysics (Nagoya-shi) 2006. [DOI: 10.1134/s0006350906040051] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
16
|
Rodríguez-Trelles F, Tarrío R, Ayala FJ. Models of spliceosomal intron proliferation in the face of widespread ectopic expression. Gene 2006; 366:201-8. [PMID: 16288838 DOI: 10.1016/j.gene.2005.09.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2005] [Revised: 08/04/2005] [Accepted: 09/02/2005] [Indexed: 11/27/2022]
Abstract
It is now certain that today living organisms can acquire new spliceosomal introns in their genes. The proposed sources of spliceosomal introns are exons, transposons, and other introns, including spliceosomal and group II self-splicing introns. Spliceosomal introns are thought to be the most likely source, because the inserted sequence would immediately be endowed with the essential set of intron recognition sequences, thereby preventing the deleterious effects associated with incorrect splicing. The most obvious spliceosomal intron duplication pathways involve an RNA transcript intermediate step. Therefore, for a spliceosomal intron to be originated by duplication, either the source gene from which the novel intron is derived, or that gene and the recipient gene, which contains the novel intron, would need to be expressed in the germ line. Intron proliferation surveys indicate that putative intron duplicate-containing genes do not always match detectable expression in the germ line, which casts doubt on the generality of the duplication model. However, judging mechanisms of intron gain (or loss) from present-day gene expression profiles could be erroneous, if expression patterns were different at the time the introns arose. In fact, this may likely be so in most cases. Ectopic expression, i.e., the expression of genes at times and locations where the target gene is not known to have a function, is a much more common phenomenon than previously realized. We conclude with a speculation on a possible interplay between spliceosomal introns and ectopic expression at the origin of multicellularity.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
17
|
Kogan S, Trifonov EN. Gene splice sites correlate with nucleosome positions. Gene 2005; 352:57-62. [PMID: 15862762 DOI: 10.1016/j.gene.2005.03.004] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2004] [Revised: 02/02/2005] [Accepted: 03/01/2005] [Indexed: 10/25/2022]
Abstract
Gene sequences in the vicinity of splice sites are found to possess dinucleotide periodicities, especially RR and YY, with the period close to the pitch of nucleosome DNA. This confirms previously reported findings about preferential positioning of splice junctions within the nucleosomes. The RR and YY dinucleotides oscillate counter-phase, i.e., their respective preferred positions are shifted about half-period from one another, as it was observed earlier for AA and TT dinucleotides. Species specificity of nucleosome positioning DNA pattern is indicated by the predominant use of the periodical GG(CC) dinucleotides in human and mouse genes, as opposed to predominant AA(TT) dinucleotides in Arabidopsis and C. elegans.
Collapse
Affiliation(s)
- Simon Kogan
- Genome Diversity Center, Institute of Evolution, University of Haifa, Mount Carmel, Haifa 31905, Israel
| | | |
Collapse
|
18
|
Abstract
The program RECON has been designed for constructing profiles of nucleosome potential, characterizing the probability of nucleosome formation along DNA sequences. The program used for recognition of nucleosome formation sites in genomic DNA sequences. It was developed using discriminant analysis based on a genetic algorithm method utilizing statistics for dinucleotide location within local regions of nucleosome formation sites. The program RECON is available at http://wwwmgs.bionet.nsc.ru/mgs/programs/recon/.
Collapse
Affiliation(s)
- Victor G Levitsky
- Novosibirsk State University, Pirogova 2, Novosibirsk, 630090, Russia.
| |
Collapse
|
19
|
Vinogradov AE. Compactness of human housekeeping genes: selection for economy or genomic design? Trends Genet 2004; 20:248-53. [PMID: 15109779 DOI: 10.1016/j.tig.2004.03.006] [Citation(s) in RCA: 124] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Alexander E Vinogradov
- Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Ave 4, St Petersburg 194064, Russia.
| |
Collapse
|
20
|
Abstract
An analysis of donor splice site sequence (dsss) multitude was carried out. A special method was developed to study the multitude properties. The multitude of dsss was arranged as a hierarchical tree where the "root" consists of a small group of "ancestors", VAGgtVAG (gt stands in the beginning of intron, V=C,A,G). Every ancestor has its own clan of "descendants" which represent point mutants of it. The latter are split into "progenies". The first progeny looks like point mutants of the ancestors, the second as one-point mutants of sequences of the first progeny, and so forth. This picture looks like a hierarchical phylogenetic tree. The most striking property of the multitude is the successive monotonic decrease in occurrences of the sequences from one progeny to another. It is shown that such results may be modeled by a virtual mutational process.
Collapse
Affiliation(s)
- D A Denisov
- Department of Cell Research and Immunology, George S Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel.
| | | |
Collapse
|