1
|
van Helden J, André B, Collado-Vides J. Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. J Mol Biol 1998; 281:827-42. [PMID: 9719638 DOI: 10.1006/jmbi.1998.1947] [Citation(s) in RCA: 409] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We present here a simple and fast method allowing the isolation of DNA binding sites for transcription factors from families of coregulated genes, with results illustrated in Saccharomyces cerevisiae. Although conceptually simple, the algorithm proved efficient for extracting, from most of the yeast regulatory families analyzed, the upstream regulatory sequences which had been previously found by experimental analysis. Furthermore, putative new regulatory sites are predicted within upstream regions of several regulons. The method is based on the detection of over-represented oligonucleotides. A specificity of this approach is to define the statistical significance of a site based on tables of oligonucleotide frequencies observed in all non-coding sequences from the yeast genome. In contrast with heuristic methods, this oligonucleotide analysis is rigorous and exhaustive. Its range of detection is however limited to relatively simple patterns: short motifs with a highly conserved core. These features seem to be shared by a good number of regulatory sites in yeast. This, and similar methods, should be increasingly required to identify unknown regulatory elements within the numerous new coregulated families resulting from measurements of gene expression levels at the genomic scale. All tools described here are available on the web at the site http://copan.cifn.unam.mx/Computational_Biology/ yeast-tools
Collapse
Affiliation(s)
- J van Helden
- Centro de Investigación sobre Fijación de Nitrógeno, Universidad Nacional Autónoma de México, AP565A Cuernavaca, Morelos, 62100, México.
| | | | | |
Collapse
|
2
|
Wada-Kiyama Y, Kiyama R. Conservation and periodicity of DNA bend sites in the human beta-globin gene locus. J Biol Chem 1995; 270:12439-45. [PMID: 7759485 DOI: 10.1074/jbc.270.21.12439] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
A total of seven DNA bend sites were mapped in the 4.4-kilobase human beta-globin gene region by the circular permutation assay. The periodicity of these sites (except one) was about every 700 (average 685.5 +/- 267.7) base pairs. All of the sites contained the sequence feature of short poly(dA) tracts, which are typical of DNA bending. The relative positions of the sites to the cap site were identical to those in the epsilon-globin gene region, suggesting that the bend sites were conserved during molecular evolution of the two globin genes. To explain this periodicity and conservation of the sites within the evolutionary unstable noncoding regions, we focused upon the appearance of a potential bend core sequence, A2N8A2N8A2 (A/A/A), and its complement, T2N8T2N8T2 (T/T/T). These sequences appeared in or very close to most of the bend sites of the globin gene regions, whereas other A+T-rich sequences or candidates for DNA bending did not. The distances between any two of the core sequences in the entire beta-globin locus showed a strong bias to a length of about 700 base pairs and its multiples, suggesting that the periodicity exists throughout the locus. The data presented here strengthen the idea of sequence-directed nucleosome phasing.
Collapse
Affiliation(s)
- Y Wada-Kiyama
- Department of Physiology, Nippon Medical School, Tokyo, Japan
| | | |
Collapse
|
3
|
Wada-Kiyama Y, Kiyama R. Periodicity of DNA bend sites in human epsilon-globin gene region. Possibility of sequence-directed nucleosome phasing. J Biol Chem 1994. [DOI: 10.1016/s0021-9258(17)31782-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
4
|
Gyuris J, Dencsö L, Polyák K, Duda E. Complex interaction of yeast nuclear proteins with the enhancer/promoter region of SV40. Curr Genet 1991; 20:359-63. [PMID: 1666981 DOI: 10.1007/bf00317062] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Though highly complex enhancers found in animal cells have not been reported to occur in yeasts they are able to activate the transcription of adjacent genes in yeast cells. Saccharomyces cerevisiae expresses a large number of nuclear proteins that are able to recognize, and specifically bind to, the enhancer sequences of the SV40 animal tumor virus. The complexity of proteins that interact with different elements of the animal enhancers is similar in yeast and animal cell nuclear extracts. Most enhancer motifs, recognized by known trans-acting factors, are protected in footprinting experiments by yeast nuclear proteins.
Collapse
Affiliation(s)
- J Gyuris
- Institute of Biochemistry, MTA Biological Research Center, Szeged, Hungary
| | | | | | | |
Collapse
|
5
|
Abstract
The DNA double helix exhibits local sequence-dependent polymorphism at the level of the single base pair and dinucleotide step. Curvature of the DNA molecule occurs in DNA regions with a specific type of nucleotide sequence periodicities. Negative supercoiling induces in vitro local nucleotide sequence-dependent DNA structures such as cruciforms, left-handed DNA, multistranded structures, etc. Techniques based on chemical probes have been proposed that make it possible to study DNA local structures in cells. Recent results suggest that the local DNA structures observed in vitro exist in the cell, but their occurrence and structural details are dependent on the DNA superhelical density in the cell and can be related to some cellular processes.
Collapse
Affiliation(s)
- E Palecek
- Max-Planck Institut für Biophysikalische Chemie, Göttingen, BRD
| |
Collapse
|
6
|
Stückle EE, Emmrich C, Grob U, Nielsen PJ. Statistical analysis of nucleotide sequences. Nucleic Acids Res 1990; 18:6641-7. [PMID: 2251125 PMCID: PMC332623 DOI: 10.1093/nar/18.22.6641] [Citation(s) in RCA: 33] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
In order to scan nucleic acid databases for potentially relevant but as yet unknown signals, we have developed an improved statistical model for pattern analysis of nucleic acid sequences by modifying previous methods based on Markov chains. We demonstrate the importance of selecting the appropriate parameters in order for the method to function at all. The model allows the simultaneous analysis of several short sequences with unequal base frequencies and Markov order k not equal to 0 as is usually the case in databases. As a test of these modifications, we show that in E. coli sequences there is a bias against palindromic hexamers which correspond to known restriction enzyme recognition sites.
Collapse
Affiliation(s)
- E E Stückle
- Max-Planck-Institut für Immunbiologie, Freiburg, FRG
| | | | | | | |
Collapse
|
7
|
Jackson DA, Dickinson P, Cook PR. Attachment of DNA to the nucleoskeleton of HeLa cells examined using physiological conditions. Nucleic Acids Res 1990; 18:4385-93. [PMID: 2167466 PMCID: PMC331255 DOI: 10.1093/nar/18.15.4385] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Although it is widely believed that eukaryotic DNA is looped by attachment to a nucleoskeleton, there is controversy about its composition and which sequences are attached to it. As most nuclear derivatives are isolated using unphysiological conditions, the criticism that attachments seen in vitro are generated artifactually has been difficult to rebut. Therefore we have re-investigated attachments of chromatin to the skeleton using physiological conditions. HeLa cells are encapsulated in agarose microbeads and lysed using Triton in a 'physiological' buffer. Then, most chromatin can be electroeluted after treatment with a restriction enzyme to leave some at the base of the loops still attached. Analysis of the size and amounts of these residual fragments indicates that the loops are 80-90kbp long. The residual fragments are stably attached, with about 1kbp of each fragment protected from nuclease attack. This is very much longer than a typical protein-binding site of 10-20bp.
Collapse
Affiliation(s)
- D A Jackson
- Sir William Dunn School of Pathology, Oxford, UK
| | | | | |
Collapse
|
8
|
Vogt P. Potential genetic functions of tandem repeated DNA sequence blocks in the human genome are based on a highly conserved "chromatin folding code". Hum Genet 1990; 84:301-36. [PMID: 2407640 DOI: 10.1007/bf00196228] [Citation(s) in RCA: 99] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
This review is based on a thorough description of the structure and sequence organization of tandemly organized repetitive DNA sequence families in the human genome; it is aimed at revealing the locus-specific sequence organization of tandemly repetitive sequence structures as a highly conserved DNA sequence code. These repetitive so-called "super-structures" or "higher-order" structures are able to attract specific nuclear proteins. I shall define this code therefore as a "chromatin folding code". Since locus-specific superstructures of tandemly repetitive sequence units are present not only in the chromosome centromere or telomere region but also on the arms of the chromosomes, I assume that their chromatin folding code may contribute to, or even organize, the folding pathway of the chromatin chain in the nucleus. The "chromatin folding code" is based on its specific "chromatin code", which describes the sequence dependence of the helical pathway of the DNA primary sequence (i.e., secondary structure) entrapping the histone octamers in preferential positions. There is no periodicity in the distribution of the nucleosomes along the DNA chain. The folding pathway of the nucleosomal chromatin chain is however still flexible and determined by e.g., the length of the DNA chain between the nucleosomes. The fixation and stabilization of the chromatin chain in the space of the nucleus (i.e., its "functional state") may be mediated by additionally unique DNA protein interactions that are dictated by the "chromatin folding code". The unique DNA-protein interactions around the centromeres of human chromosomes are revealed for example by their "C-banding". I wish to stress that it is not my aim to relate each block of repetitive DNA sequences to a specific "chromatin folding code", but I shall demonstrate that there is an inherent potential for tandem repeated sequence units to develop a locus-specific repetitive higher order structure; this potential may create a specific chromatin folding code whenever a selection force exists at the position of this repetitive DNA structure in the genome.
Collapse
Affiliation(s)
- P Vogt
- Institut für Humangenetik und Anthropologie der Universität, Heidelberg, Federal Republic of Germany
| |
Collapse
|
9
|
Bodnar JW, Hanson PI, Polvino-Bodnar M, Zempsky W, Ward DC. The terminal regions of adenovirus and minute virus of mice DNAs are preferentially associated with the nuclear matrix in infected cells. J Virol 1989; 63:4344-53. [PMID: 2778878 PMCID: PMC251051 DOI: 10.1128/jvi.63.10.4344-4353.1989] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The interaction of viral genomes with the cellular nuclear matrix was studied by using adenovirus-infected HeLa cells and minute virus of mice (MVM)-infected A-9 cells. Adenovirus DNA was associated with the nuclear matrix both early and late in infection, the tightest interaction being with DNA fragments that contain the covalently bound 5'-terminal protein. Replicative forms of MVM DNA were also found to be exclusively matrix associated during the first 16 to 20 h of infection; at later times viral DNA species accumulated in the soluble nuclear fraction at different rates, suggesting a saturation of nuclear matrix-binding sites. MVM DNA fragments enriched in the matrix fraction were also derived from the terminal regions of the viral genome. However, only the subset of fragments which possess a covalently bound 5'-terminal protein (i.e., DNA fragments in which the 5' palindromic DNA sequences are in the extended duplex rather than the hairpin conformation) were matrix associated. These observations suggest that the DNA-matrix interactions are, at least in part, mediated by the viral terminal proteins. Since these proteins have previously been shown to be intimately involved in viral DNA replication, our results further indicate that an association with the nuclear matrix may be important for viral genome replication and possibly also for efficient gene transcription.
Collapse
Affiliation(s)
- J W Bodnar
- Department of Biology, Northeastern University, Boston, Massachusetts 02115
| | | | | | | | | |
Collapse
|
10
|
Bodnar JW, Jones GS, Ellis CH. The domain model for eukaryotic DNA organization. 2: A molecular basis for constraints on development and evolution. J Theor Biol 1989; 137:281-320. [PMID: 2601348 DOI: 10.1016/s0022-5193(89)80074-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
A model for eukaryotic DNA organization has been proposed in which DNA regulatory processes depend on multiple site-specific DNA-nuclear matrix interactions throughout a DNA domain. In this model gene regulation depends on combinations of a few control factors in a cell to activate cell type-specific genes. This model suggests simple molecular mechanisms for organismal development which can account for sequential activation of appropriate groups of genes throughout development and for specific constraints on developmental pathways. Additionally, these suggested developmental pathways are consistent with mechanisms of evolution in which gradualism and punctuated equilibrium are not exclusive of one another and are interrelated mechanisms of evolution that are both induced by specific chromosomal mutations.
Collapse
Affiliation(s)
- J W Bodnar
- Department of Biology, Northeastern University, Boston, Massachusetts 02115
| | | | | |
Collapse
|
11
|
Abstract
Analysis of the nucleotide sequence of minute virus of mice (MVM) DNA indicates that the DNA termini contain clusters of potential DNA regulatory elements and that there are repetitive DNA elements highly reiterated throughout the entire genome, which may also have a role in DNA function. The left end of MVM DNA, which contains the promoter for the nonstructural genes, has a cluster of DNA elements that includes homologies to the polyoma virus enhancer, three copies of an E1A-inducible transcription factor (ATF) binding site, and a potential Z-DNA element. The MVM right end, which contains the origin of DNA replication, has a cluster of DNA elements that includes several homologies to the polyoma virus replication origin and a potential Z-DNA element. In addition, oligonucleotide frequency analysis indicates the presence of highly recurring sequence elements throughout the entire MVM genome that may be involved in regulation. This computer-aided analysis suggests similarities and significant differences in regulatory sequence organization between MVM and polyoma virus, and identifies specific DNA elements for future genetic characterization.
Collapse
Affiliation(s)
- J W Bodnar
- Northeastern University, Department of Biology, Boston, MA 02115
| |
Collapse
|
12
|
Bodnar JW. A domain model for eukaryotic DNA organization: a molecular basis for cell differentiation and chromosome evolution. J Theor Biol 1988; 132:479-507. [PMID: 3226138 DOI: 10.1016/s0022-5193(88)80086-9] [Citation(s) in RCA: 75] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
A model for eukaryotic chromatin organization is presented in which the basic structural and functional unit is the DNA domain. This simple model predicts that both chromosome replication and cell type-specific control of gene expression depend on a combination of stable and dynamic DNA-nuclear matrix interactions. The model suggests that in eukaryotes, DNA regulatory processes are controlled mainly by the intranuclear compartmentalization of the specific DNA sequences, and that control of gene expression involves multiple steps of specific DNA-nuclear matrix interactions. Predictions of the model are tested using available biochemical, molecular and cell biological data. In addition, the domain model is discussed as a simple molecular mechanism to explain cell differentiation in multi-cellular organisms and to explain the evolution of eukaryotic genomes consisting mainly of repetitive sequences and "junk" DNA.
Collapse
Affiliation(s)
- J W Bodnar
- Department of Biology, Northeastern University, Boston MA 02115
| |
Collapse
|