1
Hyrien O. Peaks cloaked in the mist: the landscape of mammalian replication origins. J Cell Biol 2015; 208:147-60. [PMID: 25601401 PMCID: PMC4298691 DOI: 10.1083/jcb.201407004] [Citation(s) in RCA: 68] [Impact Index Per Article: 7.6] [Received: 07/01/2014] [Accepted: 12/16/2014] [Indexed: 12/23/2022] Open
Abstract
Replication of mammalian genomes starts at sites termed replication origins, which historically have been difficult to locate as a result of large genome sizes, limited power of genetic identification schemes, and rareness and fragility of initiation intermediates. However, origins are now mapped by the thousands using microarrays and sequencing techniques. Independent studies show modest concordance, suggesting that mammalian origins can form at any DNA sequence but are suppressed by read-through transcription or that they can overlap the 5' end or even the entire gene. These results require a critical reevaluation of whether origins form at specific DNA elements and/or epigenetic signals or require no such determinants.
Affiliation(s)
- Olivier Hyrien
- Institut de Biologie de l'Ecole Normale Supérieure, Centre National de la Recherche Scientifique UMR8197 and Institut National de la Santé et de la Recherche Médicale U1024, 75005 Paris, France
2
Mesner LD, Valsakumar V, Cieślik M, Pickin R, Hamlin JL, Bekiranov S. Bubble-seq analysis of the human genome reveals distinct chromatin-mediated mechanisms for regulating early- and late-firing origins. Genome Res 2013; 23:1774-88. [PMID: 23861383 PMCID: PMC3814878 DOI: 10.1101/gr.155218.113] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Received: 01/20/2013] [Accepted: 07/08/2013] [Indexed: 11/24/2022]
Abstract
We have devised a method for isolating virtually pure and comprehensive libraries of restriction fragments that contained replication initiation sites (bubbles) in vivo. We have now sequenced and mapped the bubble-containing fragments from GM06990, a near-normal EBV-transformed lymphoblastoid cell line, and have compared origin distributions with a comprehensive replication timing study recently published for this cell line. We find that early-firing origins, which represent ∼32% of all origins, overwhelmingly represent zones, associate only marginally with active transcription units, are localized within large domains of open chromatin, and are significantly associated with DNase I hypersensitivity. Origin "density" falls from early- to mid-S-phase, but rises again in late S-phase to levels only 17% lower than in early S-phase. Unexpectedly, late origin density calculated on the 1-Mb scale increases as a function of increasing chromatin compaction. Furthermore, the median efficiency of origins in late-replicating, heterochromatic domains is only 25% lower than in early-replicating euchromatic loci. Thus, the activation of early- and late-firing origins must be regulated by quintessentially different mechanisms. The aggregate data can be unified into a model in which initiation site selection is driven almost entirely by epigenetic factors that fashion both the long-range and local chromatin environments, with underlying DNA sequence and local transcriptional activity playing only minor roles. Importantly, the comprehensive origin map we have prepared for GM06990 overlaps moderately well with origin maps recently reported for the genomes of four different human cell lines based on the distributions of small nascent strands.
Affiliation(s)
- Marcin Cieślik
- Department of Biochemistry & Molecular Genetics, University of Virginia School of Medicine, Charlottesville, Virginia 22908, USA
- Rebecca Pickin
- Department of Biochemistry & Molecular Genetics, University of Virginia School of Medicine, Charlottesville, Virginia 22908, USA
- Joyce L. Hamlin
- Department of Biochemistry & Molecular Genetics, University of Virginia School of Medicine, Charlottesville, Virginia 22908, USA
- Stefan Bekiranov
- Department of Biochemistry & Molecular Genetics, University of Virginia School of Medicine, Charlottesville, Virginia 22908, USA
3
Trapping DNA replication origins from the human genome. Genes (Basel) 2013; 4:198-225. [PMID: 24705160 PMCID: PMC3899975 DOI: 10.3390/genes4020198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/25/2013] [Revised: 04/05/2013] [Accepted: 04/09/2013] [Indexed: 11/17/2022] Open
Abstract
Synthesis of chromosomal DNA is initiated from multiple origins of replication in higher eukaryotes; however, little is known about these origins’ structures. We isolated the origin-derived nascent DNAs from a human repair-deficient cell line by blocking the replication forks near the origins using two different origin-trapping methods (i.e., UV- or chemical crosslinker-treatment and cell synchronization in early S phase using DNA replication inhibitors). Single-stranded DNAs (of 0.5–3 kb) that accumulated after such treatments were labeled with bromodeoxyuridine (BrdU). BrdU-labeled DNA was immunopurified after fractionation by alkaline sucrose density gradient centrifugation and cloned by complementary-strand synthesis and PCR amplification. Competitive PCR revealed an increased abundance of DNA derived from known replication origins (c-myc and lamin B2 genes) in the nascent DNA fractions from the UV-treated or crosslinked cells. Nucleotide sequences of 85 and 208 kb were obtained from the two libraries (I and II) prepared from the UV-treated log-phase cells and early S phase arrested cells, respectively. The libraries differed from each other in their G+C composition and replication-related motif contents, suggesting that differences existed between the origin fragments isolated by the two different origin-trapping methods. The replication activities for seven out of 12 putative origin loci from the early-S phase cells were shown by competitive PCR. We mapped 117 (library I) and 172 (library II) putative origin loci to the human genome; approximately 60% and 50% of these loci were assigned to the G-band and intragenic regions, respectively. Analyses of the flanking sequences of the mapped loci suggested that the putative origin loci tended to associate with genes (including conserved sites) and DNase I hypersensitive sites; however, poor correlations were found between such loci and the CpG islands, transcription start sites, and K27-acetylated histone H3 peaks.
4
RNAP-II molecules participate in the anchoring of the ORC to rDNA replication origins. PLoS One 2013; 8:e53405. [PMID: 23308214 PMCID: PMC3537633 DOI: 10.1371/journal.pone.0053405] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 06/01/2012] [Accepted: 11/29/2012] [Indexed: 12/29/2022] Open
Abstract
The replication of genomic DNA is limited to a single round per cell cycle. The first component, which recognises and remains bound to origins from recognition until activation and replication elongation, is the origin recognition complex. How origin recognition complex (ORC) proteins remain associated with chromatin throughout the cell cycle is not yet completely understood. Several genome-wide studies have undoubtedly demonstrated that RNA polymerase II (RNAP-II) binding sites overlap with replication origins and with the binding sites of the replication components. RNAP-II is no longer merely associated with transcription elongation. Several reports have demonstrated that RNAP-II molecules affect chromatin structure, transcription, mRNA processing, recombination and DNA repair, among others. Most of these activities have been reported to directly depend on the interaction of proteins with the C-terminal domain (CTD) of RNAP-II. Two-dimensional gel results and ChIP analysis presented herein suggest that stalled RNAP-II molecules bound to the rDNA chromatin participate in the anchoring of ORC proteins to origins during the G1 and S-phases. The results show that in the absence of RNAP-II, Orc1p, Orc2p and Cdc6p do not bind to origins. Moreover, co-immunoprecipitation experiments suggest that Ser2P-CTD and hypophosphorylated RNAP-II interact with Orc1p. In the context of rDNA, cryptic transcription by RNAP-II did not negatively interfere with DNA replication. However, the results indicate that RNAP-II is not necessary to maintain the binding of ORCs to the origins during metaphase. These findings highlight for the first time the potential importance of stalled RNAP-II in the regulation of DNA replication.
5
Unraveling cell type-specific and reprogrammable human replication origin signatures associated with G-quadruplex consensus motifs. Nat Struct Mol Biol 2012; 19:837-44. [PMID: 22751019 DOI: 10.1038/nsmb.2339] [Citation(s) in RCA: 295] [Impact Index Per Article: 24.6] [Received: 03/30/2012] [Accepted: 06/01/2012] [Indexed: 11/08/2022]
Abstract
DNA replication is highly regulated, ensuring faithful inheritance of genetic information through each cell cycle. In metazoans, this process is initiated at many thousands of DNA replication origins whose cell type-specific distribution and usage are poorly understood. We exhaustively mapped the genome-wide location of replication origins in human cells using deep sequencing of short nascent strands and identified ten times more origin positions than we expected; most of these positions were conserved in four different human cell lines. Furthermore, we identified a consensus G-quadruplex-forming DNA motif that can predict the position of DNA replication origins in human cells, accounting for their distribution, usage efficiency and timing. Finally, we discovered a cell type-specific reprogrammable signature of cell identity that was revealed by specific efficiencies of conserved origin positions and not by the selection of cell type-specific subsets of origins.
6
Lubelsky Y, Sasaki T, Kuipers MA, Lucas I, Le Beau MM, Carignon S, Debatisse M, Prinz JA, Dennis JH, Gilbert DM. Pre-replication complex proteins assemble at regions of low nucleosome occupancy within the Chinese hamster dihydrofolate reductase initiation zone. Nucleic Acids Res 2010; 39:3141-55. [PMID: 21148149 PMCID: PMC3082903 DOI: 10.1093/nar/gkq1276] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Indexed: 12/18/2022] Open
Abstract
Genome-scale mapping of pre-replication complex proteins has not been reported in mammalian cells. Poor enrichment of these proteins at specific sites may be due to dispersed binding, poor epitope availability or cell cycle stage-specific binding. Here, we have mapped sites of biotin-tagged ORC and MCM protein binding in G1-synchronized populations of Chinese hamster cells harboring amplified copies of the dihydrofolate reductase (DHFR) locus, using avidin-affinity purification of biotinylated chromatin followed by high-density microarray analysis across the DHFR locus. We have identified several sites of significant enrichment for both complexes distributed throughout the previously identified initiation zone. Analysis of the frequency of initiations across stretched DNA fibers from the DHFR locus confirmed a broad zone of de-localized initiation activity surrounding the sites of ORC and MCM enrichment. Mapping positions of mononucleosomal DNA empirically and computing nucleosome-positioning information in silico revealed that ORC and MCM map to regions of low measured and predicted nucleosome occupancy. Our results demonstrate that specific sites of ORC and MCM enrichment can be detected within a mammalian initiation zone, and suggest that initiation zones may be regions of generally low nucleosome occupancy where flexible nucleosome positioning permits flexible pre-RC assembly sites.
Affiliation(s)
- Yoav Lubelsky
- Department of Biological Science, Florida State University, Tallahassee, FL 32306, USA
7
Abstract
Mechanisms regulating where and when eukaryotic DNA replication initiates remain a mystery. Recently, genome-scale methods have been brought to bear on this problem. The identification of replication origins and their associated proteins in yeasts is a well-integrated investigative tool, but corresponding data sets from multicellular organisms are scarce. By contrast, standardized protocols for evaluating replication timing have generated informative data sets for most eukaryotic systems. Here, I summarize the genome-scale methods that are most frequently used to analyse replication in eukaryotes, the kinds of questions each method can address and the technical hurdles that must be overcome to gain a complete understanding of the nature of eukaryotic replication origins.
8
Abstract
Studies in our laboratory over the last three decades have shown that the Chinese hamster dihydrofolate reductase (DHFR) origin of replication corresponds to a broad zone of inefficient initiation sites distributed throughout the spacer between the convergently transcribed DHFR and 2BE2121 genes. It is clear from mutational analysis that none of these sites is genetically required for controlling origin activity. However, the integrity of the promoter of the DHFR gene is needed to activate the downstream origin, while the 3' processing signals prevent invasion and inactivation of the downstream origin by transcription forks. Several other origins in metazoans have been shown to correspond to zones of inefficient sites, while a different subset appears to be similar to the fixed replicators that characterize origins in S. cerevisiae and lower organisms. These observations have led us to suggest a model in which the mammalian genome is dotted with a hierarchy of degenerate, redundant, and inefficient replicators at intervals of a kilobase or less, some of which may have evolved to be highly circumscribed and efficient. The activities of initiation sites are proposed to be largely regulated by local transcription and chromatin architecture. Recently, we and others have devised strategies for identifying active origins on a genome-wide scale in order to define their distributions between fixed and dispersive origin types and to detect relationships among origins, genes, and epigenetic markers. The global pictures emerging are suggestive but far from complete and appear to be plagued by some of the same uncertainties that have led to conflicting views of individual origins in the past (particularly DHFR). 
In this paper, we will trace the history of origin discovery in mammalian genomes, primarily using the well-studied DHFR origin as a model, because it has been analyzed by nearly every available origin mapping technique in several different laboratories, while many origins have been identified by only one. We will address the strengths and shortcomings of the various methods utilized to identify and characterize origins in complex genomes and will point out how we and others were sometimes led astray by false assumptions and biases, as well as insufficient information. The goal is to help guide future experiments that will provide a truly comprehensive and accurate portrait of origins and their regulation. After all, in the words of George Santayana, "Those who do not learn from history are doomed to repeat it."
9
Transcription initiation activity sets replication origin efficiency in mammalian cells. PLoS Genet 2009; 5:e1000446. [PMID: 19360092 PMCID: PMC2661365 DOI: 10.1371/journal.pgen.1000446] [Citation(s) in RCA: 179] [Impact Index Per Article: 11.9] [Received: 11/19/2008] [Accepted: 03/04/2009] [Indexed: 12/24/2022] Open
Abstract
Genomic mapping of DNA replication origins (ORIs) in mammals provides a powerful means for understanding the regulatory complexity of our genome. Here we combine a genome-wide approach to identify preferential sites of DNA replication initiation at 0.4% of the mouse genome with detailed molecular analysis at distinct classes of ORIs according to their location relative to the genes. Our study reveals that 85% of the replication initiation sites in mouse embryonic stem (ES) cells are associated with transcriptional units. Nearly half of the identified ORIs map at promoter regions and, interestingly, ORI density strongly correlates with promoter density, reflecting the coordinated organisation of replication and transcription in the mouse genome. Detailed analysis of ORI activity showed that CpG island promoter-ORIs are the most efficient ORIs in ES cells and both ORI specification and firing efficiency are maintained across cell types. Remarkably, the distribution of replication initiation sites at promoter-ORIs exactly parallels that of transcription start sites (TSS), suggesting a co-evolution of the regulatory regions driving replication and transcription. Moreover, we found that promoter-ORIs are significantly enriched in CAGE tags derived from early embryos relative to all promoters. This association implies that transcription initiation early in development sets the probability of ORI activation, unveiling a new hallmark in ORI efficiency regulation in mammalian cells. The duplication of the genetic information of a cell starts from specific sites on the chromosomes called DNA replication origins. Their number varies from a few hundred in yeast cells to several thousands in human cells, distributed along the genome at comparable distances in both systems. 
An important question in the field is to understand how origins of replication are specified and regulated in the mammalian genome, as neither their location nor their activity can be directly inferred from the DNA sequence. Previous studies at individual origins and, more recently, at large scale across 1% of the human genome, have revealed that most origins overlap with transcriptional regulatory elements, and specifically with gene promoters. To gain insight into the nature of the relationship between active transcription and origin specification we have combined a genomic mapping of origins at 0.4% of the mouse genome with detailed studies of activation efficiency. The data identify two types of origins with distinct regulatory properties: highly efficient origins map at CpG island-promoters and low efficient origins locate elsewhere in association with transcriptional units. We also find a remarkable parallel organisation of the replication initiation sites and transcription start sites at efficient promoter-origins that suggests a prominent role of transcription initiation in setting the efficiency of replication origin activation.
10
Hamlin JL, Mesner LD, Lar O, Torres R, Chodaparambil SV, Wang L. A revisionist replicon model for higher eukaryotic genomes. J Cell Biochem 2008; 105:321-9. [PMID: 18680119 PMCID: PMC2574905 DOI: 10.1002/jcb.21828] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Indexed: 11/06/2022]
Abstract
The replicon model devised to explain replication control in bacteria has served as the guiding paradigm in the search for origins of replication in the more complex genomes of eukaryotes. In Saccharomyces cerevisiae, this model has proved to be extremely useful, leading to the identification of specific genetic elements (replicators) and the interacting initiator proteins that activate them. However, replication control in organisms ranging from Schizosaccharomyces pombe to mammals is far more fluid: only a small number of origins seem to represent classic replicators, while the majority correspond to zones of inefficient, closely spaced start sites none of which are indispensable for origin activity. In addition, it is apparent that the epigenetic state of a given sequence largely determines its ability to be used as a replication initiation site. These conclusions were arrived at over a period of three decades, and required the development of several novel replicon mapping techniques, as well as new ways of examining the chromatin architecture of any sequence of interest. Recently, methods have been elaborated for isolating all of the active origins in the genomes of higher eukaryotes en masse. Microarray analyses and more recent high-throughput sequencing technology will allow all the origins to be mapped onto the chromosomes of any organism whose genome has been sequenced. With the advent of whole-genome studies on gene expression and chromatin composition, the field is now positioned to define both the genetic and epigenetic rules that govern origin activity.
Affiliation(s)
- J L Hamlin
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA 22908-0733, USA.
11
Hashizume T, Shimizu N. Dissection of mammalian replicators by a novel plasmid stability assay. J Cell Biochem 2007; 101:552-65. [PMID: 17226771 DOI: 10.1002/jcb.21210] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Indexed: 01/03/2023]
Abstract
A plasmid, bearing a mammalian replication initiation region (IR) and a matrix attachment region (MAR) was previously shown to be efficiently amplified to high copy number in mammalian cells and to generate chromosomal homogeneously staining regions (HSRs). The amplification mechanism was suggested to entail a head-on collision at the MAR between the transcription machinery and the hypothetical replication fork arriving from the IR, leading to double strand breakage (DSB) that triggered HSR formation. The experiments described here show that such plasmids are stabilized if collisions involving not only promoter-driven transcription but also promoter-independent transcription are avoided, and stable plasmids appeared to persist as submicroscopic episomes. These findings suggest that the IR sequence that promotes HSR generation may correspond to the sequence that supports replication initiation (replicator). Thus, we developed a "plasmid stability assay" that sensitively detects the activity of HSR generation in a test sequence. The assay was used to dissect two replicator regions, derived from the c-myc and DHFR ori-beta loci. Consequently, minimum sequences that efficiently promoted HSR generation were identified. They included several sequence elements, most of which coincided with reported replicator elements. These data and this assay will benefit studies of replication initiation and applications that depend on plasmid amplification.
Affiliation(s)
- Toshihiko Hashizume
- Graduate School of Biosphere Science, Hiroshima University, Higashi-Hiroshima, Hiroshima 739-8521, Japan
12
Wang L, Lin CM, Lopreiato JO, Aladjem MI. Cooperative sequence modules determine replication initiation sites at the human beta-globin locus. Hum Mol Genet 2006; 15:2613-22. [PMID: 16877501 DOI: 10.1093/hmg/ddl187] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Indexed: 11/14/2022] Open
Abstract
The human beta globin locus contains two adjacent replicators, each capable of initiating DNA replication when transferred from its native locus to ectopic sites. Here, we report a detailed analysis of the sequence requirements for replication initiation from these replicators. In both replicators, initiation required a combination of an asymmetric purine:pyrimidine sequence and several AT-rich stretches. Modules from the two replicators could combine to initiate replication. AT-rich sequences were essential for replicator activity: a low frequency of initiation was observed in DNA fragments that included a short stretch of AT-rich sequences, whereas inclusion of additional AT-rich stretches increased initiation efficiency. By contrast, replication initiated at a low level without the asymmetric purine:pyrimidine modules but they were required in synergy to achieve efficient initiation. These data support a combinatorial model for replicator activity and suggest that the initiation of DNA replication requires interaction between at least two distinct sequence modules.
Affiliation(s)
- Lixin Wang
- Laboratory of Molecular Pharmacology, National Cancer Institute, Bethesda, MD 20892, USA
13
Sasaki T, Ramanathan S, Okuno Y, Kumagai C, Shaikh SS, Gilbert DM. The Chinese hamster dihydrofolate reductase replication origin decision point follows activation of transcription and suppresses initiation of replication within transcription units. Mol Cell Biol 2006; 26:1051-62. [PMID: 16428457 PMCID: PMC1347040 DOI: 10.1128/mcb.26.3.1051-1062.2006] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.1] [Indexed: 12/16/2022] Open
Abstract
Chinese hamster ovary (CHO) cells select specific replication origin sites within the dihydrofolate reductase (DHFR) locus at a discrete point during G1 phase, the origin decision point (ODP). Origin selection is sensitive to transcription but not protein synthesis inhibitors, implicating a pretranslational role for transcription in origin specification. We have constructed a DNA array covering 121 kb surrounding the DHFR locus, to comprehensively investigate replication initiation and transcription in this region. When nuclei isolated within the first 3 h of G1 phase were stimulated to initiate replication in Xenopus egg extracts, replication initiated without any detectable preference for specific sites. At the ODP, initiation became suppressed from within the Msh3, DHFR, and 2BE2121 transcription units. Active transcription was mostly confined to these transcription units, and inhibition of transcription by alpha-amanitin resulted in the initiation of replication within transcription units, indicating that transcription is necessary to limit initiation events to the intergenic region. However, the resumption of DHFR transcription after mitosis took place prior to the ODP and so is not on its own sufficient to suppress initiation of replication. Together, these results demonstrate a remarkable flexibility in sequence selection for initiating replication and implicate transcription as one important component of origin specification at the ODP.
Affiliation(s)
- Takayo Sasaki
- Department of Biochemistry and Molecular Biology, SUNY Upstate Medical University, 750 East Adams St., Syracuse, NY 13210, USA