![]() | ![]() |
Formats:
|
||||||||||||||||||
Copyright Sequeira-Mendes et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Transcription Initiation Activity Sets Replication Origin Efficiency in Mammalian Cells 1Instituto de Microbiología Bioquímica, CSIC/Universidad de Salamanca, Edificio Departamental, Salamanca, Spain 2PhD Programme in Experimental Biology and Biomedicine, Centre for Neuroscience and Cell Biology, University of Coimbra, Coimbra, Portugal 3Centro Nacional de Investigaciones Oncológicas, Madrid, Spain 4Clinical Sciences Centre, Medical Research Council, Hammersmith Hospital, London, United Kingdom 5Centre for Bioinformatics, Faculty of Natural Sciences, Imperial College London, London, United Kingdom Wendy A. Bickmore, Editor Medical Research Council Human Genetics Unit, United Kingdom * E-mail: mgvf/at/usal.es ¤Current address: Queensland Institute for Medical Research, Herston, Australia Conceived and designed the experiments: JSM MG. Performed the experiments: JSM. Analyzed the data: JSM RDU MG. Contributed reagents/materials/analysis tools: AA DH NB. Wrote the paper: MG. Received November 19, 2008; Accepted March 4, 2009. Abstract Genomic mapping of DNA replication origins (ORIs) in mammals provides a powerful means for understanding the regulatory complexity of our genome. Here we combine a genome-wide approach to identify preferential sites of DNA replication initiation at 0.4% of the mouse genome with detailed molecular analysis at distinct classes of ORIs according to their location relative to the genes. Our study reveals that 85% of the replication initiation sites in mouse embryonic stem (ES) cells are associated with transcriptional units. Nearly half of the identified ORIs map at promoter regions and, interestingly, ORI density strongly correlates with promoter density, reflecting the coordinated organisation of replication and transcription in the mouse genome. Detailed analysis of ORI activity showed that CpG island promoter-ORIs are the most efficient ORIs in ES cells and both ORI specification and firing efficiency are maintained across cell types. Remarkably, the distribution of replication initiation sites at promoter-ORIs exactly parallels that of transcription start sites (TSS), suggesting a co-evolution of the regulatory regions driving replication and transcription. Moreover, we found that promoter-ORIs are significantly enriched in CAGE tags derived from early embryos relative to all promoters. This association implies that transcription initiation early in development sets the probability of ORI activation, unveiling a new hallmark in ORI efficiency regulation in mammalian cells. Author Summary The duplication of the genetic information of a cell starts from specific sites on the chromosomes called DNA replication origins. Their number varies from a few hundred in yeast cells to several thousands in human cells, distributed along the genome at comparable distances in both systems. An important question in the field is to understand how origins of replication are specified and regulated in the mammalian genome, as neither their location nor their activity can be directly inferred from the DNA sequence. Previous studies at individual origins and, more recently, at large scale across 1% of the human genome, have revealed that most origins overlap with transcriptional regulatory elements, and specifically with gene promoters. To gain insight into the nature of the relationship between active transcription and origin specification we have combined a genomic mapping of origins at 0.4% of the mouse genome with detailed studies of activation efficiency. The data identify two types of origins with distinct regulatory properties: highly efficient origins map at CpG island-promoters and low efficient origins locate elsewhere in association with transcriptional units. We also find a remarkable parallel organisation of the replication initiation sites and transcription start sites at efficient promoter-origins that suggests a prominent role of transcription initiation in setting the efficiency of replication origin activation. Introduction DNA replication initiation is thought to be the most highly regulated process in genome duplication as cells must ensure that replication origins (ORIs) fire precisely once before cell division. A large number of studies during the last twenty years have provided a good understanding of the molecular mechanisms that regulate the initiation of DNA synthesis to occur at specific chromosomal sites and during a specific window in the cell cycle to avoid undesired re- or under-replication of any part of the eukaryotic genome [1]–[3]. Less understood is how ORI specification is achieved, particularly in metazoa where ORIs are not defined by DNA sequence and the origin recognition complex (ORC) does not show sequence specificity in vitro [4],[5]. However, metazoan ORIs are strongly linked to other genomic functions, most notably with transcription. Transcription itself can modulate ORI activity [6]–[8], transcription factors can interact with ORC [9]–[12] and the binding of transcription factors to a plasmid can localise replication initiation to that specific site [13]. In addition, recent high-throughput studies in various experimental systems have confirmed the long observed link between early replication timing and active transcription [14]–[18]. Despite these findings, the steps in the initiation process that are influenced by transcription are poorly understood. It is possible that changes in transcriptional status could modulate the initial selection of potential ORIs either during the G1 phase of the cell cycle (pre-RC formation) or during the activation of pre-RC in S-phase. Identification and characterisation of metazoan ORIs has been hindered by the complexity of these genomes and the lack of robust assays to comprehensively monitor DNA replication initiation. A recent genome-wide ORI mapping in HeLa cells over the regions covered by the ENCODE project has revealed that most initiation sites overlap with transcriptional regulatory elements, although there is not a direct link with gene regulation [19]. To further investigate the nature of the relationship between active transcription and ORI specification we have carried out an unbiased study of ORI location and efficiency in undifferentiated mouse embryonic stem (ES) cells. The chromatin environment of ES cells appears to be extremely permissive for gene transcription [20]. This status is maintained by hyperdynamic chromatin [21], bivalent chromatin marks [22] and Polycomb group proteins that suppress transcription at specific sites [23],[24], making the ES cell genome an excellent scenario to address the role of transcription in ORI selection and regulation. Here, we performed a high-resolution mapping of ORIs along 10.1 Mb of the mouse genome (~0.4%) encompassing a range of genomic features characteristic of gene-rich and gene-poor regions. Replication initiation sites were identified by hybridisation of short nascent strands on tiled genomic arrays and using a stringent algorithm that takes into account the size distribution of replication intermediates relative to the initiation point. In agreement with results from human cells, we found that in mouse ES cells most of the ORIs associate with annotated transcriptional units and nearly half of them locate at promoter regions. Moreover, we found that CpG island promoter-ORIs are the most efficient ORIs in the mouse genome and that ORI specification and firing efficiency is generally maintained across cell types. The organisation of replication initiation sites at promoter-ORIs mirrors the distribution of transcription start sites (TSS) suggesting a co-evolution of the regulatory regions of replication and transcription in the genome. Interestingly, promoter-ORIs are significantly enriched in CAGE tags derived from early embryos relative to all promoters. Our findings suggest that transcription initiation early in development sets the probability of ORI firing. Results Most ORIs in the Mouse Genome Associate to Transcriptional Units In asynchronously growing undifferentiated mouse ES cells a large proportion of the population is in the S-phase of the cell cycle. This specific property allowed us to obtain a large enough yield in purified replication intermediates to directly hybridise genomic arrays without previous amplification (see Materials and Methods). Two biological replicates of λ-exonuclease treated short nascent strands (300–800 nt in length) were co-hybridised with genomic DNA from the same cells to tiled genomic arrays covering 10.1 Mb of the mouse genome (Agilent Technologies). Arrays were analysed by a modification of ACME (Algorithm for Capturing Microarray Enrichment) [25]. ACME identifies signals in tiled array data using a sliding window centered in each probe and returning a p-value that assesses the enrichment by comparing observed and expected number of probes above a user-specified threshold (see Materials and Methods for further details). Preparations of short nascent strands purified from asynchronously growing cells are preferentially enriched in regions close to ORIs and less enriched in their immediately adjacent sequences, showing a pine-tree distribution peaking at the ORI that allows their fine mapping by quantitative real-time PCR methods (Q-PCR) [26]–[28]. Based on this property of the nascent DNA hybridised on the arrays we filtered the results from ACME to reliably identify replication initiation sites. Windows from ACME's analysis with a p-value<0.005 were further required to have a minimum of two probes per window, an average log2 ratio within the window larger than the 75th percentile of the data, and the defining probe of a window above the threshold and with a p-value<0.005 (see Materials and Methods). Replicate experiments showed a high degree of correlation and were averaged (R2 values of 0.954). Applying this stringent algorithm we identified 97 ORIs that mostly map associated to annotated transcriptional units (85%) and, specifically, at promoter regions (44%, from which 88% correspond to CpG island-promoters) (Table S1 and Figure 1A
Replication initiation at CpG islands in mammalian cells is well documented [29],[30] and our method identifies the ORIs associated with the CpG islands of the Hprt1 and Mecp2 genes precisely at the previously described sites, validating the quality of our ORI maps (ORIs 45236 and 67276, Table S2) [31],[32]. Our criterion detects ORI activity at 32% of all known promoters covered by the array (50% of the annotated CpG islands and 8% of the annotated non-CpG island promoters, Table S1). This result highlights at genomic scale the link between the regions that trigger replication and transcription initiation that has been previously suggested in studies at specific loci [26], [27], [29], [33]–[35]. Our results increase by more than one order of magnitude the number of characterised ORIs in the mouse genome. In addition, the small length of the nascent strands hybridised on the arrays and the window size chosen for the analysis allowed us to accurately define replication initiation sites within an 800 bp region (Table S2 and Figures 2
The identified ORIs were distributed at an average interorigin distance of 103 kb, however, half of them map within 60 kb distance suggesting a degree of ORI clustering (Figure 1B CpG island-ORIs Are the Most Efficient ORIs in Embryonic Stem Cells To validate our algorithm for ORI identification we selected 18 positive and 3 negative regions and analysed their abundance in independent preparations of purified 300–800 nt nascent strands by Q-PCR. Since Q-PCR defines ORIs as regions preferentially amplified in relation to their flanking sequences, we interrogated each region with 4 to 6 primer pairs spanning 2 kb across the probes defining the ORI and normalised the values to the flanking pair detecting the lowest amount of nascent strands in each case. The regions studied were representative of the observed ORI location relative to the genes. The average log2 ratios of the array duplicates for each region are shown in the top panels of the figure below the corresponding genomic maps (Figure 2 Q-PCR results suggested that CpG island promoter-ORIs were generally more efficient than non promoter-ORIs. As the arrays were hybridised with non-amplified short nascent strands, the output log2 ratios should give semi-quantitative information about ORI efficiency. Consistently, the hybridisation signals obtained at the CpG island-ORI class (mean values of 3.899) were significantly higher (p = 0.00005, Welch Two Sample T-test) than those at the non promoter-ORI class (mean values of 3.008). To be able to compare ORI efficiencies directly, we performed Q-PCR on three consecutive sucrose gradient fractions containing nascent strands of 100–600, 300–800 and 600–2000 nt in length, respectively, and normalised the abundance relative to that obtained at the negative regions in each gradient fraction (Figures 3Preparations of 100–600 nt nascent strands likely contained Okazaki fragments that co-purified with these small replication intermediates. Given that asynchronously growing ES cells were used, Okazaki fragments were expected to derive from the entire genome and to diminish the overall level of enrichment without a bias for any particular loci. Increasing the background signal, however, could critically affect the detection of weak ORIs, as seen at most non promoter-ORIs in the 100–600 nt nascent strand fraction (Figure 4B It should be noted that our experimental approach for ORI identification is not suited to detect ORIs dispersed across large regions, such as the ORI downstream of the DHFR gene in hamster CHO cells [36]. We could not, therefore, address either the abundance of broad initiation regions in the genome nor their firing efficiency. ORI Firing Efficiency Is Maintained across Cell Types To check whether this difference in ORI usage was conserved in other cell types we studied ORI firing efficiency by Q-PCR at 9 CpG island-ORIs and 10 non promoter-ORIs in preparations of 300–800 nt long nascent strands derived from mouse embryonic fibroblasts (MEFs) and NIH/3T3 transformed fibroblasts. We first analysed if DNA replication initiated at these sites in differentiated cell types by scanning a 2 kb region surrounding the ORI in experiments analogous to those shown in Figures 3
It is important to note that the higher efficiency in ORI activity found at CpG islands is not due to the overreplication occurring at promoter-ORIs that we recently reported [33]. Overreplicated intermediates are typically 100–200 bp long and their detection strictly relies on the use of cloned DNA to normalise primer pair efficiency. In this work we consistently normalised the data with genomic DNA that suppresses all possible contribution of overreplicated short fragments (either for array hybridisations or for Q-PCR measurements, see Materials and Methods). In addition, ORI firing efficiency at CpG islands was found to be consistently higher than at non-promoter ORIs along nascent strand preparations of increasing sizes, where the contribution of short overreplicated fragments is negligible (Figure 3B Highly Efficient ORIs Are Strictly Associated with TSS Closer examination of the log2 ratio profiles across several CpG island regions similar to those shown in Figure 3 Based on these observations, we asked whether two independent clusters of TSS for the same gene, but not located within the same CpG island region, were also associated with replication initiation sites. We analysed the Flna gene, which is transcribed from two alternative promoters, one located in a CpG island and another one 3.4 kb upstream that is not CpG island-associated. Our algorithm identified two separated peaks of nascent strands enrichment pointing exactly to the two tag clusters from where the transcription of the gene initiates (Figure 6C
The above results demonstrate a strong correlation between the initiation of replication and transcription at CpG island promoters and at clustered ORIs at promoter-rich regions. To address how general this association was and to test whether non-promoter ORIs might be good predictors of novel TSS regardless of their location in the genome, we extended the analysis of the histone signatures by ChIP to another 10 randomly selected non-promoter ORI regions. Figure 7A Transcription Initiation in the Embryo Specifies Replication Origin Efficiency The data presented in Figure 6 First, highly efficient ORIs would be expected to be preferentially associated to promoters driving ubiquitous expression. We considered the number of tags mapped at the TSS of each promoter-ORI as an indicator of relative promoter usage across several tissues and cell types [37] and analysed them relative to ORI occurrence. As expected, 78% of the ORIs locate at TSS where more than 75 clustered tags have been identified, representing the promoters of the most widely expressed genes at the studied regions (Figure 7C Second, many promoters in the genome should display replication initiation activity. To test this possibility we reanalysed our array data using a less stringent algorithm (see Materials and Methods). The strict algorithm detected ORI activity at 32% of annotated promoters (Table S1); applying the less stringent criterion we now detected ORI activity at 60% of the known promoters (83% of annotated CpG islands and 33% of the annotated non-CpG island promoters) (Table S1). Although in this case the association is slightly less prominent, ORIs also occur with higher frequency at the promoters with a higher number of mapped tags (Figure 7C Finally, if the spatial coincidence between replication and transcription initiation sites has a functional significance we would predict that promoter-ORIs would be transcriptionaly active in early development. To test this hypothesis we again surveyed the mouse CAGE database and analysed the tags derived from embryonic or germ line libraries mapped at each promoter-ORI relative to all known promoters [37]. CpG island associated genes, including those of tissue specific expression, are transcribed in the germ line [42]–[45], and we consistently found tags derived from early embryos and testis libraries at 90% of the CpG islands present in the array. When we performed similar analysis for CpG island-ORIs, the proportion increased to 100 and 95% (Figure 7B Discussion By combining a genome-wide approach to identify preferential sites of DNA replication initiation with in depth analysis at distinct classes of ORIs according to their genomic location, we were able to conclude that ORI firing efficiency is strongly associated to transcription initiation activity. The short size of nascent strands hybridised in the arrays and the stringent algorithm chosen to analyse the datasets allowed us to draw a highly accurate map of 97 new ORIs along 10.1 Mb of the mouse ES genome. A systematic analysis of the location of the identified ORIs revealed a strong correlation with annotated transcriptional units and specifically with the annotated 5′ ends of genes (Figure 1 Detailed measurements of nascent strand abundance at both classes of ORIs in preparations of replication intermediates of increasing sizes (Figures 3 Our results support the Jesuit model of ORI initiation proposed in the late 90's by Melvin DePamphilis (“many are called, but few are chosen”) [48],[49]. According to this model, the metazoan genome contains multiple potential sites of replication initiation whose activity is modulated during the G1 phase of each cell cycle by a combination of parameters such as nuclear organisation, chromatin structure, gene transcription or DNA sequence. This study identifies transcription initiation early in development as a strong determinant of ORI efficiency in mammalian cells. Transcription start sites of active genes are usually nucleosome-free indicating a more open chromatin conformation [38],[39] and presumably the parasitism of ORIs at TSS would increase the chances of firing through the facilitation of the assembly of the replication complexes to these sites. Indeed, a recent report showed that ORC binding to the Epstein-Barr virus origin of plasmid replication is stabilised by RNA [50], opening the possibility that nascent RNA molecules could contribute to ORC recruitment in mammalian cells. Interestingly, we found that ORI and promoter organisation are virtually identical (Figure 6 Our results point to a scenario where active promoters in germ cells and early embryonic cells will recruit pre-RCs and acquire the capability to drive replication. It is possible that the initiation of both replication and transcription at these promoter-ORIs will contribute to the configuration of a competent chromatin conformation that is a prerequisite for efficient replication initiation. This epigenetic state would then be transmitted and maintained in somatic cells. The above scenario can accommodate several observations made in various developmental systems. For example, in somatic cells, silent CpG islands on the inactive X chromosome function as ORIs as efficiently as their counterparts on the active X [32]. On the other hand, upon activation at specific developmental stages new ORIs are switched on while others are maintained [51],[52]. In addition, our work could provide experimental evidence in support of a hypothesis for the origin of CpG islands [53]. These authors proposed that CpG islands have acquired their distinct properties of C+G composition, CpG density and lack of DNA methylation due to their dual role as promoters and ORIs early in development. Since the number of CpG island associated genes is significantly smaller in mouse than in humans [54]–[56], presumably due to the different rates of CpG loss occurring during mammalian evolution [57],[58], we hypothesise that promoter-ORIs showing early embryonic expression that are not linked to CpG islands in the mouse genome would be CpG island associated in the human genome. To test this possibility we thoroughly searched for the presence of CpG islands at the human orthologous regions of the mouse promoter-ORIs identified in our work (human NCBI database build 36.3). We found that 50% of the non-CpG island associated promoter-ORIs expressed early in mouse development indeed harbour a CpG island in the human genome. However, the observed frequency of this association when considering all other promoters is only 10% (p-value = 0.007), making it tempting to speculate that the co-evolution of the regulatory regions driving replication and transcription initiation could have contributed to the shape of the mammalian genome.Materials and Methods Cell Culture The mouse embryonic stem cell line PGK12.1 was grown as described [59]. Mouse embryonic fibroblasts (MEFs) were derived from 12.5 dpc CD1 embryos and grown in F12 Nutrient Mixture (Ham) medium supplemented with 10% FCS, 1×105 U/ml penicillin, 100 mg/ml streptomycin, 2 mM L-glutamine, 1× non-essential amino acids, and 50 µM β-mercaptoethanol (Invitrogen). NIH/3T3 cells were cultivated as recommended in the ATCC. Nascent Strand Purification Genomic DNA isolation and nascent strands fractionation was performed as described [33]. Sucrose gradient fractions containing replication intermediates ranging between 100–600 nt, 300–800 nt and 600–2000 nt in size were subjected to digestion by λ-exonuclease, which degrades contaminating random sheared DNA leaving untouched DNA replication intermediates that are protected by a 5′ RNA-primer, as described [28]. ORI enrichment in 300–800 nt nascent strand preparations was routinely monitored by Q-PCR for Mecp2 CpG island-ORI region [32] and a flanking region located 1 kb downstream as control. Primer sequences are provided in Table S3. Only preparations showing a minimum of 5-times enrichment were used in array hybridisation experiments or Q-PCR validations. Three to four µg of λ-exonuclease treated nascent strands purified from 5×109 mouse embryonic stem cells were co-hybridised with the same amount of total genomic DNA for each array replicate. Hybridisation of DNA Microarrays Sample labelling, hybridisation and data extraction were performed according to standard procedures from Agilent Technologies (2005). Agilent 22K feature arrays were designed to cover two 4 Mb regions on the X chromosome (45.5–49.5 Mb and 65–69 Mb) and a 4 Mb region on chromosome 3 (95.5–99.5 Mb) of non-repetitive DNA sequences, with an average coverage of one 60-mer probe each 250 bp (Oxford Gene Technology). Probe design was based on Ensembl mouse build 35. Data Normalisation and Analysis Raw datasets from each experiment were loess normalised to remove signal intensity-dependent bias using GeneSpringX software (Agilent). Normalised data were analysed with the ACME algorithm [25], that uses the following approach to examine enrichment. First, using a user-specified threshold (0.95 in our case) probes are divided into positive probes (those with a log2 ratio larger than the specified quantile) and negative ones. ACME then uses a sliding window of fixed size (800 bp in our case) centered on each probe. Within each window, a chi-square test is used to examine enrichment by comparing the observed number of positive probes with the expected number. The p-value can be used as a rough guide to determine regions of interest (in our case, we used as cut-point a p-value<0.005). The original results from ACME are referred to as “Algorithm 2” in Figure 6 Replication intermediates abundance relative to annotated genomic features was first analysed by visual inspection using the GEB browser (http://web.bioinformatics.ic.ac.uk/geb) and manually validated in the mouse NCBI database build 36.1. Annotations for genes and transcripts were obtained from RefSeq, Ensembl and UniGene databases. CpG islands were identified using the strict algorithm displayed in the NCBI database: minimum length of 500 bp, minimum C+G content of 50% and minimum observed CpG/expected CpG of 0.6 [61]. Quantitative Real-Time PCR Quantitative real-time PCR was performed with an ABI Prism 7000 Detection System (Applied Biosystems), using SYBR Premix Ex Taq (Takara Bio Inc.) and following manufacturer's instructions. Four serial 10-fold dilutions of sonicated genomic DNA were amplified using the same reaction mixture as the samples to construct the standard curves. Primer sequences are indicated in Table S3. All real-time PCR reactions were performed in duplicate and in at least two independent preparations of nascent strands or immunoprecipitated material. Quantitative analyses were carried out using the ABI Prism 7000 SDS Software (version 1.2.3). Chromatin Immunoprecipitation PGK12.1 cells cross-linking and chromatin immunoprecipitations were performed as described [62], with the following modifications. Cells were harvested in Lysis Buffer I (5 mM PIPES pH 8.0, 85 mM KCl, 0.5% NP-40, and protease inhibitors). Nuclei were pelleted by centrifugation, resuspended in Lysis Buffer II (50 mM Tris pH 8.0, 1% SDS, and 10 mM EDTA pH 8.0, and protease inhibitors) and disrupted by sonication using Bioruptor (Diagenode), yielding genomic DNA fragments with a size distribution of 100–800 bp. For each ChIP 25 µg of chromatin were immunoprecipitated with the following polyclonal antibodies: H3 acetyl K9,14 (5 µg, Upstate), H3 tri methyl K4 (2 µg, Abcam), or H3 (2 µg, Abcam). Immune complexes were recovered by the addition of 20 µL of blocked protein A/G Plus beads (Santa Cruz) and washed and eluted as described [62]. Luciferase Reporter Assays PCR-amplified DNA fragments were cloned in both orientations upstream of the luciferase gene in the pGL3 basic vector (Promega). Constructs were cotransfected with a Renilla Luciferase Control Reporter Vector (pRL-SV40, Promega) using Lipofectamine 2000 (Invitrogen) and following manufacturer's instructions. Firefly and Renilla luciferase signals were quantified 30 h post-transfection using the Dual-Luciferase Reporter Assay System (Promega). Reporter expression was normalised with the Renilla luciferase signal and averaged across two independent transfections carried out in duplicate. Primer sequences used to amplify the fragments for cloning and insert sizes are provided in Table S3. Figure S1 Replication initiation activity at CpG island-ORIs and non promoter-ORIs in MEFs and NIH/3T3 cells. (A) Q-PCR measurements of nascent strands abundance across the positive probes defining the ORIs identified in ES cells in preparations of replication intermediates of 300–800 nt derived from MEFs. Normalisations were as in Figure 3 (2.91 MB TIF) Click here for additional data file.(2.7M, tif) Table S1 Summary of the ORI mapping data. Genomic features covered by the array, ORI distribution and percentages of ORI occurrence relative to the annotated genes along the 10.1 Mb and per region. (0.08 MB DOC) Click here for additional data file.(79K, doc) Table S2 List of the 97 newly identified mouse ORIs. The sequence of the 60-mer probe centred at the 800 bp significant window, the starting and ending position of that probe in the Ensembl mouse build 35 and the location relative to the genes is indicated. Blue rows show the 38 ORIs that were also identified in the arrays hybridised with 100–600 nt long nascent strands. (0.12 MB XLS) Click here for additional data file.(116K, xls) Table S3 List of the primers used in this work. (0.32 MB DOC) Click here for additional data file.(316K, doc) Acknowledgments We thank Mauro Lodolo, Rodrigo Lombraña, Belinda Rodríguez, and Néstor Saiz for their help in various aspects of the development of this work and Nicole Draper, Marcus Harrison, and Doug Hurd at Oxford Gene Technology for microarray probe design, hybridisations, and data extraction. We are also grateful to Francisco Antequera and Marie-Noelle Prioleau for advice and a critical reading of the manuscript, to Marie-Noelle Prioleau for sharing data with us before publication, and to the editor and two anonymous reviewers for excellent constructive criticism. Footnotes The authors have declared that no competing interests exist. Work in María Gómez's laboratory is supported by grants from the Spanish Ministry of Education and Science (BFU2007-66827) and the Consejería de Sanidad of the Junta de Castilla y León (SAN196/SA12/07). JSM was supported by a grant from the Portuguese Foundation for Science and Technology (SFRH/BD/11824/2003). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. References 1. Arias EE, Walter JC. Strength in numbers: preventing rereplication via multiple mechanisms in eukaryotic cells. Genes Dev. 2007;21:497–518. [PubMed] 2. Diffley JF. Regulation of early events in chromosome replication. Curr Biol. 2004;14:R778–786. [PubMed] 3. Tabancay AP, Jr, Forsburg SL. Eukaryotic DNA replication in a chromatin context. Curr Top Dev Biol. 2006;76:129–184. [PubMed] 4. Remus D, Beall EL, Botchan MR. DNA topology, not DNA sequence, is a critical determinant for Drosophila ORC-DNA binding. Embo J. 2004;23:897–907. [PubMed] 5. Vashee S, Cvetic C, Lu W, Simancek P, Kelly TJ, et al. Sequence-independent DNA binding and replication initiation by the human origin recognition complex. Genes Dev. 2003;17:1894–1908. [PubMed] 6. Lin HB, Dijkwel PA, Hamlin JL. Promiscuous initiation on mammalian chromosomal DNA templates and its possible suppression by transcription. Exp Cell Res. 2005;308:53–64. [PubMed] 7. Mesner LD, Hamlin JL. Specific signals at the 3′ end of the DHFR gene define one boundary of the downstream origin of replication. Genes Dev. 2005;19:1053–1066. [PubMed] 8. Saha S, Shan Y, Mesner LD, Hamlin JL. The promoter of the Chinese hamster ovary dihydrofolate reductase gene regulates the activity of the local origin and helps define its boundaries. Genes Dev. 2004;18:397–410. [PubMed] 9. Beall EL, Manak JR, Zhou S, Bell M, Lipsick JS, et al. Role for a Drosophila Myb-containing protein complex in site-specific DNA replication. Nature. 2002;420:833–837. [PubMed] 10. Bosco G, Du W, Orr-Weaver TL. DNA replication control through interaction of E2F-RB and the origin recognition complex. Nat Cell Biol. 2001;3:289–295. [PubMed] 11. Minami H, Takahashi J, Suto A, Saitoh Y, Tsutsumi K. Binding of AlF-C, an Orc1-binding transcriptional regulator, enhances replicator activity of the rat aldolase B origin. Mol Cell Biol. 2006;26:8770–8780. [PubMed] 12. Saitoh Y, Miyagi S, Ariga H, Tsutsumi K. Functional domains involved in the interaction between Orc1 and transcriptional repressor AlF-C that bind to an origin/promoter of the rat aldolase B gene. Nucleic Acids Res. 2002;30:5205–5212. [PubMed] 13. Danis E, Brodolin K, Menut S, Maiorano D, Girard-Reydet C, et al. Specification of a DNA replication origin by a transcription complex. Nat Cell Biol. 2004;6:721–730. [PubMed] 14. Farkash-Amar S, Lipson D, Polten A, Goren A, Helmstetter C, et al. Global organization of replication time zones of the mouse genome. Genome Res. 2008;18:1562–1570. [PubMed] 15. Hiratani I, Ryba T, Itoh M, Yokochi T, Schwaiger M, et al. Global reorganization of replication domains during embryonic stem cell differentiation. PLoS Biol. 2008;6:e245. doi:10.1371/journal.pbio.0060245. [PubMed] 16. MacAlpine DM, Rodriguez HK, Bell SP. Coordination of replication and transcription along a Drosophila chromosome. Genes Dev. 2004;18:3094–3105. [PubMed] 17. White EJ, Emanuelsson O, Scalzo D, Royce T, Kosak S, et al. DNA replication-timing analysis of human chromosome 22 at high resolution and different developmental states. Proc Natl Acad Sci USA. 2004;101:17771–17776. [PubMed] 18. Woodfine K, Fiegler H, Beare DM, Collins JE, McCann OT, et al. Replication timing of the human genome. Hum Mol Genet. 2004;13:191–202. [PubMed] 19. Cadoret JC, Meisch F, Hassan-Zadeh V, Luyten I, Guillet C, et al. Genome-wide studies highlight indirect links between human replication origins and gene regulation. Proc Natl Acad Sci USA. 2008;105:15837–15842. [PubMed] 20. Efroni S, Duttagupta R, Cheng J, Dehghani H, Hoeppner DJ, et al. Global transcription in pluripotent embryonic stem cells. Cell Stem Cell. 2008;2:437–447. [PubMed] 21. Meshorer E, Yellajoshula D, George E, Scambler PJ, Brown DT, et al. Hyperdynamic plasticity of chromatin proteins in pluripotent embryonic stem cells. Dev Cell. 2006;10:105–116. [PubMed] 22. Bernstein BE, Mikkelsen TS, Xie X, Kamal M, Huebert DJ, et al. A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell. 2006;125:315–326. [PubMed] 23. Boyer LA, Plath K, Zeitlinger J, Brambrink T, Medeiros LA, et al. Polycomb complexes repress developmental regulators in murine embryonic stem cells. Nature. 2006;441:349–353. [PubMed] 24. Lee TI, Jenner RG, Boyer LA, Guenther MG, Levine SS, et al. Control of developmental regulators by Polycomb in human embryonic stem cells. Cell. 2006;125:301–313. [PubMed] 25. Scacheri PC, Crawford G, Davis S. Statistics for ChIP-chip and DNase hypersensitivity experiments on NimbleGen arrays. Meth Enzymol. 2006;411:270–282. [PubMed] 26. Giacca M, Zentilin L, Norio P, Diviacco S, Dimitrova D, et al. Fine mapping of a replication origin of human DNA. Proc Natl Acad Sci USA. 1994;91:7119–7123. [PubMed] 27. Keller C, Ladenburger EM, Kremer M, Knippers R. The origin recognition complex marks a replication origin in the human TOP1 gene promoter. J Biol Chem. 2002;277:31430–31440. [PubMed] 28. Prioleau M, Gendron MC, Hyrien O. Replication of the chicken beta-globin locus: early-firing origins at the 5′ HS4 insulator and the rho- and betaA-globin genes show opposite epigenetic modifications. Mol Cell Biol. 2003;23:3536–3549. [PubMed] 29. Delgado S, Gómez M, Bird A, Antequera F. Initiation of DNA replication at CpG islands in mammalian chromosomes. EMBO J. 1998;17:2426–2435. [PubMed] 30. Ladenburger EM, Keller C, Knippers R. Identification of a binding region for human origin recognition complex proteins 1 and 2 that coincides with an origin of DNA replication. Mol Cell Biol. 2002;22:1036–1048. [PubMed] 31. Cohen SM, Hatada S, Brylawski BP, Smithies O, Kaufman DG, et al. Complementation of replication origin function in mouse embryonic stem cells by human DNA sequences. Genomics. 2004;84:475–484. [PubMed] 32. Gómez M, Brockdorff N. Heterochromatin on the inactive X chromosome delays replication timing without affecting origin usage. Proc Natl Acad Sci USA. 2004;101:6923–6928. [PubMed] 33. Gómez M, Antequera F. Overreplication of short DNA regions during S phase in human cells. Genes Dev. 2008;22:375–385. [PubMed] 34. Gray SJ, Gerhardt J, Doerfler W, Small LE, Fanning E. An origin of DNA replication in the promoter region of the human fragile X mental retardation (FMR1) gene. Mol Cell Biol. 2007;27:426–437. [PubMed] 35. Kitsberg D, Selig S, Keshet I, Cedar H. Replication structure of the human beta-globin gene domain. Nature. 1993;366:588–590. [PubMed] 36. Hamlin JL, Dijkwel PA. On the nature of replication origins in higher eukaryotes. Curr Opin Genet Dev. 1995;5:153–161. [PubMed] 37. Carninci P, Sandelin A, Lenhard B, Katayama S, Shimokawa K, et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nat Genet. 2006;38:626–635. [PubMed] 38. Guenther MG, Levine SS, Boyer LA, Jaenisch R, Young RA. A chromatin landmark and transcription initiation at most promoters in human cells. Cell. 2007;130:77–88. [PubMed] 39. Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, et al. High-resolution profiling of histone methylations in the human genome. Cell. 2007;129:823–837. [PubMed] 40. Waltz SE, Trivedi AA, Leffak M. DNA replication initiates non-randomly at multiple sites near the c-myc gene in HeLa cells. Nucleic Acids Res. 1996;24:1887–1894. [PubMed] 41. Bernstein BE, Kamal M, Lindblad-Toh K, Bekiranov S, Bailey DK, et al. Genomic maps and comparative analysis of histone modifications in human and mouse. Cell. 2005;120:169–181. [PubMed] 42. Daniels R, Lowell S, Bolton V, Monk M. Transcription of tissue-specific genes in human preimplantation embryos. Hum Reprod. 1997;12:2251–2256. [PubMed] 43. Macleod D, Ali RR, Bird A. An alternative promoter in the mouse major histocompatibility complex class II I-Abeta gene: implications for the origin of CpG islands. Mol Cell Biol. 1998;18:4433–4443. [PubMed] 44. Ponger L, Duret L, Mouchiroud D. Determinants of CpG islands: expression in early embryo and isochore structure. Genome Res. 2001;11:1854–1860. [PubMed] 45. Wise TL, Pravtcheva DD. The undermethylated state of a CpG island region in igf2 transgenes is dependent on the H19 enhancers. Genomics. 1999;60:258–271. [PubMed] 46. Core LJ, Waterfall JJ, Lis JT. Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science. 2008;322:1845–1848. [PubMed] 47. Seila AC, Calabrese JM, Levine SS, Yeo GW, Rahl PB, et al. Divergent transcription from active promoters. Science. 2008;322:1849–1851. [PubMed] 48. DePamphilis ML. Origins of DNA replication. In: DePamphilis ML, editor. DNA replication in eukaryotic cells. CSHL Press; 1996. pp. 45–86. 49. DePamphilis ML. Replication origins in metazoan chromosomes: fact or fiction? Bioessays. 1999;21:5–16. [PubMed] 50. Norseen J, Thomae A, Sridharan V, Aiyar A, Schepers A, et al. RNA-dependent recruitment of the origin recognition complex. Embo J. 2008;27:3024–3035. [PubMed] 51. Dazy S, Gandrillon O, Hyrien O, Prioleau M. Broadening of DNA replication origin usage during metazoan cell differentiation. EMBO Rep. 2006;7:806–811. [PubMed] 52. Norio P, Kosiyatrakul S, Yang Q, Guan Z, Brown NM, et al. Progressive activation of DNA replication initiation in large domains of the immunoglobulin heavy chain locus during B cell development. Molecular Cell. 2005;20:575–587. [PubMed] 53. Antequera F, Bird A. CpG islands as genomic footprints of promoters that are associated with replication origins. Curr Biol. 1999;9:R661–667. [PubMed] 54. Han L, Su B, Li WH, Zhao Z. CpG island density and its correlations with genomic features in mammalian genomes. Genome Biol. 2008;9:R79. [PubMed] 55. Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, et al. The sequence of the human genome. Science. 2001;291:1304–1351. [PubMed] 56. Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, et al. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. [PubMed] 57. Antequera F, Bird A. Number of CpG islands and genes in human and mouse. Proc Natl Acad Sci USA. 1993;90:11995–11999. [PubMed] 58. Cross SH, Bird AP. CpG islands and genes. Current Opinion in Gen Dev. 1995;5:309–314. 59. Penny GD, Kay GF, Sheardown SA, Rastan S, Brockdorff N. Requirement for Xist in X chromosome inactivation. Nature. 1996;379:131–137. [PubMed] 60. Team RDC. R: a languange and environment for statistical computing. 2008. Vienna, Austria. 61. Takai D, Jones PA. Comprehensive analysis of CpG islands in human chromosomes 21 and 22. Proc Natl Acad Sci USA. 2002;99:3740–3745. [PubMed] 62. Morey L, Brenner C, Fazi F, Villa R, Gutierrez A, et al. MBD3, a component of the NuRD complex, facilitates chromatin alteration and deposition of epigenetic marks. Mol Cell Biol. 2008;28:5912–5923. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
Genes Dev. 2007 Mar 1; 21(5):497-518.
[Genes Dev. 2007]Curr Top Dev Biol. 2006; 76():129-84.
[Curr Top Dev Biol. 2006]EMBO J. 2004 Feb 25; 23(4):897-907.
[EMBO J. 2004]Genes Dev. 2003 Aug 1; 17(15):1894-908.
[Genes Dev. 2003]Exp Cell Res. 2005 Aug 1; 308(1):53-64.
[Exp Cell Res. 2005]Genes Dev. 2004 Feb 15; 18(4):397-410.
[Genes Dev. 2004]Nature. 2002 Dec 19-26; 420(6917):833-7.
[Nature. 2002]Proc Natl Acad Sci U S A. 2008 Oct 14; 105(41):15837-42.
[Proc Natl Acad Sci U S A. 2008]Cell Stem Cell. 2008 May 8; 2(5):437-47.
[Cell Stem Cell. 2008]Dev Cell. 2006 Jan; 10(1):105-16.
[Dev Cell. 2006]Cell. 2006 Apr 21; 125(2):315-26.
[Cell. 2006]Nature. 2006 May 18; 441(7091):349-53.
[Nature. 2006]Cell. 2006 Apr 21; 125(2):301-13.
[Cell. 2006]Methods Enzymol. 2006; 411():270-82.
[Methods Enzymol. 2006]Proc Natl Acad Sci U S A. 1994 Jul 19; 91(15):7119-23.
[Proc Natl Acad Sci U S A. 1994]Mol Cell Biol. 2003 May; 23(10):3536-49.
[Mol Cell Biol. 2003]EMBO J. 1998 Apr 15; 17(8):2426-35.
[EMBO J. 1998]Mol Cell Biol. 2002 Feb; 22(4):1036-48.
[Mol Cell Biol. 2002]Genomics. 2004 Sep; 84(3):475-84.
[Genomics. 2004]Proc Natl Acad Sci U S A. 2004 May 4; 101(18):6923-8.
[Proc Natl Acad Sci U S A. 2004]Proc Natl Acad Sci U S A. 1994 Jul 19; 91(15):7119-23.
[Proc Natl Acad Sci U S A. 1994]Genes Dev. 2004 Dec 15; 18(24):3094-105.
[Genes Dev. 2004]Proc Natl Acad Sci U S A. 2008 Oct 14; 105(41):15837-42.
[Proc Natl Acad Sci U S A. 2008]Proc Natl Acad Sci U S A. 2004 May 4; 101(18):6923-8.
[Proc Natl Acad Sci U S A. 2004]Curr Opin Genet Dev. 1995 Apr; 5(2):153-61.
[Curr Opin Genet Dev. 1995]Nat Genet. 2006 Jun; 38(6):626-35.
[Nat Genet. 2006]Genes Dev. 2008 Feb 1; 22(3):375-85.
[Genes Dev. 2008]Nat Genet. 2006 Jun; 38(6):626-35.
[Nat Genet. 2006]Cell. 2007 Jul 13; 130(1):77-88.
[Cell. 2007]Cell. 2007 May 18; 129(4):823-37.
[Cell. 2007]Proc Natl Acad Sci U S A. 2008 Oct 14; 105(41):15837-42.
[Proc Natl Acad Sci U S A. 2008]Nucleic Acids Res. 1996 May 15; 24(10):1887-94.
[Nucleic Acids Res. 1996]Nat Genet. 2006 Jun; 38(6):626-35.
[Nat Genet. 2006]Cell. 2007 May 18; 129(4):823-37.
[Cell. 2007]Cell. 2005 Jan 28; 120(2):169-81.
[Cell. 2005]Nat Genet. 2006 Jun; 38(6):626-35.
[Nat Genet. 2006]Nat Genet. 2006 Jun; 38(6):626-35.
[Nat Genet. 2006]Hum Reprod. 1997 Oct; 12(10):2251-6.
[Hum Reprod. 1997]Genomics. 1999 Sep 15; 60(3):258-71.
[Genomics. 1999]Proc Natl Acad Sci U S A. 2008 Oct 14; 105(41):15837-42.
[Proc Natl Acad Sci U S A. 2008]Science. 2008 Dec 19; 322(5909):1845-8.
[Science. 2008]Science. 2008 Dec 19; 322(5909):1849-51.
[Science. 2008]Bioessays. 1999 Jan; 21(1):5-16.
[Bioessays. 1999]Cell. 2007 Jul 13; 130(1):77-88.
[Cell. 2007]Cell. 2007 May 18; 129(4):823-37.
[Cell. 2007]EMBO J. 2008 Nov 19; 27(22):3024-35.
[EMBO J. 2008]Nat Genet. 2006 Jun; 38(6):626-35.
[Nat Genet. 2006]Proc Natl Acad Sci U S A. 2004 May 4; 101(18):6923-8.
[Proc Natl Acad Sci U S A. 2004]EMBO Rep. 2006 Aug; 7(8):806-11.
[EMBO Rep. 2006]Mol Cell. 2005 Nov 23; 20(4):575-87.
[Mol Cell. 2005]Curr Biol. 1999 Sep 9; 9(17):R661-7.
[Curr Biol. 1999]Genome Biol. 2008; 9(5):R79.
[Genome Biol. 2008]Nature. 2002 Dec 5; 420(6915):520-62.
[Nature. 2002]Proc Natl Acad Sci U S A. 1993 Dec 15; 90(24):11995-9.
[Proc Natl Acad Sci U S A. 1993]Nature. 1996 Jan 11; 379(6561):131-7.
[Nature. 1996]Genes Dev. 2008 Feb 1; 22(3):375-85.
[Genes Dev. 2008]Mol Cell Biol. 2003 May; 23(10):3536-49.
[Mol Cell Biol. 2003]Proc Natl Acad Sci U S A. 2004 May 4; 101(18):6923-8.
[Proc Natl Acad Sci U S A. 2004]Methods Enzymol. 2006; 411():270-82.
[Methods Enzymol. 2006]Proc Natl Acad Sci U S A. 2002 Mar 19; 99(6):3740-5.
[Proc Natl Acad Sci U S A. 2002]Mol Cell Biol. 2008 Oct; 28(19):5912-23.
[Mol Cell Biol. 2008]