![]() | ![]() |
Formats:
|
||||||||||||||||||
Copyright © 2009 by Cold Spring Harbor Laboratory Press H3K27me3 forms BLOCs over silent genes and intergenic regions and specifies a histone banding pattern on a mouse autosomal chromosome 1 CeMM Research Center for Molecular Medicine, Austrian Academy of Sciences, A1030 Vienna, Austria; 2 Research Institute of Molecular Pathology, A1030, Vienna, Austria 3These authors contributed equally to this work. 4Present address: CeMM, c/o AKH, Leitstelle 5H.J2.09, Währinger Gürtel 18-20, A1090 Vienna, Austria. 5Corresponding author.E-mail denise.barlow/at/univie.ac.at; fax 43 1 4277 9546. Received May 13, 2008; Accepted November 17, 2008. This article has been cited by other articles in PMC.Abstract In mammals, genome-wide chromatin maps and immunofluorescence studies show that broad domains of repressive histone modifications are present on pericentromeric and telomeric repeats and on the inactive X chromosome. However, only a few autosomal loci such as silent Hox gene clusters have been shown to lie in broad domains of repressive histone modifications. Here we present a ChIP-chip analysis of the repressive H3K27me3 histone modification along chr 17 in mouse embryonic fibroblast cells using an algorithm named broad local enrichments (BLOCs), which allows the identification of broad regions of histone modifications. Our results, confirmed by BLOC analysis of a whole genome ChIP-seq data set, show that the majority of H3K27me3 modifications form BLOCs rather than focal peaks. H3K27me3 BLOCs modify silent genes of all types, plus flanking intergenic regions and their distribution indicates a negative correlation between H3K27me3 and transcription. However, we also found that some nontranscribed gene-poor regions lack H3K27me3. We therefore performed a low-resolution analysis of whole mouse chr 17, which revealed that H3K27me3 is enriched in mega-base-pair-sized domains that are also enriched for genes, short interspersed elements (SINEs) and active histone modifications. These genic H3K27me3 domains alternate with similar-sized gene-poor domains. These are deficient in active histone modifications, as well as H3K27me3, but are enriched for long interspersed elements (LINEs) and long-terminal repeat (LTR) transposons and H3K9me3 and H4K20me3. Thus, an autosome can be seen to contain alternating chromatin bands that predominantly separate genes from one retrotransposon class, which could offer unique domains for the specific regulation of genes or the silencing of autonomous retrotransposons. Post-translational modifications on histone tails either reflect or directly influence the transcriptional status of genes and are classified as active, when they correlate with expressed genes, or, as repressive, when they correlate with silent genes. Chromatin immunoprecipitation (ChIP) using histone modification antibodies and tiling array analysis (ChIP-chip) or new generation sequencing technology (ChIP-seq), has been used to profile histone modifications of mouse and human chromosomes (Bernstein et al. 2007; Schones and Zhao 2008). Together these analyses show that active histone modifications such as H3K4 methylation and histone acetylation are enriched on expressed genes over short focal regions at promoters and nonpromoter putative gene-regulatory regions (Heintzman et al. 2007). However, these active marks can also be found at silent gene promoters in undifferentiated embryonic stem (ES) cells and in T cells, and, active transcription has been found to correlate with additional modifications such as H3K36 tri-methylation (me3), that spreads through the transcribed gene body (Bernstein et al. 2006b; Roh et al. 2006; Barski et al. 2007). Repressive histone modifications, such as H3K9me3, H4K20me3, and H3K27me3 have been associated in many cells types with gene silencing or heterochromatin formation (Martens et al. 2005; Boyer et al. 2006; Regha et al. 2007; Wutz 2007). In contrast to active histone modifications that are restricted to gene regulatory elements or the transcribed gene body, repressive histone modifications have also been shown to cover much larger regions, such as silent gene clusters, pericentromeric and telomeric repeats or mega-base-pair domains on the inactive X chromosome (Chadwick and Willard 2004; Schotta et al. 2004; Squazzo et al. 2006). Repressive H3K27me3 modifications have attracted particular attention as they have been shown to repress developmentally important genes and are thought to maintain stem cell pluripotency (Boyer et al. 2006). However, while some studies have shown that Polycomb repressor complex 2 (PRC2) that catalyzes H3K27me3, is required for ES cell differentiation (Pasini et al. 2007), other studies have shown that ES cells retain pluripotency in the absence of functional PRC2 (Chamberlain et al. 2008). A role for H3K27me3 in repressing developmentally important genes is, however, supported by genome-wide mapping in combination with functional studies. Thus, H3K27me3 or PRC2 have been identified on key ES cell developmental regulatory genes, on genes showing lineage-specific activation and on highly conserved noncoding elements (Azuara et al. 2006; Bernstein et al. 2006a; Bracken et al. 2006; Lee et al. 2006; Squazzo et al. 2006). These studies focused on gene promoters or genomic regions containing key developmental genes and demonstrated that H3K27me3 mainly forms focal peaks of enrichment on CG-rich silent gene promoters. However, in some regions notably the four mammalian Hox gene clusters, both PRC2 and H3K27me3 covered broad domains from 10 kb up to 140 kb, which spanned entire genes or gene clusters (Ringrose 2007). Four studies have also mapped H3K27me3 modifications across the whole mouse or human genome (Barski et al. 2007; Mikkelsen et al. 2007; Pan et al. 2007; Zhao et al. 2007). These studies came to generally similar conclusions: that H3K27me3 largely formed focal modifications at silent gene promoters with a few exceptions showing modifications of broad domains containing gene clusters. In contrast, genome-wide profiles of the Drosophila genome show that H3K27me3 covers large genomic domains containing silent genes that included genic and intergenic regions (Schwartz et al. 2006; Beisel et al. 2007). Mammalian chromosomes are known to be longitudinally organized in a manner that can be visualized as a banding pattern in stained metaphase chromosomes, which shows that dark Giemsa (or G-bands) alternate with light reverse-Giemsa (or R-bands), along the chromosome length (Craig and Bickmore 1993). Cytological studies based on DNA probe hybridization have shown that genes and repeats are differently distributed between dark and light bands. Dark bands are enriched for autonomous long interspersed elements (LINEs) retrotransposons, while light bands are enriched for genes and nonautonomous short interspersed elements (SINEs) retrotransposons (Boyle et al. 1990). These banding patterns are suggested to reflect differences in chromatin structure with dark G-bands containing more condensed late-replicating chromatin. However, a detailed comparison of chromatin composition between dark and light bands has not yet been made. Here we used ChIP-chip to profile active and repressive histone modifications on mouse chr 17 in a differentiated cell type, to examine the relationship between histone modifications, silent and expressed genes and interspersed repeats, along the chromosomal length. Our results using an algorithm that identified broad regions of histone modifications named broad local enrichments (BLOCs), were confirmed by unsupervised segmentation of continuous genomic data by hidden Markov models (Day et al. 2007) and by ChIP-seq analysis, and show that while active, H3K4me and acetylation modifications form peaks—H3K27me3 forms BLOCs with an average size of 43 kb that overlap silent genes and intergenic regions. Furthermore, we show that H3K27me3 is not randomly distributed along the chromosome length instead it is enriched in regions of high gene and SINE density and depleted in regions of high LINE/long-terminal repeat (LTR) and low gene density. We also show that regions of high LINE/LTR density are depleted of active histone modifications, but are enriched for repressive H3K9me3 and H4K20me3 histone modifications. Together this shows that specific enriched histone modifications distinguish gene-poor/LINE–LTR-rich chromosomal domains from gene-rich/SINE-rich domains, which have previously been shown to correlate, respectively, with dark and light Giemsa bands in metaphase chromosomes. These two types of chromosome histone domains may provide unique compartments for the specific regulation of gene expression or the silencing of autonomous retrotransposons. Results Active histone modifications form peaks, but H3K27me3 forms BLOCs The distribution of active histone modifications H3K4me2, H3K4me3 and H3K9Ac and the repressive H3K27me3 modification were mapped across nonrepeat regions of mouse chr 17 in two independent mouse embryonic fibroblast (MEF) cell lines MEFB1 and MEFF (Regha et al. 2007; Supplemental Table 1) using a custom chr 17 oligonucleotide NimbleGen tiling array chip described in Methods (Fig. 1
All three active histone modifications formed typical peak shapes with an average width of 2.4–2.9 kb, while H3K27me3 BLOCs had an average size of 43 kb with a high variation (Fig. 2A Silent and expressed genes cluster close to H3K27me3 BLOCs Visual inspection of H3K27me3 BLOCs across chr 17 indicated that although they contain silent genes they are often closely flanked by expressed genes (Fig. 1 We therefore determined how H3K27me3 BLOCs and active histone peaks are spatially distributed across whole chr 17 in relation to expressed and silent genes. Gene expression status was determined by cDNA hybridization to the same tiling array used for ChIP-chip (see Methods). An RNA chip analysis was performed independently on MEFB1 and MEFF and both cell lines gave similar results (Supplemental Fig. 4; data not shown). This analysis showed that H3K4me2, H3K4me3, and H3K9Ac were most often associated with expressed genes. On average 89% of expressed genes were marked by H3K4me2, 64% by H3K4me3, and 73% by H3K9Ac (data not shown). A small number of silent genes also contained active histone modification peaks, 26% with H3K4me2, 6% with H3K4me3, and 1% with H3K9Ac (data not shown). These three active histone modifications were frequently located on the same genomic regions, but were rarely found to associate with H3K27me3, whether analyzed as ChIPOTle peaks or as BLOCs (Fig. 2D
Since the analysis of expressed genes in Figure 3A Validation of H3K27me3 ChIP-chip BLOCs We performed two tests on the H3K27me3 ChIP-chip data. First, we determined the statistical significance of the difference of the H3K27me3 enrichment within BLOCs and the remaining chip, which shows the signal within BLOCs is significantly higher (P < 10−4) (Supplemental Fig. 6). Second, we performed an analysis of whole chr 17 H3K27me3 ChIP-chip profiles and expression identified by RNA chip using hidden Markov modeling (HMM) and Viterbi segmentation (Day et al. 2007), which confirmed that H3K27me3 and transcription show a negative correlation and that H3K27me3 forms BLOCs (Supplemental Fig. 7). Previous descriptions of H3K27me3 profiles across whole mouse and human genomes, based on ChIP-chip or on ChIP-seq analyses, have mostly shown that H3K27me3 forms focal modifications at silent gene promoters with a few exceptions showing modifications of broad domains containing gene clusters (Barski et al. 2007; Mikkelsen et al. 2007; Pan et al. 2007; Zhao et al. 2007). In order to test if the identification of H3K27me3 BLOCs was a consequence of the low dynamic range of tiling arrays, we selected one large 365.4 kb BLOC on mouse chr 17 and performed a scanning qPCR assay throughout the BLOC, using primers spaced at ~10 kb intervals and the same MEFF ChIP material used for the ChIP-chip analysis (Fig. 4A
We next analyzed the same MEFF chip material by whole genome sequencing (ChIP-seq, see Methods for details) and the results are similar to the scanning qPCR assay and show a broad enriched domain with oscillating peaks coincident with the ChIP-chip identified H3K27me3 BLOC. In Figure 4B As an additional validation test for the H3K27me3 ChIP-chip BLOCs identified here, we compared our data with published H3K27me3 profiles from the same mouse region obtained by ChIP-seq of MEF cells (Mikkelsen et al. 2007). A distance distribution analysis (Supplemental Fig. 5C) performed on the Mikkelsen et al. (2007) data shows that H3K27me3 ChIP-seq peaks are located inside silent genes and intergenic regions, but not in expressed genes, similar to the analysis of H3K27me3 BLOCs shown in Figure 3B Histone modifications identify two types of chromosome domain on chr 17 that correlate with gene and repeat density While the H3K27me3 data analysis indicated a negative correlation between transcription and H3K27me3 when the analysis focused on genes and intergenic regions, visual inspection of 5 Mb windows across whole chr 17 identified nontranscribed regions that lacked H3K27me3. We tested if H3K27me3 preferentially modifies genes in specific chromosomal domains regions by generating histone modification and gene and repeat density profiles at a low resolution across the whole of chr 17. Figure 5
The profiles obtained for these two histone domain types were next quantitatively analyzed with the HMM-Seg program (Day et al. 2007) to identify enriched domains along chr 17 (Fig. 6
A similar analysis of the Mikkelsen et al. (2007) ChIP-seq data set derived from formaldehyde-fixed chromatin, did not identify two alternating domain types on mouse chr 17 in MEF cells (see comment in Supplemental Fig. 10 legend; data not shown). However, two alternating domain types were seen on human chr 18 in human T cells using ChIP-seq data (Barski et al. 2007) that was derived from native ChIP, similarly to the data obtained here. Supplemental Figure 10 shows that on human chr 18 the repressive H3K27me3 and the active H3K4me3 marks form enriched domains that alternate with H3K9me3 enriched domains and the HMM-Seg algorithm also identifies these alternating domains. While repeat and gene density patterns on human chr 18 were not as clearly mutually exclusive as found on mouse chr 17, the H3K9me3 domains show a similar tendency to correlate with gene-poor and SINE-poor regions (Supplemental Fig. 10). Discussion Here we present an analysis of active and repressive histone modifications and their relationship to gene expression as well as to gene and retrotransposon density, along the length of mouse chr 17 in MEF differentiated cells. We show, using ChIP-chip and ChIP-seq data, that repressive H3K27me3 modifications are found as broad localized regions that we call BLOCs, which are also a feature of all mouse chromosomes. H3K27me3 BLOCs on chr 17 cover all types of silent genes as well as intergenic regions. Expressed genes are excluded from BLOCs, but are found in the immediate flanking regions. We expanded this gene-centered 100 bp high-resolution analysis of H3K27me3 to a chromosome-wide low-resolution analysis of the averaged H3K27me3 distribution over 200 kb windows. This lower resolution analysis showed that H3K27me3 is enriched in gene-rich/SINE-rich domains and depleted in gene-poor/LINE-rich domains. We further show that two active histone modifications (H3Ac, H4Ac) as well as H4K20me1 (whose role in gene expression is not yet clear), show the same chromosome domain enrichment as H3K27me3. In contrast, repressive H3K9me3 and H4K20me3 modifications show the opposite pattern and are enriched in gene-poor/LINE-rich chromosome domains. As metaphase chromosomes have been shown to contain alternating light R-bands and dark G-bands that are, respectively, gene-rich/SINE-rich and gene-poor/LINE-rich (Boyle et al. 1990), these results indicate that R- and G-bands are also associated with specific combinations of histone modification in nonsynchronized cell populations that principally contain interphase chromosomes. Active histone modifications form peaks, but H3K27me3 forms BLOCs The histone modification profiles identified here show characteristic features in terms of size of modified region, total chromosome coverage and position relative to genes and gene activity. Peaks of active histone modification had an average width of 2.6 kb and covered 2%–4% of mouse chr 17, with H3K4me2 being two- to threefold more abundant than H3K9Ac and H3K4me3. In contrast, repressive H3K27me3 modifications were found in broad localized regions named BLOCs that had an average size of 43 kb and covered 11% of mouse chr 17. These modifications also differed in position relative to genes and intergenic regions with active histone modifications mostly found on genes with a bias toward promoters. However, repressive H3K27me3 BLOCs were equally distributed over genes and intergenic regions and did not specifically mark promoters. The ChIP-chip profiles described here for H3K4me2, H3K4me3, and H3K9Ac are in agreement with those obtained using ChIP-chip or ChIP-seq from different mammalian cell types, which have generally shown that these active histone modifications are enriched over short focal regions at promoters and nonpromoter putative gene-regulatory regions (Bernstein et al. 2007; Heintzman et al. 2007; Schones and Zhao 2008). In contrast, the ChIP-chip profiles described here for H3K27me3 show differences with previous results in terms of the types of genes modified and the size and position of the modified region. Our interest in H3K27me3 arose from a study of an imprinted gene cluster in MEFs where we noted that flanking silent nonimprinted genes, which lack clear developmental roles, were modified by broad regions of H3K27me3 (Regha et al. 2007). Previous results mainly from ES cells identified an association between H3K27me3 and silent developmental regulatory genes (Bernstein et al. 2006a; Lee et al. 2006). While an association with silent lineage-specific genes was also noted in differentiated cell types (Squazzo et al. 2006), the results presented here differ, as they show a general association between H3K27me3 and silent genes that lie in gene-rich/SINE-rich chromosomal domains in MEF differentiated cells. Thus, while H3K27me3 has been shown to play a role in repressing genes that promote ES cell differentiation (Boyer et al. 2006; Pasini et al. 2007), our results indicate that H3K27me3 may also play a general repressive role in gene regulation in differentiated cells. The ChIP-chip profiles presented here show that H3K27me3 modifications mostly form BLOCs, not peaks. Previous studies that mainly focused on gene promoters identified punctate H3K27me3 patterns and a small number of broad or “blanket” H3K27me3 patterns covering silent gene clusters (Ringrose 2007). Only a few studies have generated continuous H3K27me3 profiles across large chromosomal regions in the mammalian genome, however, these also identified the majority of H3K27me3 enrichment as narrow peaks on silent gene promoters, although broad H3K27me3 enrichment was seen over silent Hox gene clusters and sometimes in the gene body of silent genes (Squazzo et al. 2006; Barski et al. 2007; Mikkelsen et al. 2007; Pan et al. 2007; Zhao et al. 2007). Despite these differences with published maps, there are four arguments that indicate the H3K27me3 BLOCs identified here accurately describe its profile across mouse chr 17 in MEFs. First, the specificity of the H3K27me3 antibody used here has been fully demonstrated (Peters et al. 2003; Perez-Burgos et al. 2004; Schwartz et al. 2006). Second, the native ChIP-chip technique used here produces profiles of active histone modification profiles in agreement with previous results (Bernstein et al. 2007; Heintzman et al. 2007; Schones and Zhao 2008). Third, the estimated chromosomal coverage of H3K27me3 BLOCs of 11%–26% in MEFs, is in agreement with quantitative mass spectrometry estimates that 10%–20% of histones are modified in ES cells by H3K27me3 (Peters et al. 2003). Lastly, we show that two alternative techniques (qPCR and ChIP-seq) identify H3K27me3 BLOCs similar to those seen in ChIP-chip analysis. It is possible that identification of H3K27me3 modification patterns is influenced by the chosen peak-finding threshold that can affect the ability to detect peaks or BLOCs. In the example shown in Supplemental Figure 9, we also apply the BLOC algorithm to the ChIP-chip data from human ES cells previously characterized as mostly containing H3K27me3 peaks (Pan et al. 2007), which identifies H3K27me3 BLOCs that are conserved between the mouse and human genome. The BLOC algorithm failed to identify broad modified regions in a published ChIP-seq data set from mouse MEF cells (Mikkelsen et al. 2007), however 79% of H3K27me3 peaks indentified in this data set coincide with the BLOCs identified here. Notably, while the H3K27me3 ChIP-seq data generated here show oscillating peaks of enrichment throughout the BLOC, only 10%–15% of identified ChIP-seq peaks correlate with promoters (Supplemental Fig. 9; data not shown). This is in contrast to published ChIP-seq data (Mikkelsen et al. 2007), which show that 41% of observed H3K27me3 peaks correlate with promoters. While we currently have no explanation for this difference, our analysis of the published H3K27me3 ChIP-seq peaks (Mikkelsen et al. 2007) does show that they maintain the same spatial correlation relative to expressed and silent genes and intergenic regions as observed for H3K27me3 BLOCs. Silent and expressed genes cluster close to H3K27me3 BLOCs We used distance plots to show that silent genes lie within H3K27me3 BLOCs, which is in agreement with previous results that identify this modification as repressive (Boyer et al. 2006; Pasini et al. 2007). Notably, expressed genes were preferentially located in regions immediately flanking H3K27me3 BLOCs. This produced a characteristic three-peak pattern when expressed and silent genes on chr 17 were analyzed together, that is also reproduced in a whole genome ChIP-seq data set. As visual inspection of the data identified many incidents where H3K27me3 BLOCs appeared to be abruptly terminated by active transcription (Fig. 1
Histone modifications identify two types of chromosome bands that correlate with gene and repeat density The analysis of H3K27me3 BLOCs relative to silent and expressed genes indicates it is induced by lack of active transcription rather than sequence-specific features. However, this interpretation is contradicted by the presence of gene-poor nontranscribed genomic regions that lack H3K27me3, which prompted us to examine the relationship between histone modifications and sequence features, such as genes and retrotransposon repeats along mouse chr 17. Our results show that histone modifications identify two types of alternating chromosomal domains that correlate with gene density and different types of retrotransposon elements. Type 1 is gene-rich and correlates with active H3Ac and H4Ac histone modifications, H4K20me1 (whose role in transcription is not clear) (Karachentsev et al. 2005; Papp and Muller 2006), repressive H3K27me3 modifications and nonautonomous SINE retrotransposons. Type 2 is gene poor and correlates with repressive H3K9me3 and H4K20me3 modifications and autonomous LINES and LTR retrotransposons. As previous studies of metaphase chromosomes (Boyle et al. 1990; Craig and Bickmore 1993) have shown that gene-rich/SINE-rich regions are lightly stained by Giemsa (known as light R-bands), while gene-poor/LINE-rich regions are darkly stained by Giemsa (known as dark G-bands), this indicates that specific patterns of histone modifications may discriminate Giemsa bands in nonsynchronized cell populations principally composed of interphase cells. The tiling array used in this ChIP-chip study was repeat masked, thus the obtained histone profiles do not directly result from hybridization to repeats themselves, but from hybridization to single copy sequences flanking the repeats. However, LTR retrotransposons have been shown to be enriched for H3K9me3 and/or H4K20me3 (Martens et al. 2005; Mikkelsen et al. 2007), thus the enrichment on flanking single copy regions may reflect retrotransposon epigenetic modifications. A connection between histone modifications and chromosome domains has previously been made in human T cells, which similarly showed that dark G-bands correlate with H3K4 methylation depletion and H3K9me3 enrichment, while light R-bands correlate with H3K4 methylation enrichment and reduced H3K9me3 (Barski et al. 2007). Our low-resolution analysis of this data set supports this observation by showing that alternating domains of H3K4me3 (plus H3K27me3) and H3K9me3 occur along human chr 18 and that the H3K9me3 domains correlate with dark staining Giemsa bands (Supplemental Fig. 10). Mutual exclusive distribution of H3K27me3 and H3K9me3 based on ChIP-chip data has also been demonstrated in several mammalian cell lines (Squazzo et al. 2006). A published analysis of ENCODE regions (Thurman et al. 2007) using the same computational approach that we applied in Figure 6 The findings in this study refine our understanding of the distribution of histone modifications in two ways. First we show by high-resolution ChIP profiling that H3K27me3 is not restricted to the promoter regions of silent genes, but instead generally marks broad localized regions that include silent genes and intergenic regions. Second, we use a low-resolution analysis to show that large chromosomal domains on an autosome are alternately enriched for distinctive histone modifications that correlate with gene density and different retrotransposon elements. While these low-resolution histone-banding patterns do not exclude the existence of smaller domains within them that may show contrary patterns, they do indicate the possibility that chromosomes are subdivided into domains that largely regulate genes and domains that largely silence autonomous retrotransposons. These two regions may impose different epigenetic constraints, e.g., genes lying close to LINE/LTR-rich bands may be more affected by epigenetic mechanisms normally directed toward transposon silencing, as has been well demonstrated in the plant genome (Weil and Martienssen 2008). An improved understanding of genes and their relative position to specific enriched histone domains may also give new insights into genes showing epigenetic dysregulation in development and disease. Methods Chromatin immunoprecipitation (ChIP), microarray design, and hybridizations Native ChIP, T7 in vitro amplification, RNA-chip, and the MEFF and MEFB1 cell lines were described previously (Regha et al. 2007). Antibodies, cells, and replicates are listed in Supplemental Table 1. The custom mouse chr 17 NimbleGen tiling array was designed by identifying 50 bp windows containing at least 18 unique 17-mers with a maximum distance of 15 bp (Thomas Jenuwein and the GEN-AU bioinformatics team, pers. comm.). This identified 390,000/50-mers giving a resolution of ~100 bp from single copy sequences (program is available on request) from mouse chr 17: 3021,656 to 92,867,543 (UCSC [mm6], March 2005, Build 34). chip hybridizations and scanning were performed by NimbleGen Systems Iceland and involved two technical replicates (with a dye swap) for 2 MEF cell lines (MEFB1 and MEFF) for most histone modification. The data were Tukey bi-weight normalized before analysis. Identification of focal regions enriched in active histone modifications and H3K27me3 from ChIP-chip data Using an implementation of ChIPOTle (Buck et al. 2005), a P-value was calculated for each 1500 bp window with at least 8 probes and assigned to bins of size 1 × 10−5. The cutoff for selection of enriched windows was determined by generating a null distribution through permutation of the signals (400 times) and assigning the resulting P-values to bins. The bin where the ratio of summed averaged randomized counts/summed real counts was still smaller than the chosen false discovery rate (1 × 10−7) was the upper limit for the P-value. A second selection step removed windows where the intensity of less than one-third of the probes was below the sum of the mean and the standard deviation of all the experimental probes. Peaks were joined if they overlapped or were within 500 bp of each other. Identification of H3K27me3 broad local enrichments (BLOCs) The start of a BLOC is defined by 10–13 consecutive probes, where 10 probes show a positive log2 (ChIP/input) value. The BLOC end was defined by 6–8 probes where six probes show a negative log2 (ChIP/input) value. BLOCs with a median log2 (ChIP/input) value greater than 0.25× standard deviations above the median log2 (ChIP/input) value of all probes of the chip were used. BLOCs that were separated by less than 10 kb were merged and only BLOCs larger than 5 kb in length were used for data analyses. In the case where several H3K27me3 chip replicates were available for one cell line, BLOCs were identified separately for each replicate. The BLOCs were fused by taking the overlapping BLOC regions and excluding regions that did not overlap. ChIP-seq BLOC finding was performed on 25 bp fragment density maps (see ChIP-seq below) from one MEFF genome data set. The BLOC start was defined by 20–23 windows, where 20 windows show a fragment density >0 and the BLOC end was defined by 6–8 windows where six windows show a fragment density <0. All 25 bp windows with no fragment density were assigned the value −1. The BLOCs program is available at http://genauwiki.imp.ac.at. Gene expression analysis These were based on cDNA hybridization to the NimbleGen chr 17 array relative to genomic DNA (RNA-chip). Informative genes were identified in two steps. First, only genes with at least five tiling array probes in exons were considered informative (750 of the 1282 genes on chr 17). The median signal of probes within exons was calculated. Genes with a value above the median value of all probes with a positive log2 (ChIP/input) value on the array were classified as expressed, while genes with a value below the median were classified as silent. Second, only genes with a coverage of at least eight tiling array probes per 1500 bp over >70% of the gene locus were used for data analyses. In MEFB1 cells this identified 553 of the 1282 genes on chr 17 from one replicate. In MEFF cells we only counted genes with the same expression status in two replicates and analyzed 518 of the 1282 genes on chr 17. Combined analysis of histone modifications and location in genes Focal regions enriched for active histone modifications as defined by ChIPOTle were identified for each hybridization replicate of a respective histone modification. The enriched regions from all hybridizations (Supplemental Table 1) were pooled for each histone modification. Overlap between the enriched regions was determined and those common to all files were pooled and used for data analyses in Figure 2
Z-score analysis We calculated an estimate of whether the observed overlap between two enriched regions for histone modifications was significant by randomly reshuffling the enriched regions 10,000 times to generate an empirical null distribution of the total overlap lengths. Only regions with a probe density >8 probes/1500 bp were used for analysis. The observed total overlap length was expressed as a Z-score relative to this null distribution: for example a Z-score of 10.0 indicates the actual overlap length was 10 standard deviation units higher than the mean of the random overlap length distribution. Distance distribution analysis The distance of the midpoint of each gene to the borders of the closest H3K27me3 BLOC was calculated. The distances were combined into 10 kb distance bins and plotted using MS-Excel. H3K27me3 BLOCs located completely within genes were excluded from the analysis. Genes that were more than 50% overlapped by an H3K27me3 BLOC were put into the “inside” bin (genes in BLOCs). Histone profiling relative to gene expression For each probe, the distance to the transcriptional start site of the genes of each category (expressed, silent, genes in BLOCs, see above) was calculated and normalized to the length of the gene. The normalized log2 (ChIP/input) values of the probes were smoothened by fitting a cubic smoothing spline, with the relative distances as predictors. A randomization and eventual smoothing of all the log2 (ChIP/input) values from the array across the relative distances indicates the background level of the array. ChIP-seq Sequencing libraries were obtained from 10 ng of ChIP DNA by adaptor ligation, gel purification and 18 cycles of PCR. Sequencing was carried out using the Illumina Genome Analyzer (GA) I system according to the manufacturer's protocol. 4468,089 uniquely alignable sequence tags were mapped to the mouse genome (NCBI build 36) using the GA-Pipeline V0.3. Tags passing the standard GA-Pipeline quality threshold and mapping uniquely with not more than two mismatches were used for data analyses. High density map: A theoretical fragment density for each 25 bp window was calculated. Uniquely aligned tags within 200 bp and oriented toward it, were counted as 1 for each 25 bp window and as 0.25 if they were within 200 bp and 300 bp. Low-density map: For each 200 bp window the uniquely aligned tags were counted. Peaks of significant enrichment were identified using the USeq toolkit (http://useq.sourceforge.net/): A sliding window of 1 kb was used to calculate smoothened window scores (ScanSeqs). For each window a Bonferroni corrected P-value was estimated using a global Poisson distribution and windows with a minimum score of 10 (low cut off) or 60 (high cut off) and a maximum distance of 500 bp were combined to enriched regions (EnrichedRegionMaker). Statistical significance
P-values were calculated using an unpaired t-test on http://www.graphpad.com using the mean and the standard deviation. Whole chromosome profiles The average normalized log2 (ChIP/input) ratios of all probes within nonoverlapping 200 kb windows was calculated for each replicate (Supplemental Table 1) of each histone modification examined in MEFF and MEFB1 cell lines. The data of all replicates were combined by averaging the 200 kb windows. Repeat and gene densities (“known genes”) were calculated as percent sequence coverage of 200 kb nonoverlapping windows. The positions of SINE, LINE, LTR repeats, and known genes were obtained from the UCSC genome browser (http://genome.ucsc.edu). HMM-Seg Paired H3K27me3 and RNA chr 17 profiles were analyzed by HMM-Seg with described parameters, except that the data were “smoothened” over 15 kb in Supplemental Figure 7 (Day et al. 2007). The unmodified ChIP/input ratios were averaged over 1 kb nonoverlapping windows and these data sets were used as the HMM-Seg input. In Figure 6 Quantitative analysis of HMM-Seg overlap HMM-Seg converts the whole chromosome into blocks of two states, 0 and 1. Two HMM-Seg results were compared by testing the whole chromosome in 1 kb nonoverlapping windows to determine if both segmentations showed the same state (0 or 1). The percentage of similarity was calculated by dividing the number of windows showing the same state by the total number of windows on chr 17. The result was visualized as pie charts using MS-Excel. Real-time qPCR Primers (Supplemental Table 2) were designed by PrimerExpress and qPCR performed with the ABI PRISM 7000 using MESA GREEN qPCR (Mastermix Plus for SYBR ASSAY- dTTP), with the primers under the following cycling conditions: 2 min 50°C, 10 min 95°C, 40 cycles of 15 s 95°C and 1 min 60°C. ChIP and Mock material were assayed undiluted, while input DNA was diluted 1:100. DNA quantification was made by the standard curve method using serial dilutions of input DNA. Relative quantification and statistics were performed as described in the manufacturer's protocol (Applied Biosystems). Acknowledgments We thank all members of the Barlow group and members of the GEN-AU Epigenetics project for support, Leonie Ringrose and Anton Wutz, for their comments on the paper. Project support was from GEN-AU Epigenetic Plasticity of the Mammalian Genome (GZ200.141/1-VI/2006), the EU-FW6 IP “HEROIC” (LSHG-CT-2005-018883), the NoE “The Epigenome” (LSHG-CT-2004-053433), and FWF SFB F17 Modulators of RNA Fate and Function (SFBF01718 B10). Footnotes [Supplemental material is available online at www.genome.org. The microarray and sequence data from this study have been submitted to Gene Expression Omnibus (GEO) (http://www.ncbi.nlm.nih.gov/geo/) under accession no. GSE11389.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.080861.108. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
Cell. 2007 Feb 23; 128(4):669-81.
[Cell. 2007]Nat Rev Genet. 2008 Mar; 9(3):179-91.
[Nat Rev Genet. 2008]Nat Genet. 2007 Mar; 39(3):311-8.
[Nat Genet. 2007]Proc Natl Acad Sci U S A. 2006 Oct 24; 103(43):15782-7.
[Proc Natl Acad Sci U S A. 2006]Cell. 2007 May 18; 129(4):823-37.
[Cell. 2007]Nature. 2006 May 18; 441(7091):349-53.
[Nature. 2006]Mol Cell Biol. 2007 May; 27(10):3769-79.
[Mol Cell Biol. 2007]Stem Cells. 2008 Jun; 26(6):1496-505.
[Stem Cells. 2008]Nat Cell Biol. 2006 May; 8(5):532-8.
[Nat Cell Biol. 2006]Genes Dev. 2006 May 1; 20(9):1123-36.
[Genes Dev. 2006]Bioessays. 1993 May; 15(5):349-54.
[Bioessays. 1993]Proc Natl Acad Sci U S A. 1990 Oct; 87(19):7757-61.
[Proc Natl Acad Sci U S A. 1990]Bioinformatics. 2007 Jun 1; 23(11):1424-6.
[Bioinformatics. 2007]Mol Cell. 2007 Aug 3; 27(3):353-66.
[Mol Cell. 2007]Genome Biol. 2005; 6(11):R97.
[Genome Biol. 2005]Genome Biol. 2005; 6(11):R97.
[Genome Biol. 2005]Bioinformatics. 2007 Jun 1; 23(11):1424-6.
[Bioinformatics. 2007]Cell. 2007 May 18; 129(4):823-37.
[Cell. 2007]Cell Stem Cell. 2007 Sep 13; 1(3):299-312.
[Cell Stem Cell. 2007]Cell Stem Cell. 2007 Sep 13; 1(3):286-98.
[Cell Stem Cell. 2007]Cell Stem Cell. 2007 Sep 13; 1(3):299-312.
[Cell Stem Cell. 2007]Science. 2004 Mar 12; 303(5664):1626-32.
[Science. 2004]Mol Cell. 2007 Aug 3; 27(3):353-66.
[Mol Cell. 2007]Bioinformatics. 2007 Jun 1; 23(11):1424-6.
[Bioinformatics. 2007]Cell. 2007 May 18; 129(4):823-37.
[Cell. 2007]Proc Natl Acad Sci U S A. 1990 Oct; 87(19):7757-61.
[Proc Natl Acad Sci U S A. 1990]Cell. 2007 Feb 23; 128(4):669-81.
[Cell. 2007]Nat Genet. 2007 Mar; 39(3):311-8.
[Nat Genet. 2007]Nat Rev Genet. 2008 Mar; 9(3):179-91.
[Nat Rev Genet. 2008]Mol Cell. 2007 Aug 3; 27(3):353-66.
[Mol Cell. 2007]Cell. 2006 Apr 21; 125(2):301-13.
[Cell. 2006]Curr Opin Cell Biol. 2007 Jun; 19(3):290-7.
[Curr Opin Cell Biol. 2007]Genome Res. 2006 Jul; 16(7):890-900.
[Genome Res. 2006]Cell. 2007 May 18; 129(4):823-37.
[Cell. 2007]Cell Stem Cell. 2007 Sep 13; 1(3):299-312.
[Cell Stem Cell. 2007]Cell Stem Cell. 2007 Sep 13; 1(3):286-98.
[Cell Stem Cell. 2007]Cell Stem Cell. 2007 Sep 13; 1(3):299-312.
[Cell Stem Cell. 2007]Nature. 2006 May 18; 441(7091):349-53.
[Nature. 2006]Mol Cell Biol. 2007 May; 27(10):3769-79.
[Mol Cell Biol. 2007]Mol Cell Biol. 2008 Feb; 28(3):1041-6.
[Mol Cell Biol. 2008]Nat Genet. 2006 Jun; 38(6):700-5.
[Nat Genet. 2006]Genome Res. 2006 Jul; 16(7):890-900.
[Genome Res. 2006]Genes Dev. 2005 Feb 15; 19(4):431-5.
[Genes Dev. 2005]Genes Dev. 2006 Aug 1; 20(15):2041-54.
[Genes Dev. 2006]Proc Natl Acad Sci U S A. 1990 Oct; 87(19):7757-61.
[Proc Natl Acad Sci U S A. 1990]Bioessays. 1993 May; 15(5):349-54.
[Bioessays. 1993]EMBO J. 2005 Feb 23; 24(4):800-12.
[EMBO J. 2005]Cell. 2007 May 18; 129(4):823-37.
[Cell. 2007]Genome Res. 2006 Jul; 16(7):890-900.
[Genome Res. 2006]Genome Res. 2007 Jun; 17(6):917-27.
[Genome Res. 2007]Blood. 2008 Nov 1; 112(9):3889-99.
[Blood. 2008]Chromosoma. 2007 Apr; 116(2):147-57.
[Chromosoma. 2007]Curr Opin Genet Dev. 2008 Apr; 18(2):188-92.
[Curr Opin Genet Dev. 2008]Mol Cell. 2007 Aug 3; 27(3):353-66.
[Mol Cell. 2007]Genome Biol. 2005; 6(11):R97.
[Genome Biol. 2005]Bioinformatics. 2007 Jun 1; 23(11):1424-6.
[Bioinformatics. 2007]