(A) Tree-view representation of the hierarchical clustering of histone H3 K27 trimethylation in HSF1 (hESC1), HSF6 (hESC2), two human fibroblast lines (hFibr1 and -2), and late-passage hiPSCs (l-hiPSC1 and l-hiPSC18) across the promoter regions of all genes considered significantly differentially methylated between fibroblasts and hESCs (p = 0.05). Genes were classified as E (hESC-like, 950 genes), N (neutral, 19 genes), or F (fibroblast-like, 9 genes) based on the similarity of the methylation patterns in l-hiPSCs with hESCs or fibroblasts. For N and F class genes, the y axis is scaled 33 to make the methylation patterns visible. Each row represents the −5.5 kb to +2.5 kb region with respect to the transcription start site (TSS). The 8 kb promoter regions are divided into sixteen 500 bp regions displayed in pseudocolor based on the average log ratio of the IP to input probe signal intensity. Probes within a given 500 bp region are averaged. Dark gray coloring indicates missing values for enrichment due to the lack of probes. Expression levels for the represented genes are shown on the right for hESCs, e-hiPSCs, and l-hiPSCs relative to fibroblasts.
(B) Table showing the overlap between genes shown in (A) (which demonstrate differences in H3K27 methylation between hESCs and fibroblasts), grouped according to the methylation pattern in l-iPSCs, and the late and common hiPSC expression signature genes as defined in Figures 1C and 1D (these genes are differentially expressed between l-hiPSCs and hESCs). *p value = 0.0063.
(C) Histograms detailing the differences of H3K27me3 patterns between hESC and fibroblasts, measured as the Euclidean distance across the sixteen 500 bp regions of the promoters for the indicated gene sets. The black outlined bars denote the distribution of Euclidean distances for all genes on the array (17,000), while the red outlined bars show the distribution for the indicated subset of signature genes.
(D) As in (C) but for histone H3K4 trimethylation. Note: hESC signature genes must undergo a significantly larger degree of change in both H3K4me3 (**p value = 0.005) and H3K27me3 (***p < 10−7) than the global population of genes. None of the distributions for the other subsets of genes are significantly different from the global population.