![]() | ![]() |
Formats:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright : © 2007 Lin et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Whole-Genome Cartography of Estrogen Receptor α Binding Sites 1 Genome Institute of Singapore, Singapore, Republic of Singapore 2 Department of Molecular and Integrative Physiology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America 3 Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Republic of Singapore Stuart K Kim, Editor Stanford University School of Medicine, United States of America #Contributed equally. * To whom correspondence should be addressed. E-mail: liue/at/gis.a-star.edu.sg Received October 9, 2006; Accepted April 17, 2007. This article has been cited by other articles in PMC.Abstract Using a chromatin immunoprecipitation-paired end diTag cloning and sequencing strategy, we mapped estrogen receptor α (ERα) binding sites in MCF-7 breast cancer cells. We identified 1,234 high confidence binding clusters of which 94% are projected to be bona fide ERα binding regions. Only 5% of the mapped estrogen receptor binding sites are located within 5 kb upstream of the transcriptional start sites of adjacent genes, regions containing the proximal promoters, whereas vast majority of the sites are mapped to intronic or distal locations (>5 kb from 5′ and 3′ ends of adjacent transcript), suggesting transcriptional regulatory mechanisms over significant physical distances. Of all the identified sites, 71% harbored putative full estrogen response elements (EREs), 25% bore ERE half sites, and only 4% had no recognizable ERE sequences. Genes in the vicinity of ERα binding sites were enriched for regulation by estradiol in MCF-7 cells, and their expression profiles in patient samples segregate ERα-positive from ERα-negative breast tumors. The expression dynamics of the genes adjacent to ERα binding sites suggest a direct induction of gene expression through binding to ERE-like sequences, whereas transcriptional repression by ERα appears to be through indirect mechanisms. Our analysis also indicates a number of candidate transcription factor binding sites adjacent to occupied EREs at frequencies much greater than by chance, including the previously reported FOXA1 sites, and demonstrate the potential involvement of one such putative adjacent factor, Sp1, in the global regulation of ERα target genes. Unexpectedly, we found that only 22%–24% of the bona fide human ERα binding sites were overlapping conserved regions in whole genome vertebrate alignments, which suggest limited conservation of functional binding sites. Taken together, this genome-scale analysis suggests complex but definable rules governing ERα binding and gene regulation. Author Summary Estrogen receptors (ERs) play key roles in facilitating the transcriptional effects of hormone functions in target tissues. To obtain a genome-wide view of ERα binding sites, we applied chromatin immunoprecipitation coupled with a cloning and sequencing strategy using chromatin immunoprecipitation pair end-tagging technology to map ERα binding sites in MCF-7 human breast cancer cells. We identified 1,234 high quality ERα binding sites in the human genome and demonstrated that the binding sites are frequently adjacent to genes significantly associated with breast cancer disease status and outcome. The mapping results also revealed that ERα can influence gene expression across distances of up to 100 kilobases or more, that genes that are induced or repressed utilize sites in different regions relative to the transcript (suggesting different mechanisms of action), and that ERα binding sites are only modestly conserved in evolution. Using computational approaches, we identified potential interactions with other transcription factor binding sites adjacent to the ERα binding elements. Taken together, these findings suggest complex but definable rules governing ERα binding and gene regulation and provide a valuable dataset for mapping the precise control nodes for one of the most important nuclear hormone receptors in breast cancer biology. Introduction Cellular transcriptomes are dictated by complex interactions between signal transduction pathways, general and specific transcription factors, chromatin remodeling proteins, and the RNA polymerase complexes. Precise transcriptional responses are achieved in part by targeting transcription factor complexes to the cis-regulatory regions of target genes via specific binding site sequences. The importance of these binding site sequences is reflected in the conservation of sequence motifs in coregulated genes and through evolution. A number of studies have examined transcriptomic changes to breast cancer cells following estrogen treatment [1–7]. Estrogen receptors (ERs) (specifically ERα and ERβ) are ligand-dependent transcription factors that mediate cellular responses to hormone exposure in vertebrate development, physiological processes, and endocrine-related diseases. ERα, in particular, has been implicated in the etiology of breast cancer and is a major prognostic marker and therapeutic target in disease management [8]. At the molecular level, ERs interact either directly with genomic targets encoded by estrogen response elements (EREs) (5′-GGTCAnnnTGACC-3′) or indirectly by tethering to nuclear proteins, such as AP-1 and Sp1 transcription factors [9–11]. The mechanisms of ER binding site specificity, however, are not clear since these binding site sequence motifs are ubiquitous in the genome, and there is no discernable difference between functional and nonfunctional sites by computational modeling approaches. This ambiguity is likely due to a lack of systemic information on binding site usage and architecture and mechanistic complexity involving additional transcription factors and epigenetic modifications [12,13]. Chromatin immunoprecipitation (ChIP) assays have facilitated characterizations of in vivo protein-DNA complexes such as histone modifications and recruitment of transcription factor complexes to specific binding sites [14]. Coupled with microarray technology, the ChIP-on-chip experiments have resulted in more global binding site maps for a number of human transcription factors in proximal promoters and in specific genomic regions. For example, Carroll and colleagues recently mapped ER binding sites first in human Chromosomes 21 and 22 and more recently across the entire human genome [15,16]. In spite of the success of ChIP-on-chip studies, there remain caveats regarding probe specificity and performance, including constraints on probe design in certain genomic regions and potential biases introduced by amplification protocols. Thus, the analysis of ERα binding sites using alternative genome-wide technologies is warranted. Previously we developed a high throughput cloning and sequencing approach for mapping full-length transcripts. By employing specialized cloning techniques and vectors, paired-end diTags (PETs) from ends of transcripts and this gene identification signature (GIS) can be sequenced and mapped precisely to the genome [17]. The GIS-PET technique increases sequencing efficiency by 30-fold as compared to sequencing the entire transcript insert. We subsequently showed that binding site fragments from ChIP experiments can also be subjected to PET analysis to generate an unbiased whole-genome map of p53 tumor suppressor protein binding sites and demonstrated an association of binding sites and adjacent target genes with p53 functions in patient tumor samples [18]. The PET technology has also been utilized to study OCT4 and Nanog binding sites in stem cell biology [19]. To obtain a global map of ERα binding sites in breast cancer cells, we applied ChIP-PET to generate a library of ERα binding sites in MCF-7 cells following estrogen treatment. We then combined the binding site data with hormone-responsive and breast tumor sample microarray gene expression studies to discern correlations between ER binding, transcriptional activity, and disease status and outcome in patients. We also compared our findings with data from ChIP-on-chip studies to evaluate the performance of each respective technology. Herein, we describe the comprehensive cartographic results and outline the insights they provide into binding site usage and molecular mechanisms of ER transcriptional regulatory functions. Results ChIP-PET Analysis Mapped ER Binding Sites across the Human Genome Hormone-deprived MCF-7 cells were treated with 10 nM estradiol for 45 min, and then DNA-bound receptor complexes were isolated through ChIP using anti-ERα antibodies (HC-20, Santa Cruz Biotechnology, http://www.scbt.com). Prior to generating the PET library for sequencing, we qualified the ChIP products by measuring DNA fragment size and enrichment of known ER binding site in the pS2/TFF1 gene promoter after immunoprecipitation to ensure sample quality. The ChIP DNA fragments ranged from 300 bp to 1 kb, and there was an 80-fold enrichment of ER binding at the known pS2/TFF1 ERE as compared to the irrelevant antibody control. Once ChIP DNA quality had been confirmed, the PET library was constructed and sequenced as described previously [18]. A total of 635,371 PETs were sequenced, of which 361,241 (~56.86%) were unambiguously mapped to unique loci in the human genome (hg17/NCBI build 35) and localized to 136,158 distinct genomic coordinates. One of the first questions we asked was whether the sequencing of these 635,371 clone “equivalents” in the form of pair-end tags provided sufficient representation of the chip library. To assess the degree of saturation of the PET library sequenced (the total number of distinct ChIP DNA fragments that can be captured from the library), we fitted a Hill Function [20] using extrapolated and historical sequencing data (see Figure S1). The degree of saturation of the ER ChIP PET library is estimated at 73.24% (136,158 actual/185,915 expected), suggesting that ~73% of the extrapolated hypothetical limit of coverage by our library was sequenced. Sequencing results are summarized in Figure 1
ERα binding regions defined by ChIP-PET are located in every chromosome in the human genome, with the exception of the Y chromosome, which is not present in MCF-7 cells derived from a female breast cancer patient (see Figure S3A and S3B). When regions of gene amplification are accounted for, the frequency of binding clusters per chromosome generally corresponds to the size and gene density of the chromosome, and ER does not appear to localize to specific chromosomal regions within the genome. Figure S3C shows ER binding clusters distribution relative to the nearest University of California Santa Cruz (UCSC) Known Genes (KG) (see Materials and Methods). Binding regions were mapped to the precise positions relative to the 5′ and 3′ ends of transcripts in the UCSC KG database. Only 5% of the regions map to the proximal promoter regions, defined as 0–5 kb upstream of the transcriptional start site (TSS), where the vast majority of current known EREs have been identified and characterized thus far. The largest fraction (38%) of binding regions map to intragenic regions of transcripts and are localized to introns, whereas 23% are within 100 kb from the 5′ start sites, and 19% are within 100 kb of 3′ polyadenylation sites. Only 20% of the ER binding regions are located in gene deserts where the nearest KG is >100 kb away. These findings initially suggest that DNA-bound ER can interact with the transcriptional machinery through both proximal- and distal-acting mechanisms, and these interactions are not likely to be limited by binding site orientation (5′ or 3′) relative to the TSSs. Intriguingly, functional ER binding sites were rarely in exons and when in exons were in probable untranslated regions. We did not detect any binding regions that mapped to a protein-coding domain of a transcriptional unit. These observations further suggest a dynamic selection of ER binding sites that excludes exonic regions and raise the possibility that transcription factor binding sites (TFBSs) in exons may undergo negative selection during evolution. The ERE Sequence Motif Is Enriched in Validated ChIP-PET Binding Sites As a preliminary assessment on the fidelity of the 1,234 high quality binding regions, we considered presence of putative ERE motif as a proxy for a real binding event. A total of 13 base pair sites that were at most two Hamming distance away (i.e., two base deviation) from the consensus ERE (GGTCA-nnn-TGACC) were called putative EREs. Upon scanning the 1,234 binding regions, we discovered that 884 (~71%) binding regions contained at least one ERE-like sequence, 25% encoded putative half-ERE sites, and the remaining 4% bore no ERE sequence motifs whatsoever (Figure 2
Out of the ten clusters that failed validation, eight of the loci that were misclassified as true binders belonged to the moPET 3 category, which would be considered in the borderline confident range; one was a moPET 4 cluster, whereas a false positive in a moPET7 was located in an amplified region of the MCF-7 genome on Chromosome 20 (which we have shown overestimates the binding efficiency of a DNA fragment to ER). When adjusted for the frequency of full, half, and no ERE motifs in the PET-defined binding loci, our validation rate for binding calls is 94%. These validation results are in line with our previous whole-genome ChIP-PET analysis of p53, Oct4, and Nanog binding sites [18,19] and compares favorably with other genome-wide technologies such as ChIP-on-chip. To examine whether ER ChIP-PET binding sites containing putative full EREs harbor transcriptional enhancer activities, we PCR amplified 11 binding sites from MCF-7 genomic DNA and cloned them upstream of the pGL4-TATA luciferase reporter. For negative and positive controls we used pGL4-TATA and pGL4-2ERE-TATA, respectively. These constructs were transiently transfected into hormone-depleted MCF-7 cells, treated with either ethanol or E2 for 18–24 hours, and then assayed for luciferase activity. A total of eight out of 11 ER ChIP-PET binding site reporter constructs tested were E2 responsive (Figure 4
Comparative Analysis of ChIP-PET and ChIP-on-Chip Data Reveals Differential Binding Site Discovery by These Technologies Other technologies have been used to map ER binding sites on a genomic scale. Carroll et al. using ChIP and human genome tiling arrays have mapped ER binding sites in Chromosomes 21 and 22 and across the human genome in MCF-7 cells [15,16]. We first compared our mapped ER binding sites to the previously published results in Chromosomes 21 and 22 [15]. In the 1,234 binding sites mapped by ChIP-PET, we detected 36 ER binding sites in these chromosomes and found that 20 binding sites were identified by both techniques (Figure 5
It is likely that the difference between the two platforms are due to lower affinity ER binding sites being more susceptible to constraints and limitations inherent in the detection technologies and to possible differences in biological handling of cell lines in each study. Moreover, there appears to be content differences between the discovery capacity of the two technologies. An inherent disadvantage of the ChIP-on-chip approach is that arrays mask sites that contain repetitive sequences, whereas the output of the ChIP-PET technology is completely unbiased in regard to the presence or absence of repetitive sequences. To this point, we found that ~27.9% of the base pairs in bona fide binding sites discovered by the pair-end tag approach were associated with repetitive sequences, whereas 5.3% of those in Carroll et al. bore repeats. Taken together, these results further suggest that the ChIP-PET and the ChIP-on-chip are complementary methods for the identification of TFBSs in a whole genome manner. Integration of Binding Site and Gene Expression Data Indicates Diversity in Regulatory Region Architecture and Transcriptional Response The binding site map denotes ER transcriptional regulatory potential for a large number of genes. To determine the specific transcriptional responses corresponding to estrogen treatment and ER recruitment to cis-regulatory sites in MCF-7 cells, we performed gene expression profiling experiments using Affymetrix U133 microarrays on a time course following estradiol exposure. Differentially expressed genes were selected based on a q-value cut-off of less than 2% using a stringent significance analysis of microarrays analysis algorithm. We identified 802 probe sets, representing 544 unique named genes, whose expression levels were up-regulated in response to 10 nM E2 treatment for 12, 24, or 48 h, and 1,168 probe sets corresponding to 704 unique named genes, were down-regulated following hormone treatment. When combined with the ER binding site mapping data (within 100 kb of and closest to high quality binding regions identified by ChIP-PET), 171 up-regulated genes and 116 down-regulated genes were associated with high confidence ER binding sites. Table S2 contains the complete listing of estrogen responsive genes with adjacent ER binding sites identified in this study. We next examined whether there was a preference for positioning of the ER binding sites in up-regulated versus down-regulated genes. Our analysis revealed that there was a statistically significant association between the presence of an ER ChIP-PET cluster near an up-regulated gene and an under-representation of ER ChIP-PET clusters associated with down-regulated genes (p = 8.379e−08) (see Table 1). The over-representation of ER ChIP-PET clusters that can be associated with E2-up-regulated genes is particularly noticeable at the early 12-h time point. That more binding site clusters are associated with E2 up-regulated genes (60%) compared with down-regulated genes (40%) suggests that the ER protein more frequently is directly involved in the transcriptional regulation of E2 up-regulated genes as compared to down-regulated genes. This finding is in concordance with a previous study we conducted in T-47D human mammary carcinoma cells, where we found that out of 89 genes identified as direct target genes (i.e., E2-responsive and cycloheximide insensitive), 59 (66.3%) were up-regulated and only 30 (33.7%) were down-regulated by E2 treatment [6].
To further confirm the observed association of binding sites with the transcriptional response, we examined the association of the 107 sites validated by conventional ChIP qPCR to E2 regulated gene expression data. Of the 107 sites tested, 22 sites were found to be associated with an E2-regulated probe from the Affymetrix dataset. Out of these 22 sites, 16 were up-regulated by E2 whereas six were down-regulated by E2-treatment. The 16 sites associated with E2 up-regulated probes all showed high levels of enrichment (~25–473×) when analyzed by ChIP qPCR, whereas the six sites associated with down-regulated genes showed lower levels of enrichment (~1–25×) (Figure 6
Exploring the association between ER binding sites and the directionality of the transcriptional response further, we mapped the locations of the binding sites relative to the start and termination sites of E2 up- and down-regulated genes as assessed by time course studies using Affymetrix expression arrays. As shown in Figure 6 Genes Adjacent to ER Binding Sites Are Associated with Breast Cancer Biology We posited that the genes adjacent to the ER binding sites identified by ChIP-PET are putative ER target genes and should reflect ER function in vivo. To assess this possibility, we examined whether the behavior of the collection of adjacent genes could determine the ER status of human breast cancers. All genes within 100 kb of an ER binding site or with an ER binding cluster within an intron were used to cluster 251 breast tumors from a cohort from Uppsala, Sweden previously analyzed for gene expression using Affymetrix U133 A and B arrays [23]. Our results showed that this proximate gene list could easily segregate ER-positive and ER-negative breast tumors (see Figure 7
There Is Limited Conservation of ER Binding Site Sequences and Specific ERE-Like Motifs We have previously shown that ERE sequences in promoter regions of putative ER target genes are not highly conserved, even though both conserved and nonconserved sites appear to be involved in ER binding [6]. Carroll et al., however, indicated conservation of ER binding sites based on their analysis of binding sites discovered in Chromosomes 21 and 22 [15]. To resolve this apparent difference in our observations, we performed comparative analysis of binding site regions and ERE motifs across species using this genome-wide dataset. The overall conservation of a binding region is measured using the base-by-base conservation score and the presence of conserved elements (PhastCons score and PhastCons Conserved Elements [24]), (see Materials and Methods). Using this approach, although a clear conservation signal is visible, as compared to a randomly generated set of regions, the actual proportion of binding regions that are conserved is hidden. To analyze this conservation further, we examined the presence of PhastCons Elements in the ER binding sites. Surprisingly, only 273 (22%) of the initial 1,234 binding regions overlap with such conserved elements, but partitioning the binding sites using this criterion showed that these 22% carry most of the conservation signal (Figure 8
Since TFBS motifs are short, typically 10–20 bp, they may not necessarily be located in a conserved region as detected by standard algorithms. We sought, therefore, to identify ER binding sites with ERE-like sequences and determine whether the specific ERE-like motifs are conserved in homologous regions in chimpanzee, mouse, and dog regardless of conservation of surrounding sequences. We extracted the sequences associated with the 1,234 binding regions in human (hg17) and identified the corresponding homologous regions in chimpanzee (panTro1), mouse (mm5), and dog (canFam2) using the tool liftOver (UCSC Genome Browser utility tool, http://genome.ucsc.edu/cgi-bin/hgLiftOver). We then scanned for the presence of consensus EREs with up to two mutations. Using optimized 500-bp windows in human and chimpanzee and 1-kb windows in mouse and dog (see Table 2), 754 of the 1,234 human binding regions contained a full ERE. As expected, the vast majority (698 or 93%) of the homologous binding windows in chimpanzee also contain ERE-like sequences. The ERE motif is also found in 283 (38%) of the mouse homologous regions and in 357 (47%) of the dog windows. Because the ERE-like sequences are common in any genome, we considered a site bearing a conserved motif only if the ERE is retained in chimp, mouse, and dog. Using this more stringent criterion, we found 179 (24%) of the sites bearing motif conservation in all four species with a background of ~7%. This suggests that ~17% of the sites are under positive selection. Taken together, the sequence and motif conservation results indicate that the majority of binding sites identified in this study are poorly conserved between primates and other mammalian species, and the conservation of binding sites reported previously [15,16] likely resulted from a minority (22%–24%) of highly conserved sites when assessed by multiple conservation metrics. We should note that the actual conservation of binding sites may be higher than observed due to alignment errors [25]. Even with adjustment for this potential error, however, there will likely be a large number of nonconserved ER binding sites.
ER Binding Regions Are Enriched for Other TFBS Motifs Though the ERE appears to be the dominant recognition sequence for ER on DNA, other transcription factors and their binding sites are also involved in directing ER to their specific target sites. Indeed, Carroll and colleagues discovered an enrichment of forkhead binding site motifs within ER binding regions of human Chromosomes 21 and 22 and demonstrated a role for the FOXA1 transcription factor in facilitating ER's ability to bind EREs and regulate target gene expression. In other nongenomic based studies, ER is also known to bind DNA indirectly through interactions with Sp1 and AP-1 transcription factors [9,10]. To determine the presence of additional TFBS motifs in the 1,234 ChIP-PET binding sites across the genome, we analyzed the 1,234 cluster sequences for putative TFBS based on TRANSFAC (professional version 9.1) using the accompanying MATCH program [26] with the “minimize False Positive” setup. To compute the statistical significance, we generated a background sequence set, matching the total length of 1,234 clusters, using a third order Markov Chain sequence model trained on the whole human genome (hg17) and scanned them similarly for putative TFBS. This was done 1,000 times. For each TFBS matrix, the average number of sites found per nucleotide represents the background probability of finding its putative sites. The p-values were computed under the binomial distribution and were adjusted for multiple hypotheses testing using the conservative Bonferroni correction. Table 3 lists the top matrices (see Supporting Information for additional details of the analysis).
As expected, the predominant sequence motif enriched in ER binding sites is the ERE, Interestingly, however, a large number of transcription-factor binding site motifs are statistically enriched in these ER binding regions even when corrected for multiple sampling suggesting that ER may interact extensively with other transcription factors at the DNA sites. Previous investigations have shown that FOXA1 bound to forkhead binding site motifs adjacent to EREs and interacted with ER, as do AP-1 and Sp1. We also found FOXA1, AP-1, and Sp1 binding motifs significantly associated with the GIS-PET clusters. To further assess the functional significance of detected binding sites, we performed RNA interference experiments using small interfering RNA (siRNA) to knock-down Sp1 and then examined the expression of ten estrogen-responsive genes, which from our data were found to have adjacent ChIP-PET ER binding sites and predicted Sp1 binding sequences (Figure 9
Using these specific validated associations as reference points to assess the relative importance of other associated binding site motifs, we found that 46 other transcription factors are as significantly associated with the 1,234 GIS-PET binding sites as these three validated cis-partners of ER. This suggests that a wider range of transcription factors may partner with ER at the site of DNA binding than previously thought. Given these findings, we next asked whether there was discernable structure within the ER binding sites relative to the transcription factor response elements that are significantly over-represented. To this end, we assessed whether some motifs were nonrandomly distributed within a 500-bp window encompassing the center of the PET defined binding site or centered on the main ERE or half ERE (Tables S3, S4, and S5). Our results show that factors such as CDX, PAX, AP-1, SF1, and MAF are distributed within these sites in a nonrandom fashion. To dissect the anatomy of these adjacent binding motifs, we examined the specific position of these motifs vis-à-vis the central ERE (Figure 10
It has been determined that ER can interact with AP-1 and Sp1 factors to regulate gene expression through a tethering mechanism where the DNA binding moiety is AP-1/Sp1. In our genome-wide analysis, we asked whether our ER binding sites without discernable EREs had a predominant transcription factor binding motif. Our results show that predominantly forkhead transcription factors, followed by SRY recognition sequences are significantly enriched in these regions (Table 5). AP-1 sites, though not on the enriched list is however very similar to the MAF recognition sequences, which appear as borderline significant after SRY. Since AP-1 can bind to MAF sites, AP-1 involvement in these purely tethered sites is projected. Thus, surprisingly, ER binding sites without EREs appear highly enriched for recognition motifs for the forkhead family of transcription factors and above that of the known AP-1 interacting factors.
Discussion Whole genome analysis of transcription factors provides an unbiased view of their regulatory dynamics. Here we present a genome-wide analysis of the DNA binding sites of ERα as present in the MCF-7 breast cancer cell line and map these sites to transcripts regulated by estrogen. We used a cloning and sequencing based technology and identified 1,234 high probability binding sites using an algorithm that minimizes false positives from amplified regions of the genome. That 94% of a sample of these sites could be validated by standard ChIP suggests that the majority of the 1,234 sites identified by ChIP-PET represent bona fide binding regions for ERα. Of note is that 96% of the validated binding sites harbored either full ERE-like (71%) or solely half-ERE motifs (25%). Only 4% had no ERE-like sequences detectable using a two-position degeneracy cut-off, and therefore a pure tethered mechanism of ER transcriptional regulation must occur infrequently. This dispersed nature of these 1,234 sites vis-à-vis the TSSs makes the direct molecular assessment of whether these adjacent genes can be regulated by ER highly impractical. We sought to resolve this problem by examining the clinical behavior of these genes adjacent to ER binding sites. We posited that if these adjacent genes were under ER regulation, then their expression in breast cancers should readily determine ER status of primary breast cancers. Our results using a cohort of 251 breast cancers showed that these putative ER regulated genes can significantly separate ER status in breast tumors and therefore represent a transcriptional regulatory cassette that appears to affect ER response. We further examined this question by studying the behavior of these genes in MCF-7 cells as assessed using expression arrays. Though only 23% of the genes proximal to an adjacent ER binding site are responsive following estrogen treatment, this represents a significant enrichment of bona fide ER binding sites adjacent to estrogen responsive versus unresponsive genes (p = 3.018433e − 51) (unpublished data). Therefore, our in vitro and in vivo probabilistic analysis all point to the biological significance of the ER binding sites identified by our ChIP-PET analysis in the regulation of gene expression by ER.It is important to note the ability of ChIP-PET to identify, in an unbiased manner, bona fide ER binding sites among nearby EREs predicted only by computational methods. For example, for the carbonic anhydrase XII (CA12) gene, matrix-based computational approaches used to identify potential cis-regulatory elements directing ER-regulation of CA12 indicate that five putative EREs reside in the proximal 5′ 5 kb with an additional ERE found in the first intron of the gene. However, we have identified a moPET 5′ binding site approximately 6 kb 5′ to the gene, which was found to be the major regulatory site directing ER-mediated transactivation as a distal enhancer (D.H. Barnett and B.S. Katzenellenbogen, unpublished data). CA12 mRNA is up-regulated by estradiol in MCF-7 breast cancer cells [3,32] and in other ER-positive cells [33] and is positively associated with ERα status in primary breast tumors [34]. Hence, our findings highlight the ability of ChIP-PET to identify previously undiscovered enhancers of biologically relevant target genes. Much of the research of ER transcriptional regulation has focused on a few EREs located within the proximal promoter. We have shown with our global binding site data that, in fact, the vast majority of sites are located in distal or intragenic regions relative to the nearest regulated transcripts. Our genome wide analysis confirmed the more limited observations previously seen in Chromosomes 21 and 22 that only a small portion (5%) of the binding sites are within 5 kb of the TSS and consistent with our previous predictions [6,35]. Intriguingly, however, detailed analyses revealed that the statistical preponderance of genes responsive in MCF-7 cells to E2 adjacent to ChIP-PET identified ER binding sites were up-regulated rather than down-regulated. Moreover, the location of these sites next to E2 induced genes showed an obvious enrichment around the TSS both in prestart locations and in 5′ introns and within 50 kb from the TSS (Figure 6 Equally intriguing is the possibility that ER-mediated gene repression may use mechanisms very different than gene induction, and that genomic topography (i.e., binding site location and affinity) may have a significant role. Consistent with this is our quantification study using ChIP-qPCR on ChIP-PET identified ER binding sites where genes repressed by ER uniformly had ER binding sites that had the lowest fold induction after E2 exposure (~1–25), as compared to those binding sites adjacent to induced genes (~25–473), and were less likely to harbor a full ERE-like motif. When all sites are taken into account and measured by the number of overlapping PETs (moPETs) up-regulated genes have significantly higher moPET counts than down-regulated genes (p = 4.575e − 4, unpublished data). Moreover, in reporter assays performed with the 11 candidate ER binding sites, the only three that did not induce transcription off a TATA promoter were sites associated with repressed genes. These observations are consistent with previous findings that deviations from the full ERE motif reduces ER binding affinity and that the binding site dynamics may differ in genes that are induced by ER than those repressed by ER [11]. The large number of bona fide and nonproximal ER binding sites reported here represents ideal candidates for further characterization of these distinct mechanisms.It is known that ER can regulate gene expression not by direct DNA binding but through association with an intermediary transcription factor such as AP-1. Theoretically, this mechanism of ER transcriptional regulation does not require ER binding to an ERE. Our motif searches in these non-ERE sites revealed that the predominant motifs in the pure tethered bin are those for the forkhead transcription factors, SRY, with MAF reaching borderline statistical significance (p = 0.056). MAF recognizes sequences related to the AP-1 target site and are considered as part of the larger AP-1 family of transcription factors and, therefore, our results suggest that AP-1 and MAF can bind to these sites [36]. The interesting observation is that in the absence of a minimum of an ERE half site, the fold enrichment of ER binding in these sites is lower (a median of 51-fold enrichment of binding as compared to 81-fold for ERE; Student's t-test p = 0.027). Moreover, our analysis of AP-1 sites within EREs show perfect orientation with one half site with similarities to AP-1 recognition sequences, and the second (cognate) half site primarily an ERE recognition sequence. These “hybrid” sites show higher levels of ER binding. This suggests that AP-1–associated tethering may favor sites with ERE half-site “anchors.” Indeed, previous analysis of the ERE half site associated with the AP-1 site found in the progesterone receptor promoter showed that the integrity of the ERE half site is required for ER and AP-1 binding and estrogen responsive promoter activity [37]. These genome wide approaches to nuclear hormone receptor binding sites are revealing in that the large number of validated binding sites provide statistical power in assessing underlying motif structure in the binding sites. The results of our motif search analysis also point to the potential involvement of a number of other transcription factors participating in ER transcriptional regulatory activity. Included in the list of putative transcriptional coregulators is FOXA1, which has been previously shown to be required for ER functions [15]. However, the fact that 46 other factors are enriched in the ER binding sites with the same probability as the proven interactors of FOXA1, AP-1, and Sp1 suggests the potential for highly complex interactions. Of course not all cis-partner transcription factors will be expressed in every cell type. But though it would be highly improbably that each co-occurrence will predict binding by both factors, our analysis of Sp1 action on ten estrogen-responsive genes with adjacent ChIP-PET ER binding sites and predicted Sp1 binding sequences showed down-regulation of all ten. Moreover, we have validated the effect of adjacent GATA3 and BACH interactions in ER binding to EREs (J. Thomsen and E.T. Liu, unpublished data). This suggests that our algorithms to predict adjacent transcription-factor binding are potentially highly accurate. Perhaps even more interesting is the systematic order of these potential partner transcription factors relative to the position of the central ERE in bona fide ER binding sites. Consistent with the model where AP-1 binding appears “anchored” by an adjacent ERE is that AP-1 is distributed in a nonrandom manner within a 500-bp window of an ER binding site. In this distribution, the sequence of a number of full EREs are actually composite binding elements with an AP-1 site posing as an ERE half site. These composite EREs are seen with SF-1, MAF/BACH, AP-1, and PAX2 and PAX3. All these factors have recognition sequences that overlap with (but are distinct from) the ERE half site. Unexpectedly, highly skewed positioning was found with the SF-1 and PAX3 recognition sequences (Figure 10 We have previously reported very little conservation of ERE motifs within promoter regions of human and mouse genes even though conserved and nonconserved sites both bind ER [6]. In the promoter regions of putative direct target genes, approximately 6% of predicted EREs were conserved in the mouse. In contrast, Carroll and colleagues reported conservation in sequences flanking ER binding sites they experimentally mapped to human Chromosomes 21 and 22 and in their whole-genome study [15,16]. To reconcile these apparent differences, we examined the 1,234 ChIP-PET ER binding sites and determined conservation in both flanking sequences and detected ERE motifs. Using similar analytical approaches as those used by Carroll et al., we also find evidence of conservation within the 500-bp windows around the discovered binding sites. However, a more in-depth analysis showed that the conservation signal observed was driven by only 22% of all sites tested. There was limited conservation regardless of whether local sequence similarity or presence of an ERE motifs were used as the metric for conservation (Table 2). Thus, the conservation also observed by Carroll et al. is likely due to a small number of highly conserved sequences and does not represent global conservation of binding sites [15,16]. We have noted that the conservation may be underestimated due to alignment errors in the comparative analysis of whole-genome sequence data [25], but these errors will not fully explain the large number of nonconserved sites. The list of genes with conserved ER binding sites does not appear to have functional coherence (unpublished data). Genes classically thought of as prototypes of ER responsiveness, such as pS2/TFF1, and the progesterone receptor have bona fide ER binding sites in the human MCF-7 cell line that are not conserved by sequence or motif presence across mammalian species. Moreover, both conserved and nonconserved sites are associated with ER-regulated genes. A total of 287 of all 1,234 binding sites (23.3%) are associated with ER-regulated genes, while 63 of the 273 conserved binding sites (23.1%) are associated with regulated genes (not significantly different). The limited conservation of ER binding sites does not imply that the genes that are important in ER function are not regulated by ER, but that the precise DNA targets may differ. Given the distance of 100 kb, in which an occupied ERE can potentially regulate its associated promoter, there is much flexibility in the placement of ER regulatory elements. Nevertheless, these observations indicate that there are likely species-specific differences in the components and the dynamics of estrogen action and that results from animal studies need to be interpreted with this caveat in mind. In summary, our work provides a new cartography of ER binding on a genome-wide scale. The collective configuration of these binding sites has revealed fundamental rules that describe the characteristics of a bona fide ER recognition motif. The dominance of the ERE, the distributed nature of the binding sites distant to their associated genes, the separate nature of up- versus down-regulated genes, the importance of adjacent binding motifs of other transcription factors, and the frequency of composite ER response elements are all findings that would have been difficult to assess on a gene-by-gene basis. Data from this work will provide the experimental targets that will further dissect the intricacies of ER transcriptional regulation. Materials and Methods Cell culture and treatments. MCF-7 cells were grown to 80% confluence in D-MEM/F-12 (Invitrogen/Gibco, http://www.invitrogen.com) supplemented with 10% FBS (Hyclone, http://www.hyclone.com). Cells were washed with PBS and incubated in phenol red-free D-MEM/F-12 medium (Invitrogen/Gibco) supplemented with 0.5% charcoal-dextran stripped FBS (Hyclone) for 24 h in preparation for 17β-estradiol (E2; Sigma, http://www.sigmaaldrich.com) treatment. ChIP. Estrogen-deprived MCF-7 cells were treated with 10 nM E2 for 45 min prior to the ChIP procedures. ChIP was carried out as described previously [6] using the HC-20 anti-ERα antibody (Santa Cruz Biotechnology). Following ChIP, DNA fragments were either pooled for PET library generation or analyzed for ER binding at specific sites by real-time PCR. Proper DNA fragment length and ER binding to the known pS2/TFF1 ERE were confirmed by gel electrophoresis and real-time PCR, respectively. ChIP assays using antibodies to Sp1 were performed as previously described [40]. The antibodies used were from Santa Cruz Biotechnology (Sp1 PEP-2, rabbit IgG) and Upstate Biotechnology (Sp1) (http://www.upstate.com). DNA obtained from ChIP was analyzed by quantitative real-time PCR using specific primers for the ER binding sites closest to selected ER-regulated genes. Real-time PCR. PCR quantification was performed on the ABI7500 Real-time PCR System (Applied Biosystems, http://www.appliedbiosystems.com) with 20 μl reaction volume consisting of 20 ng of ChIP samples or 20 ng of input DNA as templates, 0.2 μM primer pairs, and 10 μl of 2× SYBR Green PCR Master Mix (Applied Biosystems). For each PCR run, the samples underwent 40 amplification cycles. Fluorescence was acquired at the conclusion of each cycle at 60 °C during the amplification step. ChIP-PET library construction and sequencing. Around 140 ng of ChIP DNA were used for construction of the ChIP-PET library for mapping ER binding sites in the human genome, following a procedure described previously [18]. Briefly, End-It DNA End-Repair Kit (Epicentre, http://www.epibio.com) was used to repair the ends of the ChIP DNA. DNA fragments larger than 500 bps were selected by using cDNA size fractionation columns (Invitrogen) and cloned into pGIS-3a vector [18], which contains the Mme I cassettes flanking the cloning site (XhoI). The ligation mixture was transformed into the One Shot Top10 Electrocomp Cells (Invitrogen). A total of 2.3 million clones were obtained. Around 90% of the clones contained inserts. We plated out 1.2 million clones on LB-agar (ampicillin 50 ng/ml) and scraped off the cells for plasmid DNA isolation. Around 10 μg of purified plasmid DNA mixture was digested with MmeI and end-polished with T4 DNA polymerase to remove the 3′-dinucleotide overhangs. The resulting plasmids containing a signature tag from each terminal of the original ChIP DNA insert were self-ligated to form single-PET plasmids. These were then transformed into One Shot Top10 Electrocomp Cells (Invitrogen) to form a “single-PET library.” We plated out 1.2 million clones from this library on LB-agar (ampicillin 50 ng/ml) and extracted plasmid DNA from the cells. Around 250 μg of plasmid DNA were digested with BamHI to release the 50 bp PETs. About 600 ng of single-PETs were PAGE-purified, then concatenated and separated on 4%–20% gradient TBE-PAGE. Appropriate size fraction (600–1,100 bps) of the concatenated DNA was excised, extracted, and cloned into EcoRV-cut pZErO-1 (Invitrogen) to form the final ChIP PET library. The clones were grown on LB-Agar (Zeocin 25 μg/ml). The plasmids were prepared and sequenced using ABI3730 DNA analyzer. Microarray experiments and analysis. All microarray experiments were carried out on Affymetrix U133 A and B GeneChips. MCF-7 cells were treated with 10 nM E2 for 12, 24, and 48 h and RNA extraction, labeling, and hybridizations were performed according to manufacturer protocols. Affymetrix analysis software was used to perform the preliminary probe-level quantitation of the microarray data. These data were further normalized using the RMA [41] normalization method. The default option of RMA (with background correction, quantile normalization, and log transformation) was used to generate the normalized intensity for each probeset. Differentially expressed genes were identified at each time point separately using the three untreated at the time point as controls against the three treated samples. The SAM [42] statistical method was used to select differentially expressed genes. Genes were selected based on the q-value less than 2%. Experiments using patient samples were performed as described in a previous publication [23], and the data used in this study were obtained from the Uppsala cohort from the previous study. Construction of plasmids and luciferase reporter assay. ER ChIP-PET binding sites were amplified from MCF-7 genomic DNA by PCR and cloned into the pGL4-TATA vector (a minimal TATA box upstream of pGL4-Basic) by homologous recombination using the In-Fusion CF Dry-Down PCR Cloning kit (Clontech, http://www.clontech.com). Putative EREs were mutated using the QuickChange Site Directed Mutagenesis kit according to the manufacturers instructions (Stratagene, http://www.stratagene.com). MCF-7 cells, grown in hormone-depleted medium for at least 3 d, were cotransfected with the ChIP-PET constructs and HSV-TK renilla with Fugene (Roche, http://www.roche.com). After the cells were treated with 10 nM estradiol or ethanol for 18–24 h, cell lysates were harvested and assessed for firefly and renilla luciferase activity using the Dual Luciferase Reporter Assay system (Promega, http://www.promega.com). Sp1 RNA interference. Estrogen-deprived MCF-7 cells were transfected with Sp1 SMARTpool or GL3 luciferase control siRNA (Dharmacon, http://www.dharmacon.com), according to the manufacturer's instructions. After 72 h, cells were treated with 1nM E2 for 4 h. Total RNA was harvested and prepared using Trizol reagent (Invitrogen). Quantitative real-time PCR was performed as previously described [33]. The fold change in expression was calculated using the ribosomal protein 36B4 as an internal control as previously described [3,35]. Primer sequences are available upon request. Proteins were extracted from MCF-7 cells using RIPA buffer, separated on SDS-PAGE, transferred to nitrocellulose membrane, and immunoblotted using anti-Sp1 antibodies (Upstate Biotechnology). ChIP-PET mapping and primary annotations. Library saturation analysis. To assess the saturation of the library, we fitted a Hill Function:
Identification of high quality binding regions. The underlying aberrant genome of MCF-7 presented an additional challenge in determining which of the ChIP-PET clusters were truly bound by ER. Presence of amplified regions [45], with high and varying copy numbers, increased the probability of those regions being sampled during the ChIP assays, which translated into unusual overall ChIP-PET enrichments in multiple genomic pockets. Relying solely on the raw count of overlapping PETs would introduce undesirable false positives. To address this issue, we have developed a binding region identification algorithm (unpublished data) that produces lower false positives when predicting binding clusters in amplified regions, compared to using raw counts. When assessing the likelihood of a given ChIP-PET cluster being bound by ER, the two-stage algorithm first estimates the amount of noisy PETs surrounding the cluster of interest within its 25-kb flanking regions. Based on the estimated noise level and the neighborhood size (i.e., 50 kb), a moPET cut-off value can be calculated, such that the false positive probability is less than 1e−2. If the given ChIP-PET cluster has a stronger overlapping region (i.e., higher moPET value) than the calculated cut-off, we consider the cluster to be truly bound by ER. Comotif analysis. The rich presence of putative EREs points to the canonical and dominant theme of direct ER-DNA interaction. Nevertheless, ER interplays with other transcription factors have previously been reported and are expected, for it to exert wider and more diverse regulatory roles. These high quality binding regions present an unprecedented opportunity for the study of regulatory partners of ER. We employed a three-pronged approach to mine the binding regions for potential enrichment of binding motifs of other transcription factors, where the first assessed the enrichment of certain motifs in a given set of sequences, the second tested whether putative motifs of other transcription factors exhibited certain spatial correlation with respect to the main ERE or half ERE motif, and lastly the Genomatix suite was used for a low-throughput high quality semi-automatic assessment and visualization of potential comotifs. For the first and second sets of analysis, putative binding sites were identified based on weight matrices available in TRANSFAC (professional version 9.1) and using the accompanying MATCH program [26] with the “minimize False Positive” configuration. To compute the statistical significance of motif enrichment in a given set of sequences, a background sequence set, with its total length matching that of the sequence set, was generated using a third-order Markov Chain sequence model (trained on the whole human genome [hg17]) and was similarly scanned for putative TFBS. This was done 1,000 times, and for each TFBS matrix, the average number of sites found per nucleotide represents the background probability of finding its putative sites. The p-values for motif enrichment were computed under the Binomial distribution and were adjusted for multiple hypotheses testing using the conservative Bonferroni Correction procedure. Evaluation of spatial correlation between main ERE or ERE half sites was carried out using Kolmogorov-Smirnov test. A 500-bp sequence window was defined for each binding region, centering on its main ERE or ERE half site. The putative binding sites of each transcription factor were tested whether they were uniformly distributed within the sequence window. Conservation analysis. PhastCons scores are base-by-base values between 0 and 1 that give a measure of evolutionary conservation in eight vertebrate genomes (human, chimp, mouse, rat, dog, chick, fugu, and zebrafish) based on a phylogenetic hidden Markov model, phastCons [24], and Multiz alignments [46]. PhastCons Conserved Elements identify regions of the genome with high conservation scores. These tracks were obtained through the UCSC Genome Browser [47]. A binding site is identified as sequence conserved if its overlapping region overlaps any PhastCons Conserved Elements. Motif conservation analysis was carried out as follows: (1) sequences centered on the middle of the overlapping region of the 1,234 binding regions in human (hg17) were identified; (2) corresponding homologous regions in chimpanzee (panTro1), mouse (mm5), and dog (canFam2) were identified using the tool liftOver (UCSC Genome Browser utility tool); (3) corresponding fasta sequences were extracted; and (4) all sequences were scanned for the consensus ERE motif allowing for two mismatches. Process was repeated for various window sizes in human varying from 250 bp to 5 kb (unpublished data). Figure S1: Fitting of the Hill Function to Assess the Library Saturation The number of distinct unique PETs of the library, representing nonredundant information, is plotted against the number of PET sequenced (in chronological order) to attain it. A Hill Function, shown adjacent to the curve, is then fitted to the empirical data to estimate the total number of distinct PETs attainable should the PETs continue to be sequenced indefinitely. From that estimate, we conclude that the ER ChIP PET library used in this study reaches a saturation level of 73.24%. (1.2 MB AI). Click here for additional data file.(1.2M, pdf) Figure S2: Comparative Genome Hybridization Reveals Amplified Regions in the MCF-7 Breast Cancer Cell Line Genome (1.2 MB AI). Click here for additional data file.(1.2M, pdf) Figure S3: High Confidence ER Binding Sites Are Distributed throughout the Genome and Are Not Enriched in Specific Chromosomes When Amplified Regions Are Taken into Account (A) Comparison of number of ChIP-PET binding sites (open bars) to chromosome size (closed bars) is presented. (B) Binding site distribution (open bar) as compared to gene density (closed bars) on each chromosome is presented. (C) Location of ER binding sites relative to the nearest genes in the UCSC KG database shows a large majority of sites distal to the genes (>5 kb) or within intragenic regions. (1.3 MB AI). Click here for additional data file.(1.2M, pdf) Table S1: Complete Table of 1,234 High Confidence ChIP-PET Clusters Denoting ER Binding Sites in MCF-7 Cells (210 KB XLS) Click here for additional data file.(211K, xls) Table S2: List of Estrogen Responsive Genes Identified in Microarray Experiments with Adjacent ER Binding Sites (47 KB XLS) Click here for additional data file.(47K, xls) Table S3: Nonuniform Distribution of TFBSs in 1,234 ChIP-PET Clusters The Kolmogorov-Smirnov Test was employed to test whether the observed putative binding sites locations follow uniform distribution. (15 KB XLS) Click here for additional data file.(15K, xls) Table S4: Nonuniform Distribution of TFBSs Relative to the Main EREs of the Binding Regions, Based on Kolmogorov-Smirnov Test, as Described Earlier (16 KB XLS) Click here for additional data file.(17K, xls) Table S5: Nonuniform Distribution of TFBSs Relative to the Main Half EREs of the Binding Regions, Assessed under the Kolmogorov-Smirnov Test (15 KB XLS) Click here for additional data file.(15K, xls) Acknowledgments We thank Senali Abayratna for generating the pGL4-TATA and pGL4-2ERE-TATA constructs used in the study. Abbreviations
Footnotes ¤ Current address: Department of Microbiology and Molecular Biology, Brigham Young University, Provo, Utah, United States of America A previous version of this article appeared as an Early Online Release on April 17, 2007 (doi:10.1371/journal.pgen.0030087.eor). Author contributions. CYL, VBV, BSK, YR, and ETL conceived and designed the experiments. CYL, JST, TZ, SLK, MX, DHB, FS, AY, YKL, NP, LDM, EC, YR, and CLW performed the experiments. CYL, VBV, JST, KPC, LL, FS, JG, VAK, THC, LDM, EC, BSK, GB, CLW, and ETL analyzed the data. VBV, JST, DHB, FS, VAK, NP, LDM, EC, YR, GB, and CLW contributed reagents/materials/analysis tools. CYL, VBV, JST, BSK, GB, and ETL wrote the paper. Funding. This study was supported by the Singapore Agency for Science Technology and Research (A*STAR), the National Institutes of Health grant CA18119 (BSK), and a grant from The Breast Cancer Research Foundation (BSK). Competing interests. The authors have declared that no competing interests exist. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Cancer Res. 2000 Nov 1; 60(21):5977-83.
[Cancer Res. 2000]J Mol Endocrinol. 2001 Dec; 27(3):259-74.
[J Mol Endocrinol. 2001]J Mammary Gland Biol Neoplasia. 2000 Jul; 5(3):271-81.
[J Mammary Gland Biol Neoplasia. 2000]J Steroid Biochem Mol Biol. 2000 Nov 30; 74(5):311-7.
[J Steroid Biochem Mol Biol. 2000]Nucleic Acids Res. 2001 Jul 15; 29(14):2905-19.
[Nucleic Acids Res. 2001]Methods. 1997 Feb; 11(2):205-14.
[Methods. 1997]Cell. 2005 Jul 15; 122(1):33-43.
[Cell. 2005]Nat Genet. 2006 Nov; 38(11):1289-97.
[Nat Genet. 2006]Nat Methods. 2005 Feb; 2(2):105-11.
[Nat Methods. 2005]Cell. 2006 Jan 13; 124(1):207-19.
[Cell. 2006]Nat Genet. 2006 Apr; 38(4):431-40.
[Nat Genet. 2006]Cell. 2006 Jan 13; 124(1):207-19.
[Cell. 2006]Genome Biol. 2004; 5(9):R66.
[Genome Biol. 2004]Cancer Treat Res. 1988; 40():185-206.
[Cancer Treat Res. 1988]Mol Endocrinol. 2004 Jun; 18(6):1411-27.
[Mol Endocrinol. 2004]Cell. 2006 Jan 13; 124(1):207-19.
[Cell. 2006]Nat Genet. 2006 Apr; 38(4):431-40.
[Nat Genet. 2006]Cell. 2005 Jul 15; 122(1):33-43.
[Cell. 2005]Nat Genet. 2006 Nov; 38(11):1289-97.
[Nat Genet. 2006]Cell. 2005 Jul 15; 122(1):33-43.
[Cell. 2005]Nat Genet. 2006 Nov; 38(11):1289-97.
[Nat Genet. 2006]Genome Biol. 2004; 5(9):R66.
[Genome Biol. 2004]Endocrinology. 2003 Oct; 144(10):4562-74.
[Endocrinology. 2003]Nat Genet. 2006 Nov; 38(11):1289-97.
[Nat Genet. 2006]Proc Natl Acad Sci U S A. 2005 Sep 20; 102(38):13550-5.
[Proc Natl Acad Sci U S A. 2005]Genome Biol. 2004; 5(9):R66.
[Genome Biol. 2004]Cell. 2005 Jul 15; 122(1):33-43.
[Cell. 2005]Genome Res. 2005 Aug; 15(8):1034-50.
[Genome Res. 2005]Cell. 2005 Jul 15; 122(1):33-43.
[Cell. 2005]Nat Genet. 2006 Nov; 38(11):1289-97.
[Nat Genet. 2006]BMC Bioinformatics. 2006 Aug 14; 7():376.
[BMC Bioinformatics. 2006]J Steroid Biochem Mol Biol. 2000 Nov 30; 74(5):311-7.
[J Steroid Biochem Mol Biol. 2000]Mol Endocrinol. 1997 Oct; 11(11):1569-80.
[Mol Endocrinol. 1997]Nucleic Acids Res. 2003 Jul 1; 31(13):3576-9.
[Nucleic Acids Res. 2003]Endocrinology. 1998 Apr; 139(4):1981-90.
[Endocrinology. 1998]Mol Cell Endocrinol. 2003 Mar 28; 201(1-2):165-75.
[Mol Cell Endocrinol. 2003]Mol Cell Biol. 2006 Aug; 26(15):5759-70.
[Mol Cell Biol. 2006]Nucleic Acids Res. 1987 Feb 25; 15(4):1353-61.
[Nucleic Acids Res. 1987]Endocrinology. 2003 Oct; 144(10):4562-74.
[Endocrinology. 2003]Cancer Res. 2004 Feb 15; 64(4):1522-33.
[Cancer Res. 2004]Endocrinology. 2004 Jul; 145(7):3473-86.
[Endocrinology. 2004]Nature. 2002 Jan 31; 415(6871):530-6.
[Nature. 2002]Genome Biol. 2004; 5(9):R66.
[Genome Biol. 2004]Genome Biol. 2006; 7(9):R82.
[Genome Biol. 2006]Cell. 2005 Jul 15; 122(1):33-43.
[Cell. 2005]Nat Methods. 2005 Feb; 2(2):105-11.
[Nat Methods. 2005]Nucleic Acids Res. 2001 Jul 15; 29(14):2905-19.
[Nucleic Acids Res. 2001]Genes Cells. 2006 Jun; 11(6):575-91.
[Genes Cells. 2006]Endocrinology. 2002 Dec; 143(12):4583-91.
[Endocrinology. 2002]Cell. 2005 Jul 15; 122(1):33-43.
[Cell. 2005]EMBO J. 1999 Aug 2; 18(15):4270-9.
[EMBO J. 1999]Mol Endocrinol. 2004 Jul; 18(7):1610-9.
[Mol Endocrinol. 2004]Genome Biol. 2004; 5(9):R66.
[Genome Biol. 2004]Cell. 2005 Jul 15; 122(1):33-43.
[Cell. 2005]Nat Genet. 2006 Nov; 38(11):1289-97.
[Nat Genet. 2006]BMC Bioinformatics. 2006 Aug 14; 7():376.
[BMC Bioinformatics. 2006]Genome Biol. 2004; 5(9):R66.
[Genome Biol. 2004]J Biol Chem. 2006 Jun 16; 281(24):16272-8.
[J Biol Chem. 2006]Cell. 2006 Jan 13; 124(1):207-19.
[Cell. 2006]Bioinformatics. 2003 Jan 22; 19(2):185-93.
[Bioinformatics. 2003]Nucleic Acids Res. 2003 Feb 15; 31(4):e15.
[Nucleic Acids Res. 2003]Proc Natl Acad Sci U S A. 2005 Sep 20; 102(38):13550-5.
[Proc Natl Acad Sci U S A. 2005]Endocrinology. 2004 Jul; 145(7):3473-86.
[Endocrinology. 2004]Endocrinology. 2003 Oct; 144(10):4562-74.
[Endocrinology. 2003]Genome Biol. 2006; 7(9):R82.
[Genome Biol. 2006]Cell. 2006 Jan 13; 124(1):207-19.
[Cell. 2006]Nat Genet. 2006 Apr; 38(4):431-40.
[Nat Genet. 2006]BMC Bioinformatics. 2006 Aug 25; 7():390.
[BMC Bioinformatics. 2006]Anal Biochem. 1990 Sep; 189(2):235-43.
[Anal Biochem. 1990]Breast Cancer Res. 2005; 7(6):R1186-98.
[Breast Cancer Res. 2005]Nucleic Acids Res. 2003 Jul 1; 31(13):3576-9.
[Nucleic Acids Res. 2003]Genome Res. 2005 Aug; 15(8):1034-50.
[Genome Res. 2005]Genome Res. 2004 Apr; 14(4):708-15.
[Genome Res. 2004]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D590-8.
[Nucleic Acids Res. 2006]