![]() | ![]() |
Formats:
|
||||||||||||||||||||
Copyright © 2008 by The National Academy of Sciences of the USA Genetics ChIP-on-chip significance analysis reveals large-scale binding and regulation by human transcription factor oncogenes aDepartment of Biomedical Informatics, bJoint Centers for Systems Biology, dInstitute for Cancer Genetics, eDepartment of Pathology, and fDepartment of Pediatrics, Columbia University, New York, NY 10032; and cFunctional Genomics and Systems Biology Group, IBM T.J. Watson Research Center, Yorktown Heights, NY 10598 3To whom correspondence may be addressed. E-mail: gustavo/at/us.ibm.com, Email: califano/at/c2b2.columbia.edu, or Email: af2196/at/columbia.edu Edited by Barry H. Honig, Columbia University, New York, NY, and approved November 1, 2008 Author contributions: A.A.M., P.S., A.C., A.A.F., and G.S. designed research; A.A.M., T.P., and P.S. performed research; T.P. and A.A.F. contributed new reagents/analytic tools; A.A.M. and P.S. analyzed data; and A.A.M., T.P., P.S., A.C., A.A.F., and G.S. wrote the paper. 1Present address: The Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142. 2A.C., A.A.F., and G.S. contributed equally to this work. Received July 9, 2008. Freely available online through the PNAS open access option. This article has been cited by other articles in PMC.Abstract ChIP-on-chip has emerged as a powerful tool to dissect the complex network of regulatory interactions between transcription factors and their targets. However, most ChIP-on-chip analysis methods use conservative approaches aimed at minimizing false-positive transcription factor targets. We present a model with improved sensitivity in detecting binding events from ChIP-on-chip data. Its application to human T cells, followed by extensive biochemical validation, reveals that 3 oncogenic transcription factors, NOTCH1, MYC, and HES1, bind to several thousand target gene promoters, up to an order of magnitude increase over conventional analysis methods. Gene expression profiling upon NOTCH1 inhibition shows broad-scale functional regulation across the entire range of predicted target genes, establishing a closer link between occupancy and regulation. Finally, the increased sensitivity reveals a combinatorial regulatory program in which MYC cobinds to virtually all NOTCH1-bound promoters. Overall, these results suggest an unappreciated complexity of transcriptional regulatory networks and highlight the fundamental importance of genome-scale analysis to represent transcriptional programs. Keywords: regulatory networks, T cell lymphoblastic leukemia, transcriptional regulation, systems biology The dysregulated activity of oncogenic transcription factors (TFs) contributes to neoplastic transformation by promoting aberrant expression of target genes involved in regulating cell homeostasis. Therefore, characterization of the regulatory networks controlled by these TFs is a critical objective in understanding the molecular mechanisms of cell transformation. ChIP-on-chip (ChIP2) (1) has emerged as a promising technology in the dissection of transcriptional networks by providing high-resolution maps of genome-wide TF–chromatin interactions. ChIP2 uses microarray technology to measure the relative abundance of genomic fragments derived from an immunoprecipitate (IP) sample, which is enriched in fragments bound by an immunoprecipitated protein (usually a TF), and a whole-cell extract (WCE) sample, containing fragments derived from a total chromatin preparation (input control) or an immunoprecipitation with a nonspecific control antibody (2). The 2 samples may either be hybridized to different arrays or labeled with different dyes and hybridized to the same array. Correct interpretation of ChIP2 data depends critically on an accurate statistical model to compute the probability that a given IP/WCE ratio is produced by a binding event rather than experimental noise. Recently, several elegant ChIP2 analysis methods have been proposed to tackle problems such as integrating measurements from adjacent probes (3–6) or inferring binding site locations at subprobe resolution (7). However, the lower-level problem of developing an accurate error model to define meaningful statistical thresholds has received comparably little attention [see SI and Fig. 1
Using an empirically determined model of the distribution of intensity ratios for non-IP-enriched probes in ChIP2 experiments, we developed an analytical method called ChIP2 Significance Analysis (CSA). When applied to ChIP2 data from the NOTCH1, MYC, and HES1 protooncogenes in human T cell acute lymphoblastic leukemia (T-ALL) cells, CSA increased the number of detected binding sites by up to an order of magnitude compared with other routinely used methods. Both binding site analysis and biochemical validation demonstrate quantitative agreement with CSA-predicted false-positive rates. Analysis of gene expression signatures indicates functional regulation by NOTCH1 across the entire range of predicted targets. Finally, the increased sensitivity reveals that virtually all NOTCH1-bound promoters are also bound by MYC. Overall, these results highlight the power of the proposed analysis framework for the identification of transcriptional networks and provide an improved and fundamentally different picture of the transcriptional programs controlled by NOTCH1, HES1, and MYC in T-ALL. Results Probe Statistics Are Accurately Modeled by CSA. T-ALL is a malignant tumor characterized by the aberrant activation of oncogenic TFs (13). We recently demonstrated that constitutive activation of NOTCH1 signaling due to mutations in the NOTCH1 gene activates a transcriptional network that controls leukemic cell growth (11, 14–16). These studies also demonstrated a fundamental role for HES1 and MYC as transcriptional mediators of NOTCH1 signals (15, 17). To characterize the structure of the oncogenic transcriptional network driven by activated NOTCH1 in T cell transformation, we sought to identify the direct transcriptional targets of NOTCH1, HES1, and MYC. We hypothesized that the development of an accurate statistical model would result in improved sensitivity in the identification of TF targets and a more accurate description of the individual and combinatorial regulatory programs controlled by these TFs. We first generated an empirical model of the distribution of IP/WCE intensity ratios for probes associated with unbound fragments (see Materials and Methods), and we used it to assign a P value to each probe in the analysis of ChIP2 assays representing replicate experiments for NOTCH1, MYC, and HES1. ChIP2 assays for these TFs were performed in HPB-ALL cells, a well-characterized T-ALL cell line with high expression levels of activated NOTCH1, MYC, and HES1. For NOTCH1, ChIP2 assays were also performed in CUTLL1 cells, another NOTCH1-dependent T-ALL cell line. The magnitude versus amplitude plots (Fig. 2
Improved ChIP2 Sensitivity by CSA. CSA then incorporates the probe significance model with an analytical method that integrates the statistics for replicate experiments and probes with nearby genomic locations (to account for ChIP2 fragmentation lengths, see Materials and Methods). We used CSA to compute the false discovery rate (FDR) associated with the most significant 500-bp region on each of the 16,697 promoters represented on the array. Analysis of NOTCH1, MYC, and HES1 promoter occupancy in T-ALL showed a larger than anticipated number of candidate target genes for these TFs. Specifically, using CSA at a conservative FDR of 0.05, the number of promoters on the array bound by the TFs in this study are: MYC (8,016; 48.0%), NOTCH1 in CUTLL1 (3,154; 18.9%), HES1 (3,074; 18.4%), and NOTCH1 in HPB-ALL (2,471; 14.8%) (Table 1).
Although the numbers reported above are far larger than the number of predicted targets commonly reported in ChIP2 analysis studies, we also compared against predictions from several published analysis methods with available software. One class of methods relies heavily on analyzing the shape of ratio values from multiple probes with nearby genomic proximity (3–7). These methods are generally not applicable to the relatively sparse arrays used in this study and produced very few predicted targets. As an initial benchmark, we compared against the Single Array Error Model (SAEM) (1, 18), the standard method packaged with the Agilent analysis software. This method models the intensity-dependent variance of probe ratio values, but then computes significance based on whole-dataset statistics. CSA predicted approximately an order of magnitude more bound promoters than SAEM (Table 1). Two published methods, ChIPOTle (19) and Chipper (8), compute significance using only probes with low ratio values, and these methods indeed predicted more targets than SAEM. However, both methods use normalization techniques that, to varying extents, rely on whole-dataset statistics, and as a result CSA predicted more targets than both. This was most apparent for MYC, which contained the largest number of high ratio probes. For the other TFs, CSA predicted roughly the same number of targets as ChIPOTle and twice as many targets as Chipper (Table 1). Accuracy of CSA Predictions Is Supported by Binding Site Enrichment Analysis. As a first test of the broad TF binding predictions generated by CSA, we evaluated the enrichment of MYC binding sites, using the TRANSFAC (20) position-specific scoring matrix M00322, in the promoters of target genes identified by CSA and other analysis methods. The DNA-binding component of NOTCH1 transcriptional complexes, CSL, is not represented in TRANSFAC or JASPAR (21), and the only HES1-associated matrix was found to be of low quality and a poor predictor of HES1 binding, independent of the algorithm used. For each analysis method, promoters were ranked by their P values computed from the MYC ChIP2 experiment, and MYC/M00322 matching sites were identified in the 600-bp fixed-length window centered on the most significant probe in the highest-scoring promoter region. The match threshold was set so that a negative set, S(−), of 3,000 fragments showing the least amount of MYC binding would produce a false-positive rate of 30%. Details on the procedure are given in the SI. Analysis of the cumulative proportion MYC/M00322-matching fragments as a function of their ChIP2 ranking by the corresponding method showed that fragments inferred by all methods were enriched in MYC/M00322 sites and that site enrichment was correlated with the ChIP2 ranking (Fig. 3
To compare the predicted false-positive rate by CSA with the significance of MYC binding site enrichment, we binned the fragments based on their CSA rankings (100/bin) and assessed whether the MYC/M00322 motif could be successfully used to distinguish the fragments in each bin from those in the negative set, S(−). The classification P value based on binding site enrichment was in excellent agreement with the CSA-inferred false-positive rate of each bin, suggesting significant enrichment of MYC sites in the promoters of ≈7,000 genes, corresponding to the range of high-confidence targets predicted by CSA (Fig. 3 Overall, these results suggest that CSA has increased sensitivity in identifying a larger number of binding events and that meaningful statistical cutoffs can be determined from data. Experimental Validation of CSA TF Binding Predictions. To further test the accuracy of CSA-based TF target predictions, we performed independent chromatin immunoprecipitation (ChIP) experiments for each of the 4 ChIP2 conditions and tested the IP enrichment of specific promoters by quantitative PCR (qPCR). We first analyzed 8 predicted NOTCH1 targets in HPB-ALL cells, randomly sampled at an FDR ≤20%. Seven of these 8 predicted fragments were validated as bound by NOTCH1, and only the least significant fragment failed validation (Table 2).
We tested an additional 12 targets for HES1 and MYC in HPB-ALL and for NOTCH1 in CUTLL1, sampling predicted targets uniformly at an FDR of 20% (i.e., 20% expected false-positives) (Table 2). In this analysis, 26 of 36 targets (72.2%) were positive by ChIP/qPCR, and 9 (25%) were negative. The remaining gene (the second least significant for MYC) could not be amplified by qPCR. Nonvalidated/false-positive targets were, in general, at the end of the ranked lists (Table 2). The only outlier was the first-ranked fragment for HES1 (KIAA1407 gene promoter). To obtain experimental evidence on the robustness of our validation assay, we randomly selected 10 genomic regions not identified as bound by MYC and 10 not identified as bound by HES1. Nine selected regions were within promoters, and 11 were in intergenic regions. As expected, none of these 20 regions showed evidence of binding by MYC or HES1 when tested by ChIP/qPCR. For all experiments, numerous validated genes had CSA ranks in the thousands. The lowest-ranking validated genes before encountering a false-positive were as follows: 2,223 for NOTCH1 in CUTLL1; 2,958 for NOTCH1 in HPB-ALL; 4,901 for MYC; and although the top-ranking gene for HES1 failed validation, the following 7, down to rank 3,247, were positive. Notably, many of the validated targets showed subtle ChIP2 signals. For example, C6orf82, a validated HES1 target, had ChIP2 binding ratios in replicate experiments of 1.37 and 1.68 for the most significant probe in its promoter, and there was no enrichment (ratios of 0.81 and 1.15) for its adjacent probe. However, upon ChIP/qPCR validation, this region showed binding ratios of 2.69 and 4.65. ChIP/qPCR results are available in the SI. Overall, 33 of the 44 genes (75%) selected from those with an FDR of 20% by CSA were validated by ChIP/qPCR. These biochemical validation results support our computationally derived conclusions regarding the broad range of binding for all tested TFs and demonstrate the power of CSA for reducing the false-negative rate in ChIP2 experiments. NOTCH1 Regulates Direct Target Genes Predicted by CSA. To test whether CSA-predicted NOTCH1-bound genes are also functionally regulated by this TF, we treated a panel of 10 T-ALL cell lines with Compound E, a γ-secretase inhibitor that blocks an essential proteolytic cleavage step required for release of the intracellular domains of NOTCH1 from the membrane and their translocation to the nucleus (22). Genome-wide expression profiles of cells treated for 72 h with Compound E (100 nM) or vehicle only (DMSO) were measured using microarrays, and expression changes were compared with NOTCH1 promoter occupancy identified by CSA analysis of the ChIP2 data. Overall, 11,606 genes were represented on both the ChIP2 and the expression arrays. For each gene we computed: (i) the ChIP2 FDR based on the highest-scoring 500-bp region in its promoter; (ii) the log2 expression ratio of the control versus treatment, averaged over the 10 cell lines and duplicate experiments; and (iii) the number of microarray experiments in which the gene was expressed (not called absent by MAS5), considering both Compound E-treated and DMSO-treated samples (because the group of expressed genes is essentially the same for both treatments, considering expressed genes using only one subgroup does not substantially change the results). Predicted NOTCH1-bound genes were more likely to be expressed than genes not identified as bound by NOTCH1 and showed clear down-regulation upon NOTCH1 inhibition (Fig. 4
Interaction of NOTCH1 and MYC Regulatory Networks. NOTCH1 and MYC operate as highly interrelated regulators of cell growth, proliferation, and survival during T cell development and transformation. In a recent study (11), we compared the regulatory networks controlled by NOTCH1 and MYC by using the ARACNE reverse engineering algorithm (23, 24) to predict 58 and 61 targets of NOTCH1 and MYC, respectively, and observed a significant overlap of 12 genes between the 2 lists (P < 10−51). We went on to characterize a feed-forward loop in which NOTCH1 directly regulates MYC, and both TFs regulate a common set of targets promoting leukemic cell growth. Based on these findings, we sought to further investigate the relationship between the genes bound by MYC and by NOTCH1 using the much larger list of targets inferred by CSA. Strikingly, the analysis predicted that MYC bound to 1,668 of the 1,804 (92.5%; P < 10−11) genes that were bound by NOTCH1, using a ChIP2 FDR threshold of 0.01. In agreement with the fundamental role of NOTCH1 in controlling leukemia cell growth (11), the NOTCH1-bound genes were highly enriched in gene ontology (GO) (25) categories related to cellular growth and metabolism, such as cellular metabolism (P < 10−41), RNA metabolism (P < 10−24), and protein biosynthesis (P < 10−9). The complete output of the GO enrichment analysis is given in the SI. Discussion We have shown that the choice of a realistic statistical model can dramatically affect the result of ChIP2 data analysis and its biological interpretation and proposed the CSA algorithm to assign meaningful statistical significance scores used to predict a more complete range of TF–target interactions. The method of assessing probe statistical significance relies on minimal assumptions: that the null distribution is symmetric and that bound fragments do not significantly affect the left tail of the null hypothesis statistics. As a result, it should generalize well to ChIP2 experiments performed using other platforms and cellular conditions. We used an independence model for replicate experiments and adjacent probes in the null hypothesis. Although this assumption is valid for relatively sparse arrays, denser arrays may introduce correlation for unbound nearby probes that are within the DNA fragmentation length, leading to unrealistically low FDR values if independence is assumed. We therefore recommend caution that the independence assumption applies when analyzing denser arrays, in which case the CSA method may be further improved by incorporating existing, more sophisticated models for the integration of data from nearby probes (3–7). However, for the arrays used in this study, which contain an average probe spacing of more than 200 nucleotides, we show that our results are in quantitatively good agreement with biochemical validation assays and that no correction seems to be required. The analysis of ChIP2 data from 3 oncogenic TFs reveals that CSA identifies far more bound gene promoters than standard analyses. Specifically, CSA predicts that each studied TF binds to several thousand target genes, with MYC binding to roughly half of the assayed promoters, providing additional insight into the extreme pluripotency of this protooncogene (26). These predictions might still be an underestimate, because only the proximal promoter regions (−0.8 kb to +0.2 kb, relative to transcription start site) are represented on the arrays used in this study. CSA predictions were validated by 3 independent tests. ChIP/qPCR experiments are in excellent correspondence with CSA-inferred FDRs, especially considering that ChIP/qPCR itself has a 10–20% false-negative rate (9–12). Computational validation by sequence analysis further indicates that CSA-inferred FDRs are in agreement with MYC binding site enrichments. Finally, gene expression analysis after NOTCH1 inhibition both provides further support for the CSA predictions and creates a stronger than expected association between bound and regulated genes. We found that NOTCH1 binds to a large number of promoters (>2,000) and that the set of corresponding genes is consistently, albeit weakly, regulated upon NOTCH1 inhibition. These results are highly consistent with a previous study performed in yeast (27) that also observed correspondence of ChIP2 results with both binding site enrichment and expression changes for a large number of genes. GO enrichment analysis shows that NOTCH1 subtly regulates a large number of genes involved in the cellular growth machinery. These results add an additional layer of regulation to the effects of NOTCH1 signaling in promoting cell growth, with important implications for understanding the role of NOTCH1 signaling in development and transformation. Thus, in addition to the established role of NOTCH1 in promoting growth through its interaction with MYC (17) and the PI3K-AKT (15) signaling pathway, NOTCH1 also has a direct effect in promoting cell growth. This irreversibly couples the developmental programs involved in stem cell homeostasis and lineage commitment activated upon NOTCH1 activation with the metabolic pathways needed for the expansion of stem cells and T cell progenitors. Finally, the availability of a more complete repertoire of bound promoters allows us to truly assess the extent of a TF's regulatory program and the combinatorial overlap between independent programs. Our analysis shows that 92.5% of the promoters bound by NOTCH1 are also bound by MYC. Indeed, it appears that NOTCH1 coregulates a specific subset of the MYC regulatory program. Although this was previously hinted at by the similarity of the regulatory programs inferred for the 2 TFs by expression analysis (17), the true extent of this overlap can only be grasped after resolving a more complete map of NOTCH1 and MYC targets. While contributing to our understanding of transcriptional regulation at the genome-scale, our findings suggest an even greater than expected complexity of transcriptional networks. Materials and Methods CSA Algorithm. The CSA algorithm takes as input probe intensity measurements after background subtraction and correction for factors such as spatial position on the array and print tip variability. In this study we used the standard Agilent procedure to obtain background-corrected probe intensity values, and we determined the statistical significance of binding regions by using the procedure described below. Single-Probe Significance Analysis. For each probe, the statistical significance of a measurement is inferred by computing the conditional probability of the magnitude (M) given the amplitude (A), Pnull(M|A), where M = log2(IP/WCE) and A = [log2(IP) + log2(WCE)]/2, under the null hypothesis (i.e., no enrichment in the IP compared with the WCE channel). Here, IP and WCE represent, respectively, the probe intensity measurements for the IP and WCE channels. The dependency of M on A is illustrated in Fig. 2 The method begins by estimating the joint probability distribution, P(M,A), using a bivariate Gaussian kernel density estimator (28). The kernel width of the estimator is calculated using the AMISE criterion (29). Conditioning on A yields the conditional distribution P(M|A) = P(M,A)/P(A), where P(A) is calculated using a univariate Gaussian kernel. For a particular average intensity value, A0, the conditional mean of the null distribution is inferred as
The conditional null distribution given A = A0 is inferred by projecting P(M|A = A0) across μ^M|A0 for M < μ^M|A0. This procedure is used to calculate Pnull(M|A) for an evenly spaced grid of A and M values, excluding the 1% of probes with the lowest A values (which are assigned a P value of 1). In this work we used step sizes of 0.05 and 0.01 for the A and M values, respectively. The complete conditional null distribution, Pnull(M|A), is computed using 2-dimensional linear interpolation. For each probe, statistical significance is assessed using a 1-tailed test with reference to this distribution. Because the distribution is empirical, there is a limit to the inferable minimum P value, which depends on the number of arrayed probes. For the arrays used in this study, we set the minimum P value to 10−5, which is roughly 1 divided by the number of probes on the array. We stress the importance of using an empirical distribution because we have observed that the empirical data generally display significantly non-Gaussian tails. Combining Replicates. We use Fisher's method (30) to compute an aggregate P value for each probe based on measurements from replicate experiments, under the null hypothesis that the probe is unbound in all replicates. Let pij denote the P value computed for the ith probe in the jth replicate experiment. Assuming that replicates are independent in the null hypothesis, a test statistic for evaluating the probability of a joint observation of P values across experiments is the product of the individual P values, i = Πj=1Mpij, where i is the test statistic and M is the number of replicate experiments. If modeled correctly, P values under the null hypothesis should be uniformly distributed (See SI). It is useful to log-transform this equation such that we evaluate −log( i) = −Σj=1Mlog(pij). Because the logarithm of a uniform distribution is exponentially distributed with mean 1, this equation is a sum of exponentially distributed random variables, which is a gamma-distributed random variable with mean 1 and M degrees of freedom. Thus, significance can be evaluated as ΓCDFM(−Σj=1Mlog(pij)), where ΓCDFM is the gamma cumulative distribution function with mean 1 and M degrees of freedom.Combining Regions. Because of sonication, the signal derived from a binding event may be detected by multiple probes in close genomic proximity to the binding site. To compute a combined statistic representing the probability of a binding event within the region spanned by multiple probes, we adapt a commonly used strategy (19) of using a fixed-size sliding window and integrating the values of probes falling within this window. Based on published measurements of fragmentation lengths (7), in this work we used a 500-bp window and a step size of 150 bp. Assuming that measurements from adjacent probes are independent in the null hypothesis, Fisher's method can again be applied to integrate the values from nearby probes. That is, let W represent the set of probes falling within a given 500-bp window. The integrated probability for this region is then calculated as
FDR Calculation. After computing a corrected P value for each gene representing the probability that the most significant region on the gene's promoter is bound, we control for multiple tests across genes and compute a false discovery rate using the Benjamini–Hochberg procedure (32). Let pk represent the corrected P value computed for gene k, let rk represent the rank of gene k sorted by the ChIP2 P values, and let G represent the total number of genes on the array; then, the false discovery rate for gene k is computed as FDRk = G*pk/rk. Supporting Information
Acknowledgments. A.A.M. was supported by an IBM PhD fellowship. This work was supported by National Cancer Institute Grants R01CA109755 (to A.C.) and R01CA120196 (to A.A.F.), National Institute of Allergy and Infectious Diseases Grant R01AI066116, the Alex Lemonade Stand Foundation (T.P.), The Cancer Research Institute, the WOLF Foundation, the National Centers for Biomedical Computing National Institutes of Health Roadmap Initiative (U54CA121852), and the Leukemia and Lymphoma Society (Grants 1287-08 and 6237-08). A.A.F. is a Leukemia and Lymphoma Scholar. Footnotes The authors declare no conflict of interest. This article is a PNAS Direct Submission. Data deposition: The microarray data have been deposited in the Gene Expression Omnibus (GEO) Database, www.ncbi.nlm.nih.gov/geo (accession no. GSE12868). ChIP2 data is at http://wiki.c2b2.columbia.edu/califanolab/PNASAM2009/. This article contains supporting information online at www.pnas.org/cgi/content/full/0806445106/DCSupplemental. References 1. Ren B, et al. Genome-wide location and function of DNA binding proteins. Science. 2000;290:2306–2309. [PubMed] 2. Buck MJ, Lieb JD. ChIP-chip: Considerations for the design, analysis, and application of genome-wide chromatin immunoprecipitation experiments. Genomics. 2004;83:349–360. [PubMed] 3. Glynn EF, et al. Genome-wide mapping of the cohesin complex in the yeast Saccharomyces cerevisiae. PLoS Biol. 2004;2:E259. [PubMed] 4. Kim TH, et al. A high-resolution map of active promoters in the human genome. Nature. 2005;436:876–880. [PubMed] 5. Li W, Meyer CA, Liu XS. A hidden Markov model for analyzing ChIP-chip experiments on genome tiling arrays and its application to p53 binding sequences. Bioinformatics. 2005;21(Suppl 1):i274–i282. [PubMed] 6. Zheng M, Barrera LO, Ren B, Wu YN. ChIP-chip: Data, model, and analysis. Biometrics. 2007;63:787–796. [PubMed] 7. Qi Y, et al. High-resolution computational models of genome binding events. Nat Biotechnol. 2006;24:963–970. [PubMed] 8. Gibbons FD, Proft M, Struhl K, Roth FP. Chipper: Discovering transcription-factor targets from chromatin immunoprecipitation microarrays using variance stabilization. Genome Biol. 2005;6:R96. [PubMed] 9. Lee TI, et al. Transcriptional regulatory networks in Saccharomyces cerevisiae. Science. 2002;298:799–804. [PubMed] 10. Polo JM, et al. Transcriptional signature with differential expression of BCL6 target genes accurately identifies BCL6-dependent diffuse large B cell lymphomas. Proc Natl Acad Sci USA. 2007;104:3207–3212. [PubMed] 11. Palomero T, et al. Transcriptional regulatory networks downstream of TAL1/SCL in T-cell acute lymphoblastic leukemia. Blood. 2006;108:986–992. [PubMed] 12. Odom DT, et al. Control of pancreas and liver gene expression by HNF transcription factors. Science. 2004;303:1378–1381. [PubMed] 13. Ferrando AA, et al. Gene expression signatures define novel oncogenic pathways in T cell acute lymphoblastic leukemia. Cancer Cell. 2002;1:75–87. [PubMed] 14. Palomero T, et al. Activating mutations in NOTCH1 in acute myeloid leukemia and lineage switch leukemias. Leukemia. 2006;20:1963–1966. [PubMed] 15. Palomero T, et al. Mutational loss of PTEN induces resistance to NOTCH1 inhibition in T-cell leukemia. Nat Med. 2007;13:1203–1210. [PubMed] 16. Weng AP, et al. Activating mutations of NOTCH1 in human T cell acute lymphoblastic leukemia. Science. 2004;306:269–271. [PubMed] 17. Palomero T, et al. NOTCH1 directly regulates c-MYC and activates a feed-forward-loop transcriptional network promoting leukemic cell growth. Proc Natl Acad Sci USA. 2006;103:18261–18266. [PubMed] 18. Hughes TR, et al. Functional discovery via a compendium of expression profiles. Cell. 2000;102:109–126. [PubMed] 19. Buck MJ, Nobel AB, Lieb JD. ChIPOTle: a user-friendly tool for the analysis of ChIP-chip data. Genome Biol. 2005;6:R97. [PubMed] 20. Matys V, et al. TRANSFAC: Transcriptional regulation, from patterns to profiles. Nucleic Acids Res. 2003;31:374–378. [PubMed] 21. Sandelin A, et al. JASPAR: An open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Res. 2004;32:D91–D94. [PubMed] 22. Miele L. Notch signaling. Clin Cancer Res. 2006;12:1074–1079. [PubMed] 23. Basso K, et al. Reverse engineering of regulatory networks in human B cells. Nat Genet. 2005;37:382–390. [PubMed] 24. Margolin AA, et al. ARACNE: An algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinformatics. 2006;7(Suppl 1):S7. [PubMed] 25. Ashburner M, et al. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25:25–29. [PubMed] 26. Pelengaris S, Khan M. The many faces of c-MYC. Arch Biochem Biophys. 2003;416:129–136. [PubMed] 27. Tanay A. Extensive low-affinity transcriptional interactions in the yeast genome. Genome Res. 2006;16:962–972. [PubMed] 28. Beirlant J, Dudewicz E, Gyorfi L, van der Meulen E. Nonparametric entropy estimation: An overview. Int J Math Stat Sci. 1997;6:17–39. 29. Sheather SJ, Jones MC. A reliable data-based bandwidth selection method for kernel density-estimation. J R Stat Soc Ser B. 1991;53:683–690. 30. Mosteller F, Fisher RA. Questions and answers. Am Stat. 1948;2:30–31. 31. Keles S, van der Laan MJ, Dudoit S, Cawley SE. Multiple testing methods for ChIP-Chip high density oligonucleotide array data. J Comput Biol. 2006;13:579–613. [PubMed] 32. Benjamini Y, Hochberg Y. Controlling the false discovery rate - a practical and powerful approach to multiple testing. J R Stat Soc Ser B. 1995;57:289–300. |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||
Science. 2000 Dec 22; 290(5500):2306-9.
[Science. 2000]Genomics. 2004 Mar; 83(3):349-60.
[Genomics. 2004]PLoS Biol. 2004 Sep; 2(9):E259.
[PLoS Biol. 2004]Nature. 2005 Aug 11; 436(7052):876-80.
[Nature. 2005]Bioinformatics. 2005 Jun; 21 Suppl 1():i274-82.
[Bioinformatics. 2005]Biometrics. 2007 Sep; 63(3):787-96.
[Biometrics. 2007]Nat Biotechnol. 2006 Aug; 24(8):963-70.
[Nat Biotechnol. 2006]Cancer Cell. 2002 Feb; 1(1):75-87.
[Cancer Cell. 2002]Blood. 2006 Aug 1; 108(3):986-92.
[Blood. 2006]Leukemia. 2006 Nov; 20(11):1963-6.
[Leukemia. 2006]Nat Med. 2007 Oct; 13(10):1203-10.
[Nat Med. 2007]Science. 2004 Oct 8; 306(5694):269-71.
[Science. 2004]PLoS Biol. 2004 Sep; 2(9):E259.
[PLoS Biol. 2004]Nature. 2005 Aug 11; 436(7052):876-80.
[Nature. 2005]Bioinformatics. 2005 Jun; 21 Suppl 1():i274-82.
[Bioinformatics. 2005]Biometrics. 2007 Sep; 63(3):787-96.
[Biometrics. 2007]Nat Biotechnol. 2006 Aug; 24(8):963-70.
[Nat Biotechnol. 2006]Nucleic Acids Res. 2003 Jan 1; 31(1):374-8.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D91-4.
[Nucleic Acids Res. 2004]Clin Cancer Res. 2006 Feb 15; 12(4):1074-9.
[Clin Cancer Res. 2006]Proc Natl Acad Sci U S A. 2006 Nov 28; 103(48):18261-6.
[Proc Natl Acad Sci U S A. 2006]Blood. 2006 Aug 1; 108(3):986-92.
[Blood. 2006]Nat Genet. 2005 Apr; 37(4):382-90.
[Nat Genet. 2005]BMC Bioinformatics. 2006 Mar 20; 7 Suppl 1():S7.
[BMC Bioinformatics. 2006]Nat Genet. 2000 May; 25(1):25-9.
[Nat Genet. 2000]PLoS Biol. 2004 Sep; 2(9):E259.
[PLoS Biol. 2004]Nature. 2005 Aug 11; 436(7052):876-80.
[Nature. 2005]Bioinformatics. 2005 Jun; 21 Suppl 1():i274-82.
[Bioinformatics. 2005]Biometrics. 2007 Sep; 63(3):787-96.
[Biometrics. 2007]Nat Biotechnol. 2006 Aug; 24(8):963-70.
[Nat Biotechnol. 2006]Arch Biochem Biophys. 2003 Aug 15; 416(2):129-36.
[Arch Biochem Biophys. 2003]Science. 2002 Oct 25; 298(5594):799-804.
[Science. 2002]Proc Natl Acad Sci U S A. 2007 Feb 27; 104(9):3207-12.
[Proc Natl Acad Sci U S A. 2007]Blood. 2006 Aug 1; 108(3):986-92.
[Blood. 2006]Science. 2004 Feb 27; 303(5662):1378-81.
[Science. 2004]Genome Res. 2006 Aug; 16(8):962-72.
[Genome Res. 2006]Proc Natl Acad Sci U S A. 2006 Nov 28; 103(48):18261-6.
[Proc Natl Acad Sci U S A. 2006]Nat Med. 2007 Oct; 13(10):1203-10.
[Nat Med. 2007]Proc Natl Acad Sci U S A. 2006 Nov 28; 103(48):18261-6.
[Proc Natl Acad Sci U S A. 2006]Genome Biol. 2005; 6(11):R97.
[Genome Biol. 2005]Nat Biotechnol. 2006 Aug; 24(8):963-70.
[Nat Biotechnol. 2006]J Comput Biol. 2006 Apr; 13(3):579-613.
[J Comput Biol. 2006]