![]() | ![]() |
Formats:
|
||||||||||||||
Copyright © 2006 by The National Academy of Sciences of the USA Genetics Hotspots of transcription factor colocalization in the genome of Drosophila melanogaster *Department of Molecular Biology, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands; ‡Department of Genetics, Yale University School of Medicine, New Haven, CT 06520; and ¶Department of Biological Sciences and Center for Computational Biology and Bioinformatics, Columbia University, New York, NY 10027 ‖To whom correspondence may be addressed. E-mail: kevin.white/at/yale.edu, Email: hjb/at/genomecenter.columbia.edu, or Email: b.v.steensel/at/nki.nl Communicated by Steven Henikoff, Fred Hutchinson Cancer Research Center, Seattle, WA, June 18, 2006. †C.M., L.V.S., and J.W. contributed equally to this work. §Present address: Institute of Developmental Biology and Molecular Medicine, Fudan University, 220 Handan Road, Shanghai 200433, China. Author contributions: C.M., L.V.S., J.W., K.P.W., H.J.B., and B.v.S. designed research; C.M., L.V.S., W.T., and F.G. performed research; C.M., J.W., E.d.W., L.D.W., X.-J.L., and B.v.S. analyzed data; and C.M., J.W., H.J.B., and B.v.S. wrote the paper. Received May 31, 2006. This article has been cited by other articles in PMC.Abstract Regulation of gene expression is a highly complex process that requires the concerted action of many proteins, including sequence-specific transcription factors, cofactors, and chromatin proteins. In higher eukaryotes, the interplay between these proteins and their interactions with the genome still is poorly understood. We systematically mapped the in vivo binding sites of seven transcription factors with diverse physiological functions, five cofactors, and two heterochromatin proteins at ≈1-kb resolution in a 2.9 Mb region of the Drosophila melanogaster genome. Surprisingly, all tested transcription factors and cofactors show strongly overlapping localization patterns, and the genome contains many “hotspots” that are targeted by all of these proteins. Several control experiments show that the strong overlap is not an artifact of the techniques used. Colocalization hotspots are 1–5 kb in size, spaced on average by ≈50 kb, and preferentially located in regions of active transcription. We provide evidence that protein–protein interactions play a role in the hotspot association of some transcription factors. Colocalization hotspots constitute a previously uncharacterized type of feature in the genome of Drosophila, and our results provide insights into the general targeting mechanisms of transcription regulators in a higher eukaryote. Keywords: chromatin profiling, DamID, tiling array, transcriptional regulator Transcription factors are proteins that control the expression of specific sets of genes. They act by binding to regulatory DNA elements in the vicinity of these genes. The target specificity of transcription factors is modulated by protein–protein interactions with other factors and by the local chromatin structure. Because of the complexity of these interactions, prediction of the in vivo binding sites of transcription factors based on sequence alone still is unreliable (1–3). In the past few years, experimental approaches have been developed to identify the binding sites of transcription factors in living cells, on a genomewide scale (4–6). Systematic mapping studies in yeast have indicated that each promoter is typically bound by a small set of transcription factors. The vast majority of promoters is occupied by only a few proteins, and binding of >10 proteins to a single promoter is rare (<1%) (7, 8). Although transcription factors frequently act in a combinatorial fashion (9), these results suggest that the transcription regulatory network in yeast shows a substantial degree of “division of labor,” where each factor (or a small group of factors) binds a distinct set of genes. Genomes of higher eukaryotes are much more complex than those of yeast species, raising the question of how transcription factor-binding sites are organized in such more complex genomes. Although sequence analysis of the genomes of several higher eukaryotes has indicated that transcription factor consensus motifs tend to be clustered in the genome (10–12), it is still unclear whether these clusters are the predominant targets in vivo, because systematic comparative studies of the genomic-binding patterns of regulatory proteins have not been reported yet. So far, genomewide-mapping efforts of transcription factor location in flies and mammals have focused on individual proteins or on small sets of functionally related factors (4–6, 13). Thus, our general knowledge of the interplay between various transcription factors and their in vivo binding in the genome in higher eukaryotes still is limited. To gain understanding of the general principles that underlie regulator–genome interactions, we initiated a systematic survey of the in vivo genomic-binding patterns of a broad set of regulatory proteins in Drosophila melanogaster. We mapped the binding of these proteins in the Drosophila Kc cell line by using DamID technology (14, 15) combined with genomic tiling path arrays (16). Surprisingly, we found that the genomic localization patterns of unrelated factors show a strong overlap. We identified many genomic sites (“colocalization hotspots”) that are targeted by most tested transcription factors and coregulators. Bioinformatics analysis and studies with mutated protein indicate that protein–protein interactions play a role in targeting of some regulator proteins to these hotspots. These results provide insights into the complex network of regulator–genome interactions in a higher eukaryote. Results We began our study in Drosophila by mapping the in vivo genomic-binding sites of seven transcription factors from different classes and with diverse physiological functions: Bicoid (Bcd), GAGA factor (Gaf), Jun-related antigen (Jra), Max, odd paired (opa), Ecdysone receptor (EcR) isoform B1 (EcRB1), and its heterodimerization partner Ultraspiracle (USP) (Table 1, which is published as supporting information on the PNAS web site). Except for heterodimerization of EcRB1 and USP, no physical or functional interactions have been reported between any of these proteins (www.flybase.org). The localization patterns of these seven factors were determined in the embryonic Kc cell line, which provides a homogeneous cell population. We used the DamID technology, which has been used to map genomic binding of a variety of proteins (14, 16–20). DamID involves the in vivo expression of a chimeric protein consisting of a transcription factor of interest fused to Escherichia coli DNA adenine methyltransferase (Dam) (15). The expression levels of the Dam-fusion proteins are kept very low to avoid mistargeting due to overexpression (14, 15). DNA in the close vicinity of the natural binding sites of the transcription factor is methylated preferentially by the tethered Dam. Methylated DNA fragments are isolated by using a PCR-based method, and microarrays then are used to detect the pattern of targeted adenine methylation, from which the localization pattern of the transcription factor can be deduced (14, 16, 18, 20). We used genomic tiling arrays containing 3,648 genomic fragments of 430–920 bp, together covering 2.9 Mbp of the Adh-cactus region of chromosome 2 of Drosophila (16). The results for all seven transcription factors are displayed graphically in a detailed map of the Adh-cactus region (Figs. 1
To further verify our DamID results, we performed ChIP, a method for detecting protein–DNA interactions that is fundamentally different from DamID (4, 5, 23). The ChIP profile of endogenous Gaf (i.e., in untransfected cells) is in good agreement with the Gaf DamID profile (Fig. 3
To investigate whether the high degree of colocalization is caused by nonspecific protein–DNA interactions, we determined the DamID profile of the DNA-binding domain (DBD) of the sequence-specific yeast transcription factor Gal4p (Fig. 1 Transcription factors rely on interactions with other proteins, such as corepressors, coactivators, and chromatin proteins, to regulate gene expression. We analyzed whether these proteins also associate with transcription factor-binding sites. Indeed, the corepressors Groucho (Gro), Rpd3, and Sin3 and the coactivator Brahma (Brm) colocalize strongly with the transcription factors and with each other (Figs. 2 Visual inspection of the DamID patterns indicated that the Adh-cactus region contains many sites where all transcription factors and cofactors associate together. To identify such sites in a systematic and unbiased way, we applied machine learning methods to classify protein composition along the Adh-cactus region of Drosophila into a number of distinct “chromatin types.” We first used a self-organizing map (SOM) algorithm (26) to generate a 2D representation of the DamID data for all 14 proteins, in which different areas correspond to genomic regions with different protein compositions (Fig. 4
We computationally screened a collection of transcription factors of known sequence specificity (30, 31) for preferential binding to colocalization hotspots (see Materials and Methods). Fig. 5 We investigated the mechanism of regulator targeting to hotspots more directly for the transcription factor Bcd. For this purpose, we generated DamID profiles of Bcd mutants that either lacked a functional DBD (BcdK50A; refs. 32 and 33) or consisted of the DBD alone (Fig. 7). Strikingly, BcdK50A localizes to the hotspots, whereas the DBD alone is not detected in most hotspots (Fig. 4 We noticed that colocalization hotspots are often located in predicted genes: 41 of the 61 hotspot regions overlap with 37 genes (Fig. 2 Recent studies in Drosophila cells have shown that the histone variant H3.3 is specifically deposited in transcribed genes and their flanking regions (28). Analysis of these data revealed that colocalization hotspots are strongly enriched in H3.3 (Fig. 4 We wondered whether colocalization hotspots, which we identified in Kc cells, also may be linked to sites of transcription during fly development. We therefore analyzed the developmental expression profiles of genes overlapping with hotspots by using published genomewide expression profiles from six different developmental stages (34). Strikingly, this analysis (Fig. 6
Discussion We report here that the target loci of several unrelated transcription factors and cofactors strongly overlap with each other in the genome of Drosophila. Colocalization hotspots, which are targeted by all seven transcription factors and six of eight other proteins tested, represent extreme cases of this phenomenon. Because Drosophila contains ≈700 transcription factors (35), we predict that many more factors will show strong colocalization and that hotspots may recruit up to a few hundred different proteins. This observation contrasts with results in yeast where overlap between transcription factors was relatively rare and hotspots were not apparent (7, 8). The mechanism by which such a large number of proteins are recruited to specific genomic loci remains to be elucidated. Our observation that Bcd does not require its DBD for hotspot association, together with the lack of enrichment of the consensus motifs for USP, Jra, and Bcd in hotspots, indicates that at least some transcription factors do not directly bind the DNA in hotspots, but instead are recruited through protein–protein interactions. We note that the DamID maps represent the average protein occupancy over time and over many cells, and, therefore, it is quite possible that the composition of hotspot-associated complexes is variable and dynamic (36). The function of hotspots is unclear, but at least three models can be envisioned that are not mutually exclusive. First, hotspots may represent general “sinks” or “buffers” that sequester many regulator molecules. Thus, hotspots may regulate the effective concentration of free transcription factor molecules in the nucleus. This model is in agreement with our observation that wild-type Bcd fails to bind to several loci containing the Bcd consensus motif, whereas the Bcd DBD (that is not tethered to hotspots) does bind to these loci. Second, hotspots may be directly involved in the regulation of nearby genes, similar to enhancers. Unlike classical enhancers, however, hotspots may associate with hundreds of proteins, many of which may have only a minor contribution to the overall effect on gene expression. Third, hotspots could mediate physical interactions between distant loci in the genome, as has been reported for several transcription factor-rich loci (37, 38). Regardless of what could be the regulatory function of hotspots, our results have important implications for our understanding of the mechanisms that determine the target specificity of transcription factors and for the interpretation of genomewide protein occupancy data in higher eukaryotes. Materials and Methods DamID and ChIP. Detailed descriptions of the methods for mapping protein location are described in Supporting Materials and Methods, which is published as supporting information on the PNAS web site. Data Analysis. All measured ratios in the DamID and ChIP experiments were log2-transformed and normalized to the median value of all Adh-cactus fragments. Release 3.1 of the Drosophila genome sequence and annotation was used (39). For the SOM analysis, an average over replicates was calculated, and probes with one or more missing values were excluded, leaving 3,140 probes. For each protein, the data were then normalized to zero mean and unit variance (Z scores). Training of the SOM was performed as described in ref. 27 by using a hexagonal topology of 24 × 12 nodes. Nodes in the converged SOM then were clustered into eight sets by using a K means algorithm, the optimal number of clusters being determined by the Davies-Bouldin index (29). To define the number of hotspots in the Adh-cactus region, we determined the number of clusters of contiguous fragments (“contigs”) of the hotspot type (i.e., type 1, as defined by SOM analysis). If hotspot fragments were interrupted by 1–2 fragments that had missing values for all (or most) of the proteins, we regarded those hotspot fragments as part of the same contig. In this way, we found 61 contigs varying from 1 to 10 fragments. To uncover sequence signals enriched in the loci targeted by each Dam fusion protein, we predicted the affinity of each probed region for a large number of transcription factors. To this end, we downloaded position-specific nucleotide count matrices from TRANSFAC (30) and JASPAR (31), added a pseudocount of one at each position, and normalized each position by dividing by the count for the most frequent base. The resulting position-specific affinity matrix was used to predict the (relative) affinity of each probed region for the corresponding transcription factor as described in refs. 21 and 22. MatrixREDUCE software (21, 22) was used to compute the t value for the regression coefficient between DamID signal and predicted affinity. The data were preprocessed by taking the residuals from a multivariate model fit based on counts for all mono- and dinucleotides, as well as the motif GATC, the site methylated by Dam. To uncover sequence signals enriched in the probed regions making up a given chromatin type, we compared the distribution of their predicted affinity with that of all other probed regions by using a paired t test. To compare the Pol II ChIP and chromosomewide expression profiling data (27) to the data generated in this study, we resampled the 1.5-kb tiling fragment data from MacAlpine et al. (27) as follows. For each Adh-cactus fragment, we determined the overlapping fragment(s) on the microarray of the MacAlpine study. If there was one overlapping fragment, we assigned the mean value of that fragment to our Adh-cactus fragment. If two or more partially overlapping fragments were found (which was the maximum), we assigned a weighted average of the mean values of these two fragments to our Adh-cactus fragment. For analysis of the enrichment of H3.3, we subtracted for each 100-bp tiling array fragment in the data set from Mito et al. (28) the mean H3 log ratio from the mean H3.3 log ratio. The resulting H3-corrected log ratios then were resampled to match our Adh-cactus tiling fragments. For the analysis of the developmental expression patterns, we used the genomewide data from Stolc et al. (34). Hybridization intensities of each exon probe were log2-transformed, data from replicate experiments were averaged, and exon probes corresponding to the same gene subsequently were averaged to obtain a single log2 expression level for each gene at each developmental stage. Next, these values were normalized to the mean log2 value, first by column (developmental stage) then by row (gene). The developmental expression levels of genes overlapping with one or more hotspots were compared with those of genes not overlapping with hotspots. Supporting Information. Additional data can be found at Fig. 11 and Data Set 1, which are published as supporting information on the PNAS web site. Supporting Information
Acknowledgments We thank Susan Parkhurst and Amir Orian (Fred Hutchinson Cancer Research Center, Seattle, WA) and David Wassarman (University of Wisconsin, Madison, WI) for plasmids; Nicolas Nègre and Giacomo Cavalli (Institute of Human Genetics, Centre National de la Recherche Scientifique, Montpellier, France) for ChIP protocol and Gaf antibody; Marcel van Batenburg and Barrett Foat for technical advice; and Melissa Davis for preparing microarrays. Some materials were received through the Drosophila Genomics Resource Center. This work was supported by a Human Frontier Science Program Grant (to B.v.S., K.P.W., and H.J.B.); National Institutes of Health Grants HG003008, CA12152, and R24 GM074105 (to H.J.B.) and R01 HG03231 (to K.P.W); the W.M. Keck Foundation (K.P.W.), the Beckman Foundation (K.P.W.), and a European Young Investigators Award (to B.v.S.). Abbreviations Footnotes Conflict of interest statement: No conflicts declared. References 1. Bulyk M. L. Genome Biol. 2003;5:201. [PubMed] 2. Chua G., Robinson M. D., Morris Q., Hughes T. R. Curr. Opin. Microbiol. 2004;7:638–646. [PubMed] 3. Siggia E. D. Curr. Opin. Genet. Dev. 2005;15:214–221. [PubMed] 4. Bertone P., Gerstein M., Snyder M. Chromosome Res. 2005;13:259–274. [PubMed] 5. Blais A., Dynlacht B. D. Genes Dev. 2005;19:1499–1511. [PubMed] 6. van Steensel B. Nat. Genet. 2005;37(Suppl.):S18–S24. [PubMed] 7. Harbison C. T., Gordon D. B., Lee T. I., Rinaldi N. J., Macisaac K. D., Danford T. W., Hannett N. M., Tagne J. B., Reynolds D. B., Yoo J., et al. Nature. 2004;431:99–104. [PubMed] 8. Lee T. I., Rinaldi N. J., Robert F., Odom D. T., Bar-Joseph Z., Gerber G. K., Hannett N. M., Harbison C. T., Thompson C. M., Simon I., et al. Science. 2002;298:799–804. [PubMed] 9. Beer M. A., Tavazoie S. Cell. 2004;117:185–204. [PubMed] 10. Berman B. P., Nibu Y., Pfeiffer B. D., Tomancak P., Celniker S. E., Levine M., Rubin G. M., Eisen M. B. Proc. Natl. Acad. Sci. USA. 2002;99:757–762. [PubMed] 11. Markstein M., Markstein P., Markstein V., Levine M. S. Proc. Natl. Acad. Sci. USA. 2002;99:763–768. [PubMed] 12. Rajewsky N., Vergassola M., Gaul U., Siggia E. D. BMC Bioinformatics. 2002;3:30. [PubMed] 13. Orian A. Curr. Opin. Genet. Dev. 2006;16:157–164. [PubMed] 14. van Steensel B., Delrow J., Henikoff S. Nat. Genet. 2001;27:304–308. [PubMed] 15. van Steensel B., Henikoff S. Nat. Biotechnol. 2000;18:424–428. [PubMed] 16. Sun L. V., Chen L., Greil F., Negre N., Li T.-R., Cavalli G., Zhao H., van Steensel B., White K. P. Proc. Natl. Acad. Sci. USA. 2003;100:9428–9433. [PubMed] 17. Bianchi-Frias D., Orian A., Delrow J. J., Vazquez J., Rosales-Nieves A. E., Parkhurst S. M. PLoS Biol. 2004;2:E178. [PubMed] 18. Greil F., van der Kraan I., Delrow J., Smothers J. F., de Wit E., Bussemaker H. J., van Driel R., Henikoff S., van Steensel B. Genes Dev. 2003;17:2825–2838. [PubMed] 19. Orian A., van Steensel B., Delrow J., Bussemaker H. J., Li L., Sawado T., Williams E., Loo L. W., Cowley S. M., Yost C., et al. Genes Dev. 2003;17:1101–1114. [PubMed] 20. van Steensel B., Delrow J., Bussemaker H. J. Proc. Natl. Acad. Sci. USA. 2003;100:2580–2585. [PubMed] 21. Foat B. C., Houshmandi S. S., Olivas W. M., Bussemaker H. J. Proc. Natl. Acad. Sci. USA. 2005;102:17675–17680. [PubMed] 22. Foat B. C., Morozov A. V., Bussemaker H. J. Bioinformatics. 2006 in press. 23. van Steensel B., Henikoff S. BioTechniques. 2003;35:346–350. 352–354, 356–357. [PubMed] 24. Negre N., Hennetin J., Sun L. V., Lavrov S., Bellis M., White K. P., Cavalli G. PLoS Biol. 2006;4:E170. [PubMed] 25. Smothers J. F., Henikoff S. Mol. Cell. Biol. 2001;21:2555–2569. [PubMed] 26. Kohonen T. Self-Organizing Maps. Berlin: Springer; 2001. 27. MacAlpine D. M., Rodriguez H. K., Bell S. P. Genes Dev. 2004;18:3094–3105. [PubMed] 28. Mito Y., Henikoff J. G., Henikoff S. Nat. Genet. 2005;37:1090–1097. [PubMed] 29. Wang J., Delabie J., Aasheim H., Smeland E., Myklebost O. BMC Bioinformatics. 2002;3:36. [PubMed] 30. Matys V., Fricke E., Geffers R., Gossling E., Haubrock M., Hehl R., Hornischer K., Karas D., Kel A. E., Kel-Margoulis O. V., et al. Nucleic Acids Res. 2003;31:374–378. [PubMed] 31. Sandelin A., Alkema W., Engstrom P., Wasserman W. W., Lenhard B. Nucleic Acids Res. 2004;32:D91–D94. [PubMed] 32. Hanes S. D., Brent R. Cell. 1989;57:1275–1283. [PubMed] 33. Treisman J., Gonczy P., Vashishtha M., Harris E., Desplan C. Cell. 1989;59:553–562. [PubMed] 34. Stolc V., Gauhar Z., Mason C., Halasz G., van Batenburg M. F., Rifkin S. A., Hua S., Herreman T., Tongprasit W., Barbano P. E., et al. Science. 2004;306:655–660. [PubMed] 35. Adams M. D., Celniker S. E., Holt R. A., Evans C. A., Gocayne J. D., Amanatides P. G., Scherer S. E., Li P. W., Hoskins R. A., Galle R. F., et al. Science. 2000;287:2185–2195. [PubMed] 36. Phair R. D., Scaffidi P., Elbi C., Vecerova J., Dey A., Ozato K., Brown D. T., Hager G., Bustin M., Misteli T. Mol. Cell. Biol. 2004;24:6393–6402. [PubMed] 37. Tolhuis B., Palstra R. J., Splinter E., Grosveld F., de Laat W. Mol. Cell. 2002;10:1453–1465. [PubMed] 38. Ling J. Q., Li T., Hu J. F., Vu T. H., Chen H. L., Qiu X. W., Cherry A. M., Hoffman A. R. Science. 2006;312:269–272. [PubMed] 39. Misra S., Crosby M. A., Mungall C. J., Matthews B. B., Campbell K. S., Hradecky P., Huang Y., Kaminker J. S., Millburn G. H., Prochnik S. E., et al. Genome Biol. 2002;3 RESEARCH0083. |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||
Genome Biol. 2003; 5(1):201.
[Genome Biol. 2003]Curr Opin Microbiol. 2004 Dec; 7(6):638-46.
[Curr Opin Microbiol. 2004]Curr Opin Genet Dev. 2005 Apr; 15(2):214-21.
[Curr Opin Genet Dev. 2005]Chromosome Res. 2005; 13(3):259-74.
[Chromosome Res. 2005]Genes Dev. 2005 Jul 1; 19(13):1499-511.
[Genes Dev. 2005]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]Science. 2002 Oct 25; 298(5594):799-804.
[Science. 2002]Cell. 2004 Apr 16; 117(2):185-98.
[Cell. 2004]Proc Natl Acad Sci U S A. 2002 Jan 22; 99(2):757-62.
[Proc Natl Acad Sci U S A. 2002]Proc Natl Acad Sci U S A. 2002 Jan 22; 99(2):763-8.
[Proc Natl Acad Sci U S A. 2002]BMC Bioinformatics. 2002 Oct 24; 3():30.
[BMC Bioinformatics. 2002]Chromosome Res. 2005; 13(3):259-74.
[Chromosome Res. 2005]Genes Dev. 2005 Jul 1; 19(13):1499-511.
[Genes Dev. 2005]Nat Genet. 2001 Mar; 27(3):304-8.
[Nat Genet. 2001]Nat Biotechnol. 2000 Apr; 18(4):424-8.
[Nat Biotechnol. 2000]Proc Natl Acad Sci U S A. 2003 Aug 5; 100(16):9428-33.
[Proc Natl Acad Sci U S A. 2003]Nat Genet. 2001 Mar; 27(3):304-8.
[Nat Genet. 2001]Proc Natl Acad Sci U S A. 2003 Aug 5; 100(16):9428-33.
[Proc Natl Acad Sci U S A. 2003]PLoS Biol. 2004 Jul; 2(7):E178.
[PLoS Biol. 2004]Genes Dev. 2003 Nov 15; 17(22):2825-38.
[Genes Dev. 2003]Genes Dev. 2003 May 1; 17(9):1101-14.
[Genes Dev. 2003]Proc Natl Acad Sci U S A. 2005 Dec 6; 102(49):17675-80.
[Proc Natl Acad Sci U S A. 2005]Chromosome Res. 2005; 13(3):259-74.
[Chromosome Res. 2005]Genes Dev. 2005 Jul 1; 19(13):1499-511.
[Genes Dev. 2005]Biotechniques. 2003 Aug; 35(2):346-50, 352-4, 356-7.
[Biotechniques. 2003]PLoS Biol. 2006 Jun; 4(6):e170.
[PLoS Biol. 2006]Proc Natl Acad Sci U S A. 2003 Aug 5; 100(16):9428-33.
[Proc Natl Acad Sci U S A. 2003]Genes Dev. 2003 Nov 15; 17(22):2825-38.
[Genes Dev. 2003]Mol Cell Biol. 2001 Apr; 21(7):2555-69.
[Mol Cell Biol. 2001]BMC Bioinformatics. 2002 Nov 24; 3():36.
[BMC Bioinformatics. 2002]Nucleic Acids Res. 2003 Jan 1; 31(1):374-8.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D91-4.
[Nucleic Acids Res. 2004]Cell. 1989 Jun 30; 57(7):1275-83.
[Cell. 1989]Cell. 1989 Nov 3; 59(3):553-62.
[Cell. 1989]Genes Dev. 2004 Dec 15; 18(24):3094-105.
[Genes Dev. 2004]Nat Genet. 2005 Oct; 37(10):1090-7.
[Nat Genet. 2005]Science. 2004 Oct 22; 306(5696):655-60.
[Science. 2004]Science. 2000 Mar 24; 287(5461):2185-95.
[Science. 2000]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]Science. 2002 Oct 25; 298(5594):799-804.
[Science. 2002]Mol Cell Biol. 2004 Jul; 24(14):6393-402.
[Mol Cell Biol. 2004]Mol Cell. 2002 Dec; 10(6):1453-65.
[Mol Cell. 2002]Science. 2006 Apr 14; 312(5771):269-72.
[Science. 2006]Genes Dev. 2004 Dec 15; 18(24):3094-105.
[Genes Dev. 2004]BMC Bioinformatics. 2002 Nov 24; 3():36.
[BMC Bioinformatics. 2002]Nucleic Acids Res. 2003 Jan 1; 31(1):374-8.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D91-4.
[Nucleic Acids Res. 2004]Proc Natl Acad Sci U S A. 2005 Dec 6; 102(49):17675-80.
[Proc Natl Acad Sci U S A. 2005]Genes Dev. 2004 Dec 15; 18(24):3094-105.
[Genes Dev. 2004]Nat Genet. 2005 Oct; 37(10):1090-7.
[Nat Genet. 2005]Science. 2004 Oct 22; 306(5696):655-60.
[Science. 2004]Genes Dev. 2004 Dec 15; 18(24):3094-105.
[Genes Dev. 2004]Nat Genet. 2005 Oct; 37(10):1090-7.
[Nat Genet. 2005]Nucleic Acids Res. 2003 Jan 1; 31(1):374-8.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D91-4.
[Nucleic Acids Res. 2004]Science. 2004 Oct 22; 306(5696):655-60.
[Science. 2004]