![]() | ![]() |
Formats:
|
||||||||||
Evolution EVOPRINTER, a multigenomic comparative tool for rapid identification of functionally important DNA *Neural Cell-Fate Determinants Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892; and ‡Office of the Scientific Director, Intramural Research Program, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20892 † To whom correspondence may be addressed. E-mail: ward/at/codon.nih.gov or brodyt/at/ninds.nih.gov. Communicated by Marshall Nirenberg, National Institutes of Health, Bethesda, MD, August 10, 2005 Received July 5, 2005. Freely available online through the PNAS open access option. This article has been cited by other articles in PMC.Abstract Here, we describe a multigenomic DNA sequence-analysis tool, evoprinter, that facilitates the rapid identification of evolutionary conserved sequences within the context of a single species. The evoprinter output identifies multispecies-conserved DNA sequences as they exist in a reference DNA. This identification is accomplished by superimposing multiple reference DNA vs. test-genome pairwise blat (blast-like alignment tool) readouts of the reference DNA to identify conserved nucleotides that are shared by all orthologous DNAs. evoprinter analysis of well characterized genes reveals that most, if not all, of the conserved sequences are essential for gene function. For example, analysis of orthologous genes that are shared by many vertebrates identifies conserved DNA in both protein-encoding sequences and noncoding cis-regulatory regions, including enhancers and mRNA microRNA binding sites. In Drosophila, the combined mutational histories of five or more species affords near-base pair resolution of conserved transcription factor DNA-binding sites, and essential amino acids are revealed by the nucleotide flexibility of their codon-wobble position(s). Conserved small peptide-encoding genes, which had been undetected by conventional gene-prediction algorithms, are identified by the codon-wobble signatures of invariant amino acids. Also, evoprinter allows one to assess the degree of evolutionary divergence between orthologous DNAs by highlighting differences between a selected species and the other test species. Keywords: comparative genomics, evolution, gene structure and function Deciphering the regulatory mechanisms that control coordinate gene expression is a long-standing goal of biology. The comparison of orthologous DNA sequences from multiple vertebrate or invertebrate species promises to identify the cis- regulatory elements that are central to the dynamic interplay between a gene and its transcriptional regulators (1-3). This cross-species comparison, termed phylogenetic footprinting, is based on the hypothesis that functionally important sequences evolve at a significantly slower rate than nonfunctional DNA (1). Phylogenetic footprinting has been used successfully to discover multispecies-conserved sequences (MCSs) that are critical for gene function (reviewed in refs. 2, 4, and 5). An essential first step in this process is the alignment of multiple orthologous DNAs. Multisequence-alignment programs include threaded blockset aligner (6), footprinter (7), conreal (5), and phyme (8). The multiDNA alignments are accomplished either by simultaneous or sequential pairwise alignments of input DNAs, with alignment gaps introduced to optimize the overall homology comparisons. Individual genome searches have also been commonly used to initiate MCS searches, and two popular whole-genome search algorithms are blast (9) and blat (blast-like alignment tool) (10). One significant difference between the blast and blat algorithms is that blat keeps an index of a species genome in memory and uses this index to scan linearly through the query sequence, whereas blast indexes the query sequence first and then scans linearly along the database. This fundamental difference is the primary reason a blat alignment is significantly faster than other whole-genome alignment algorithms (10). The speed of blat alignment and the current availability of 13 vertebrate and seven Drosophila species blat-formatted genomes (see the Human blat Search database, available at http://genome.ucsc.edu/cgi-bin/hgBlat) enables rapid reference-DNA vs. test-genome pairwise homology searches of related or evolutionary distant species. Taking advantage of the speed of the blat alignment and the availability of multiple blat-formatted genomes, we developed a simple multigenomic comparative tool that allows one to rapidly identify MCSs as they appear in a species of interest. The evoprinter algorithm superimposes multiple blat readouts of individual reference-DNA vs. test-genome alignments to generate an evolutionary gene print (EvoP) of invariant DNA sequences as they appear in the reference DNA. Unlike most multispecies-alignment programs that display MCSs as consecutive columns of invariant nucleotides interspersed by alignment gaps, the EvoP readout displays only the reference DNA, with no alignment gaps, highlighting a species-centric representation of the conserved sequences. To facilitate the comparative analysis of evolutionary changes between test species, a second algorithm, evodifference (evodif) enables one to identify MCSs that are common to all but one of the test genomes. To demonstrate the efficacy of evoprinter as a phylogenetic-footprinting tool, we show how EvoPs of well characterized genes (one vertebrate and one Drosophila gene) accurately identify DNA sequences that have been shown to be essential for gene function. Also, we describe how evoprinter can be used to identify genes that had not been noticed by conventional gene-prediction methods. Materials and Methods evoprinter is a tool for discovering MCSs that are shared among three or more orthologous DNAs. The program uses the reference DNA outputs of blat alignments and then identifies the sequences within this DNA that are shared by all species. evoprinter is a javascript program that runs on the user's computer. Its algorithm creates an array of strings from the selected blat outputs and then looks for conservation of sequence by looping through the strings one letter at a time (outputting a black capital letter only for the reference DNA nucleotides that are aligned in all test species). Nucleotides within the reference DNA that are not shared are represented by lowercase gray letters. The program requires an up-to-date web browser, and javascript has to be enabled. There is no arbitrary limit on sequence capacity. For example, a 50-kb EvoP can be generated by splicing together two 25-kb blat outputs. The second evodif algorithm reveals what is different in any one species from the EvoP of all other test species (described below). The first step in generating an EvoP is the curation of the reference DNA (up to 25 kb per alignment) from the University of California, Santa Cruz Genome Browser database (http://genome.ucsc.edu/cgi-bin/hgGateway), the Ensembl database (available at: www.ensembl.org), or the FlyBase database (http://flybase.net). When copied and pasted into the blat engine input window (http://genome.ucsc.edu/cgi-bin/hgBlat), the pairwise alignment is performed between the reference DNA and a selected test species, and the highest-scoring readout alignment is then selected. The readout labeled as “YourSequence” (showing the reference DNA) is then copied and pasted into one of the evoprinter input windows (http://evoprinter.ninds.nih.gov) without removing numbering or spaces. This procedure is repeated with the same reference DNA vs. as many test species as required. evoprinter can also be used to generate a protein EvoP from blat alignments of amino acid sequences. One important feature of the evoprinter program is its ability to generate EvoPs from subsets of the selected blat readouts by unchecking the species or groups of species to be excluded. This flexibility is particularly useful when assessing whether the loss of an MCS or group of MCSs in one or more blat alignments is caused by (i) small mutational differences; (ii) chromosome rearrangements, including large insertions and/or deletions, resulting in loss of sequence colinearity; (iii) overall sequence divergence being so great that alignment is not achieved for short homologies; or (iv) sequencing gaps in the test genome. To identify MCSs that are shared by all but one of the test species, deselect all of the test-species readouts that were entered into the evoprinter except for the species in question, and then select the “Highlight Species Differences” button to generate the evodif readout. The lowercase red letters are nucleotides that are lost from the final EvoP if that species is included in the comparison. In addition to assessing the degree of evolutionary divergence, the evodif is particularly useful for identifying chromosome rearrangements (identified by uninterrupted blocks of lost MCSs). Color formatting of the EvoP and evodif readouts can be maintained by dragging the saved HTML output into word (Microsoft). Identification of potential transcription-factor DNA-binding sites was carried out by using matinspector (11). MicroRNA binding sites in Drosophila were identified as described (12), and human microRNA binding sites were identified by using the Human miRNA Viewer database (www.cbio.mskcc.org/mirnaviewer) as described (13). Results and Discussion evoprinter Analysis of Vertebrate Achaete-Scute Homologue 1 (Ascl1) Genes Identifies DNA Sequences That Are Essential for Its Expression and Function. The basic helix-loop-helix (bHLH) Ascl1 transcription factor has a critical role in establishing neural cell identities in the developing vertebrate embryo (see refs. 14-16 and references therein). Studies on the mammalian Ascl1 gene (Mash1) demonstrate that it is dynamically expressed in many proliferating CNS and peripheral nervous system neural progenitor cells (NPCs) during murine development (17, 18) and it is also tightly regulated in NPCs that give rise to pulmonary neuroendocrine cells (19). Cis-regulatory elements important for Mash1 expression in mice have been identified in the 5′ flanking intragenic DNA and within the 3′ noncoding region of its transcribed sequence (20, 21). Transgenic studies have localized the Mash1 CNS enhancer to an 1,158-bp region located 7 kb 5′ to its transcription start site (boxed sequence in Fig. 1B
Remarkably, a 15-kb EvoP of this region generated from human, chimpanzee, rhesus monkey, dog, rat, mouse (reference DNA), opossum, chicken, and Xenopus tropicalis DNA identifies a dense cluster of MCSs that are distributed throughout the critical tissue-specific regulatory region (Fig. 1 Transcription-factor DNA-binding site searches have revealed that many of the MCSs have core DNA-binding motifs for different transcription factors, such as homeodomain, bHLH, or Zn-finger proteins, and some have multiple interlocking binding sites for different factors (ref. 20 and data not shown). EvoPs of other characterized vertebrate enhancers have identified clustered MCSs within all cis-regulatory elements examined. For example, within the 90-bp tissue-specific region of the murine anterior neuroectoderm OTX2 enhancer (22), the EvoP reveals that 86% of the nucleotides (77 bp) are part of MCSs (data not shown). Constitutive expression of human Ascl1 (hash1) gene in lung neoplasms is a feature of one of the most virulent forms of lung cancer, small-cell lung cancer (SCLC) (23, 24). SCLC cell culture studies have demonstrated that hash1 expression in neuroendocrine tumors is controlled in part by a proximal enhancer positioned -234 to -46 from the transcribed sequence and a proximal repressor region located at -308 to -234 (24) (both regions are shown in the hatched box in Fig. 1C In addition to identifying MCSs within the upstream enhancer and proximal promoter, EvoP analysis of the transcribed region revealed multiple MCSs in the 5′ untranslated leader, one of which contains a canonical HES-1 binding site, whereas another harbors a docking site for IA-1 (25) (Fig. 1C In the Ascl-1 protein-encoding sequence, multiple MCSs are present and most delineate essential amino acid codons as deduced from invariant nucleotides in critical codon positions. A protein EvoP of the different vertebrate Ascl-1 amino acid sequences confirms that many of the conserved nucleotides identified in the genomic EvoP are positioned in invariant codon positions (data not shown). Within the 3′ UTR of the Ascl1 transcript, the EvoP identifies a dense cluster of MCSs that spans 600 bp of the 1.3-kb trailer (Fig. 1C Intragenus evoprinter Analysis of the Drosophila Krüppel (Kr) Gene Loci Identifies Functionally Important DNA. As a second example of the usefulness of evoprinter, we generated an EvoP of the well characterized Drosophila Kr transcription-factor gene. Kr plays multiple roles essential to different phases of Drosophila development. Initially identified as a regulator of thoracic and abdominal segmental identity in the early embryo (ref. 28 and see ref. 29 for review), Kr gene function has been shown to be required for the development of the Malpighian tubule (kidney) (30), muscle (31), and the nervous system (32, 33). Detailed studies of the cis-regulatory elements that control Kr embryonic expression have identified multiple enhancer regions located upstream of its transcribed sequence (34). The genomic regions that control early blastoderm, muscle precursor, amnioserosa, or CNS expression are shown in Fig. 2
Further dissection of the 1,159-bp CD1 region revealed that only its first 730 bp (Kr730, dashed line in Fig. 2 An EvoP of 12 kb spanning the Kr genomic locus using Drosophila melanogaster DNA as the reference DNA and Drosophila simulans, Drosophila yakuba, Drosophila ananassae, Drosophila pseudoobscura, and Drosophila virilis as test genomes has identified multiple MCSs within Kr730 but not in the remaining 420 bp that were found not to be essential for CD1 enhancer activity (36) (Fig. 2 Within the Kr transcribed sequence, EvoP identifies multiple clusters of MCSs, many of which encode essential amino acids as identified by their wobble signatures (identified by two or more 2-bp MCSs separated by single nonconserved nucleotides; Fig. 2 In the 3′ untranslated sequence of the Kr transcribed region, the EvoP identified a single MCS (Fig. 2 evoprinter Uncovers a Small Peptide Gene Not Identified in the Current FlyBase Annotation of the Drosophila Genome. The EvoP has the potential to discover small protein-encoding genes that had been previously unannotated by conventional gene-prediction methods. For example, EvoP exploration of the 12.9-kb intragenic region between the Drosophila beta amyloid protein precursor-like gene (43) and the ventral nervous system defective (vnd) gene (44) has identified a cluster of MCSs that were invariant in the D. melanogaster, D. simulans, D. yakuba, D. ananassae, D. pseudoobscura, D. virilis, and Drosophila mojavensis species. Positioned 8.5 kb upstream of the vnd transcribed sequence, portions of the MCS cluster possessed all of the hallmarks of an ORF that encodes short runs of conserved amino acids (multiple 2-bp MCSs separated by single nonconserved nucleotides) (Fig. 3
Although the conserved ORF was not identified in the recent FlyBase genome annotation release 3.1, a GenBank blast homology search using the predicted protein sequence revealed that the Heidelberg Prediction, Heidelberg Collection (HDC) had identified the ORF as the HDC16822 gene (45). The presence of HDC16822 was initially predicted by using a lower-stringency, ab initio gene-prediction algorithm (fgenesh; ref. 46) and then confirmed by whole-transcriptome microarray analysis (41). The genomic EvoP analysis also revealed the presence of additional upstream MCSs that may harbor HDC16822 cis-regulatory elements (Fig. 3 Summary. We have developed a simple, yet effective, comparative genomics tool for identifying MCSs shared among related DNAs. Generated from multiple pairwise blat alignments of a reference DNA to different test genomes, the EvoP presents an ordered, uninterrupted representation of the evolutionarily resilient sequences within the reference DNA. By superimposing the different species evolutionary histories, the combined mutagenic force reveals DNA sequences that are essential for gene expression and function. Also, the evodif algorithm reveals the degree of molecular divergence between species by identifying individual species differences to the EvoP. When compared with other multispecies-alignment tools, the two principal advantages of evoprinter are its speed (derived from the speed of a blat alignment) and the fact that only a single curated genomic sequence is required to initiate the analysis of orthologous DNAs from multiple species. Based on the success of the evoprinter identification of MCSs within known vertebrate and Drosophila cis-regulatory elements, we believe that this tool could be of great use to understanding gene regulation in all animals. Acknowledgments We thank L. Elnitski, J. Kassis, S. Landis, M. Muenke, H. Nash, and A. Raldow for helpful discussions; L. Elnitski and H. Nash for critically reading the manuscript; and J. Brody for help with the evoprinter web site construction and editorial assistance. This work was supported by the National Institutes of Health National Institute of Neurological Disorders and Stroke and National Institute of Medical Health Intramural Research Program. Notes Abbreviations: MSC, multispecies-conserved sequence; evodif, evodifference; bHLH, basic helix-loop-helix; Kr, Krüppel; Hb, Hunchback. References 1. Tagle, D. A., Koop, B. F., Goodman, M., Slightom, J. L., Hess, D. L. & Jones, R. T. (1988. ) J. Mol. Biol. 203, 439-455. [PubMed] 2. Wasserman, W. W., Palumbo, M., Thompson, W., Fickett, J. W. & Lawrence, C. E. (2000. ) Nat. Genet. 26, 225-228. [PubMed] 3. Yuh, C. H., Brown, C. T., Livi, C. B., Rowen, L., Clarke, P. J. & Davidson, E. H. (2002. ) Dev. Biol. 246, 148-161. [PubMed] 4. Grad, Y. H., Roth. F. P., Halfon, M. S. & Church, G. M. (2004. ) Bioinformatics 20, 2738-2750. [PubMed] 5. Berezikov, E., Guryev, V., Plasterk, R. H. & Cuppen, E. (2004. ) Genome Res. 14, 170-178. [PubMed] 6. Blanchette, M., Kent, W. J., Riemer, C., Elnitski, L., Smit, A. F., Roskin, K. M., Baertsch, R., Rosenbloom, K., Clawson, H., Green, E. D., et al. (2004. ) Genome Res. 14, 708-715. [PubMed] 7. Blanchette, M. & Tompa, M. (2003. ) Nucleic Acids Res. 31, 3840-3842. [PubMed] 8. Sinha, S., Blanchette, M. & Tompa, M. (2004. ) BMC Bioinformatics 5, 170. [PubMed] 9. Altschul, S. F., Madden, T. L., Schaffer, A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D. J. (1997. ) Nucleic Acids Res. 25, 3389-3402. [PubMed] 10. Kent, W. J. (2002. ) Genome Res. 12, 656-664. [PubMed] 11. Cartharius, K., Frech, K., Grote, K., Klocke, B., Haltmeier, M., Klingenhoff, A., Frisch, M., Bayerlein, M. & Werner, T. (2005. ) Bioinformatics 21, 2933-2942. [PubMed] 12. Enright, A. J., John, B., Gaul, U., Tuschl, T., Sander, C. & Marks, D. S. (2003. ) Genome Biol. 5, R1. [PubMed] 13. John, B., Enright, A. J., Aravin, A., Tuschl, T., Sander, C. & Marks, D. S. (2004. ) PLoS Biol. 2, e363. [PubMed] 14. Casarosa, S., Fode, C. & Guillemot, F. (1999. ) Development (Cambridge, U.K.) 126, 525-534. 15. Torii, M., Matsuzaki, F., Osumi, N., Kaibuchi, K., Nakamura, S., Casarosa, S., Guillemot, F. & Nakafuku M. (1999. ) Development (Cambridge, U.K.) 126, 443-445. 16. Tuttle, R., Nakagawa, Y., Johnson, J. E. & O'Leary, D. D. (1999. ) Development (Cambridge, U.K.) 126, 1903-1916. 17. Lo, L. C., Johnson, J. E., Wuenschell, C. W., Saito, T. & Anderson, D. J. (1991. ) Genes Dev. 5, 1524-1537. [PubMed] 18. Guillemot, F. & Joyner, A. L. (1993. ) Mech. Dev. 42, 171-185. [PubMed] 19. Borges, M., Linnoila, R. I., van de Velde, H. J., Chen, H., Nelkin, B. D., Mabry, M., Baylin, S. B. & Ball, D. W. (1997. ) Nature 386, 852-855. [PubMed] 20. Verma-Kurvari, S., Savage, T., Gowan, K. & Johnson, J. E. (1996. ) Dev. Biol. 180, 605-617. [PubMed] 21. Verma-Kurvari, S., Savage, T., Smith, D. & Johnson, J. E. (1998. ) Dev. Biol. 197, 106-116. [PubMed] 22. Kurokawa, D., Takasaki, N., Kiyonari, H., Nakayama, R., Kimura-Yoshida, C., Matsuo, I. & Aizawa, S. (2004. ) Development (Cambridge, U.K.) 131, 3307-3317. 23. Ball, D. W., Azzoli, C. G., Baylin, S. B., Chi, D., Dou, S., Donis-Keller, H., Cumaraswamy, A., Borges, M. & Nelkin, B. D. (1993. ) Proc. Natl. Acad. Sci. USA 90, 5648-5652. [PubMed] 24. Chen, H., Biel, M. A., Borges, M. W., Thiagalingam, A., Nelkin, B. D., Baylin, S. B. & Ball, D. W. (1997. ) Cell Growth Differ. 8, 677-686. [PubMed] 25. Breslin, M. B., Zhu, M., Notkins, A. L. & Lan, M. S. (2002. ) Nucleic Acids Res. 30, 1038-1045. [PubMed] 26. Liu, F., Pouponnot, C. & Massague, J. (1997. ) Genes Dev. 11, 3157-3167. [PubMed] 27. Breslin, M. B., Zhu, M. & Lan, M. S. (2003. ) J. Biol. Chem. 278, 38991-38997. [PubMed] 28. Wieschaus, E., Nusslein-Volhard, C. & Kluding, H. (1984. ) Dev. Biol. 104, 172-186. [PubMed] 29. Rivera-Pomar, R. & Jackle, H. (1996. ) Trends Genet. 12, 478-483. [PubMed] 30. Gaul, U. & Weigel, D. (1990. ) Mech. Dev. 33, 57-67. [PubMed] 31. Ruiz-Gomez, M., Romani, S., Hartmann, C., Jackle, H. & Bate, M. (1997. ) Development (Cambridge, U.K.) 124, 3407-3414. 32. Romani, S., Jimenez, F., Hoch, M., Patel, N. H., Taubert, H. & Jackle, H. (1966. ) Mech. Dev. 60, 95-107. 33. Isshiki, T., Pearson, B., Holbrook, S. & Doe, C. Q. (2001. ) Cell 106, 511-521. [PubMed] 34. Hoch, M., Schroder, C., Seifert, E. & Jackle, H. (1990. ) EMBO J. 9, 2587-2595. [PubMed] 35. Hoch, M., Seifert, E. & Jackle, H. (1991. ) EMBO J. 10, 2267-2278. [PubMed] 36. Hoch, M., Gerwin, N., Taubert, H. & Jackle, H. (1992. ) Science 256, 94-97. [PubMed] 37. Brody, T. & Odenwald, W. F. (2002. ) Development (Cambridge, U.K.) 129, 3763-3770. 38. Licht, J. D., Grossel, M., Figge, J. & Hansen, U. M. (1990. ) Nature 346, 76-79. [PubMed] 39. Licht, J. D., Hanna-Rose, W., Reddy, J. C., English, M. A., Ro, M., Grossel, M., Shaknovich, R. & Hansen, U. (1994. ) Mol. Cell. Biol. 14, 4057-4066. [PubMed] 40. Sauer, F. & Jackle, H. (1993. ) Nature 364, 454-457. [PubMed] 41. Hanna-Rose, W., Licht, J. D. & Hansen, U. (1997. ) Mol. Cell. Biol. 17, 4820-4829. [PubMed] 42. Grosskortenhaus, R., Pearson, B. J., Marusich, A. & Doe, C. Q. (2005. ) Dev. Cell 8, 193-202. [PubMed] 43. Rosen, D. R., Martin-Morris, L., Luo, L. Q. & White, K. (1989. ) Proc. Natl. Acad. Sci. USA 86, 2478-2482. [PubMed] 44. Kim, Y. & Nirenberg, M. (1989. ) Proc. Natl. Acad. Sci. USA 86, 7716-7720. [PubMed] 45. Hild, M., Beckmann, B., Haas, S. A., Koch, B., Solovyev, V., Busold, C., Fellenberg, K., Boutros, M., Vingron, M., Sauer, F., et al. (2003. ) Genome Biol. 5, R3. [PubMed] 46. Salamov, A. A. & Solovyev V. V. (2000. ) Genome Res. 10, 516-522. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||
J Mol Biol. 1988 Sep 20; 203(2):439-55.
[J Mol Biol. 1988]Dev Biol. 2002 Jun 1; 246(1):148-61.
[Dev Biol. 2002]Nat Genet. 2000 Oct; 26(2):225-8.
[Nat Genet. 2000]Bioinformatics. 2004 Nov 1; 20(16):2738-50.
[Bioinformatics. 2004]Genome Res. 2004 Jan; 14(1):170-8.
[Genome Res. 2004]Nucleic Acids Res. 1997 Sep 1; 25(17):3389-402.
[Nucleic Acids Res. 1997]Genome Res. 2002 Apr; 12(4):656-64.
[Genome Res. 2002]Bioinformatics. 2005 Jul 1; 21(13):2933-42.
[Bioinformatics. 2005]Genome Biol. 2003; 5(1):R1.
[Genome Biol. 2003]PLoS Biol. 2004 Nov; 2(11):e363.
[PLoS Biol. 2004]Genes Dev. 1991 Sep; 5(9):1524-37.
[Genes Dev. 1991]Mech Dev. 1993 Aug; 42(3):171-85.
[Mech Dev. 1993]Nature. 1997 Apr 24; 386(6627):852-5.
[Nature. 1997]Dev Biol. 1996 Dec 15; 180(2):605-17.
[Dev Biol. 1996]Dev Biol. 1998 May 1; 197(1):106-16.
[Dev Biol. 1998]Dev Biol. 1998 May 1; 197(1):106-16.
[Dev Biol. 1998]Dev Biol. 1996 Dec 15; 180(2):605-17.
[Dev Biol. 1996]Proc Natl Acad Sci U S A. 1993 Jun 15; 90(12):5648-52.
[Proc Natl Acad Sci U S A. 1993]Cell Growth Differ. 1997 Jun; 8(6):677-86.
[Cell Growth Differ. 1997]Nucleic Acids Res. 2002 Feb 15; 30(4):1038-45.
[Nucleic Acids Res. 2002]Genes Dev. 1997 Dec 1; 11(23):3157-67.
[Genes Dev. 1997]Nucleic Acids Res. 2002 Feb 15; 30(4):1038-45.
[Nucleic Acids Res. 2002]J Biol Chem. 2003 Oct 3; 278(40):38991-7.
[J Biol Chem. 2003]PLoS Biol. 2004 Nov; 2(11):e363.
[PLoS Biol. 2004]Dev Biol. 1998 May 1; 197(1):106-16.
[Dev Biol. 1998]Dev Biol. 1984 Jul; 104(1):172-86.
[Dev Biol. 1984]Trends Genet. 1996 Nov; 12(11):478-83.
[Trends Genet. 1996]Mech Dev. 1990 Dec; 33(1):57-67.
[Mech Dev. 1990]Cell. 2001 Aug 24; 106(4):511-21.
[Cell. 2001]EMBO J. 1990 Aug; 9(8):2587-95.
[EMBO J. 1990]EMBO J. 1991 Aug; 10(8):2267-78.
[EMBO J. 1991]Science. 1992 Apr 3; 256(5053):94-7.
[Science. 1992]EMBO J. 1990 Aug; 9(8):2587-95.
[EMBO J. 1990]EMBO J. 1991 Aug; 10(8):2267-78.
[EMBO J. 1991]Science. 1992 Apr 3; 256(5053):94-7.
[Science. 1992]EMBO J. 1991 Aug; 10(8):2267-78.
[EMBO J. 1991]EMBO J. 1990 Aug; 9(8):2587-95.
[EMBO J. 1990]Cell. 2001 Aug 24; 106(4):511-21.
[Cell. 2001]Nature. 1990 Jul 5; 346(6279):76-9.
[Nature. 1990]Mol Cell Biol. 1994 Jun; 14(6):4057-66.
[Mol Cell Biol. 1994]Nature. 1993 Jul 29; 364(6436):454-7.
[Nature. 1993]Mol Cell Biol. 1997 Aug; 17(8):4820-9.
[Mol Cell Biol. 1997]Genome Biol. 2003; 5(1):R1.
[Genome Biol. 2003]Dev Cell. 2005 Feb; 8(2):193-202.
[Dev Cell. 2005]EMBO J. 1990 Aug; 9(8):2587-95.
[EMBO J. 1990]Proc Natl Acad Sci U S A. 1989 Apr; 86(7):2478-82.
[Proc Natl Acad Sci U S A. 1989]Proc Natl Acad Sci U S A. 1989 Oct; 86(20):7716-20.
[Proc Natl Acad Sci U S A. 1989]Genome Biol. 2003; 5(1):R3.
[Genome Biol. 2003]Genome Res. 2000 Apr; 10(4):516-22.
[Genome Res. 2000]Mol Cell Biol. 1997 Aug; 17(8):4820-9.
[Mol Cell Biol. 1997]Dev Biol. 1998 May 1; 197(1):106-16.
[Dev Biol. 1998]EMBO J. 1991 Aug; 10(8):2267-78.
[EMBO J. 1991]Science. 1992 Apr 3; 256(5053):94-7.
[Science. 1992]