![]() | ![]() |
Formats:
|
||||||||||||||||
Copyright © 2009, European Molecular Biology Organization Neural stem cell transcriptional networks highlight genes essential for nervous system development 1The Gurdon Institute and The Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK aDepartment of Physiology, Development and Neuroscience, University of Cambridge, Wellcome CRUK Gurdon Institute, Tennis Court Road, Cambridge CB2 1QN, UK. Tel.: +44 1223 334 141; Fax: +44 1223 334 089; E-mail: ahb/at/mole.bio.cam.ac.uk Received June 15, 2009; Accepted September 14, 2009. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits distribution, and reproduction in any medium, provided the original author and source are credited. This license does not permit commercial exploitation without specific permission. Abstract Neural stem cells must strike a balance between self-renewal and multipotency, and differentiation. Identification of the transcriptional networks regulating stem cell division is an essential step in understanding how this balance is achieved. We have shown that the homeodomain transcription factor, Prospero, acts to repress self-renewal and promote differentiation. Among its targets are three neural stem cell transcription factors, Asense, Deadpan and Snail, of which Asense and Deadpan are repressed by Prospero. Here, we identify the targets of these three factors throughout the genome. We find a large overlap in their target genes, and indeed with the targets of Prospero, with 245 genomic loci bound by all factors. Many of the genes have been implicated in vertebrate stem cell self-renewal, suggesting that this core set of genes is crucial in the switch between self-renewal and differentiation. We also show that multiply bound loci are enriched for genes previously linked to nervous system phenotypes, thereby providing a shortcut to identifying genes important for nervous system development. Keywords: differentiation, multiple transcription factor binding, neural stem cell, self-renewal, transcriptional networks Introduction Recent work on Drosophila neural stem cells (or neuroblasts) has provided important insights into stem cell biology and tumour formation (Yu et al, 2006; Doe, 2008; Egger et al, 2008; Zhong and Chia, 2008). Neuroblasts divide in an asymmetric, self-renewing manner producing another neuroblast and a daughter cell that divides only once to give post-mitotic neurons or glial cells (Wodarz, 2005; Egger et al, 2008). During these asymmetric divisions the atypical homeodomain transcription factor, Prospero, is asymmetrically segregated to the smaller daughter cell, the ganglion mother cell (GMC), where it can enter the nucleus and regulate transcription (Figure 1B
To identify the transcriptional networks promoting neural stem cell fate, we profiled, on a whole genome scale, the binding sites of Asense, Deadpan and Snail. These three proteins are members of a small group of transcription factors that are expressed in all embryonic neuroblasts (Figure 1A The second transcription factor, Deadpan, is a basic-helix–loop–helix protein (Bier et al, 1992) related to the vertebrate Hes family of transcription factors. Deadpan is expressed in all neuroblasts and has been shown to promote the proliferation of optic lobe neural stem cells (Wallace et al, 2000). Unlike Asense, Deadpan is also expressed in the DM/PAN neuroblasts of the larval brain (Boone and Doe, 2008). The third factor, Snail, is a zinc-finger transcription factor whose vertebrate homologues have roles in the epithelial to mesenchymal transition and in cancer metastasis (Hemavathy et al, 2000). The Snail family members (Snail, Worniu and Escargot) are known to regulate neuroblast spindle orientation and cell-cycle progression (Ashraf and Ip, 2001; Cai et al, 2001). To further understand the role of these pan neural stem cell transcription factors, we have mapped their targets throughout the genome. This, combined with expression profiling, allows us to begin to build the gene regulatory networks governing neural stem cell self-renewal, and to enhance our knowledge of the function and mode of action of these transcription factors in neural stem cells. Results Asense, Deadpan, Snail and Prospero bind to many common targets To identify the genes regulated by Asense, Deadpan and Snail in the embryo, we mapped their binding sites in vivo by DamID (van Steensel and Henikoff, 2000; van Steensel et al, 2001), as we have previously done for Prospero (Choksi et al, 2006). In brief, DamID involves tagging a DNA or chromatin-associated protein with a Escherichia coli DNA adenine methyltransferase (Dam). Wherever the fusion protein binds, surrounding DNA sequences are methylated. Methylated DNA fragments can then be isolated, labelled and hybridised on a microarray. Here, we express Dam fusion proteins in vivo, in transgenic Drosophila embryos. Methylated DNA fragments from transgenic embryos expressing Dam alone serve as a reference. Target sites identified by DamID have been shown to match targets identified by chromatin immunoprecipitation (Sun et al, 2003; Song et al, 2004; Tolhuis et al, 2006), by mapping to polytene chromosomes (Bianchi-Frias et al, 2004) and by 3D microscopy data (Pickersgill et al, 2006; Guelen et al, 2008). In comparing our results for Asense, Deadpan, Snail and Prospero, we observe a high degree of overlap between their targets (Figure 2A and C
Properties of loci bound by Asense, Deadpan and Snail Genome-wide analysis of Asense DamID peaks shows that Asense binding is associated with increased levels of DNA conservation (determined by the alignment of eight insect species (Remm et al, 2001) (Figure 3B
The resolution of DamID is ~1 kb (Vogel et al, 2007) and there are currently no motif discovery tools available that can analyse the large amount of sequence data generated by full genome DamID. Therefore, we developed a motif discovery tool, called MICRA (Motif Identification using Conservation and Relative Abundance) to identify overrepresented motifs in low-resolution data. In brief, 1 kb of sequence from each binding site is extracted and filtered for conserved sequences. The relative frequency of each 6–10 mer is then calculated and compared with background frequency (see Supplementary data for more details). Using MICRA we identified the E-box, CAGCTG, as the most overrepresented 6 mer in the regions of Asense binding (131% overrepresented using a conservation threshold of 0.6; see Supplementary Figure S6; Figure 3C A GO annotation analysis (Martin et al, 2004) of the genes bound by Asense shows a highly significant overrepresentation of genes involved in nervous system development and cell fate determination (Supplementary Figure S1). Similar analyses were performed for Deadpan and Snail and for both transcription factors; DNA conservation was enriched surrounding their binding sites (Supplementary Figure S5). Deadpan and Snail targets fall broadly into the same gene ontology classes as Asense and Prospero (Choksi et al, 2006) (Supplementary Figures S2 and S3) and the binding peaks show a similar distribution relative to gene structure as for Asense (Supplementary Figure S4). Motif discovery using MICRA identifies sites consistent with previously published in vitro studies for Deadpan (CACGCG and CACGTG) (Winston et al, 1999, 2000) and Snail (CAGGTA) (Mauhin et al, 1993) (Supplementary Figure S6). These analyses provide unbiased support for the Deadpan and Snail DamID experiments. Multiple transcription factor binding is associated with increased conservation of regulatory sequences and with genes critical for the development of the nervous system When comparing our data sets for Asense, Deadpan, Snail and Prospero we find genomic loci in which multiple transcription factors bind. This phenomenon has been described previously in a Drosophila cell line (Moorman et al, 2006) and, more recently, in mouse embryonic stem (ES) cells (Chen et al, 2008) in which these loci are termed ‘multiple transcription factor-binding loci' (MTL). The ES cell MTLs are associated with ES-cell-specific gene expression and are thought to identify genes important for stem cell self-renewal. Our data provide an independent and direct, in vivo demonstration of the phenomenon described in these two earlier studies. Analysis of neural MTLs (as determined by binding of Asense, Deadpan, Prospero and Snail within a 2 kb window) shows increased sequence constraint, correlating with the number of transcription factors bound (Figure 4A and B
To investigate further the relationship between the number of transcription factors bound at a locus and the importance of the associated target gene in neural development, we assembled a database (www.neuroBLAST.org) comprising DamID data, expression profiling of neural transcription factors, and data on Drosophila nervous system development collated from genetic screens, expression screens, gene homology and text mining screens (see Supplementary data for details of database construction). Using a random permutation algorithm and training sets of known nervous system development genes we assigned weighted scores to each screen (Supplementary Figure S7). A total score is calculated for each gene, providing an indication of the gene's involvement in nervous system development. Multiple gene lists can be searched in the database, which is a useful method to pinpoint key genes in user generated gene lists (e.g. expression array results). Using the data collected for the database, we consistently find a correlation between gene sets bound by increasing numbers of transcription factors and genes in Drosophila genetic screens for defects in nervous system development, eye development and cell-cycle progression (r=0.83, Supplementary Figure S8A) or in text mining screens (occurrence of the gene or its homologue with neural or stem cell terms; r=0.98, Supplementary Figure S8B). Neural stem cell gene regulatory networks highlight genes crucial for neural development We have shown that Asense, Deadpan, Prospero and Snail bind to genes essential for neural development. This finding enables us to highlight novel genes that may be involved in neural development. The neuroBLAST database ranks genes based on the number of transcription factors bound, together with their appearance in external screens. In this way it identifies known key players in neural development (Figure 5
Interestingly, there are many high scoring genes that have not previously been characterised for a role in Drosophila neural development (a selection are highlighted in Figure 5 High-resolution expression profiling suggests a dual role for Asense Using the binding data for these four transcription factors as a foundation, we sought to construct the transcriptional networks governing neural stem cell self-renewal and differentiation. Although DamID reports protein-binding sites, it cannot show how individual target genes are regulated in response to binding. Expression profiling of neuroblasts and GMCs from wild type and mutant embryos can provide this information, and provide greater insight into the biological function of each of the transcriptional regulators (see, for example, Choksi et al, 2006). Expression profiling of asense mutants was performed on 50–100 neuroblasts and GMCs microdissected from the ventral nerve cord of stage 11 wild type and mutant embryos (see Material and methods). Figure 6A
Discussion We combined in vivo chromatin profiling and cell-specific expression profiling to identify the gene regulatory networks directing neural stem cell fate and promoting differentiation in the Drosophila embryo. We find that the transcription factors Asense, Deadpan, Snail and Prospero bind to many of the same target genes. The targets of Asense, Deadpan and Snail include neuroblast genes but also many differentiation genes. The binding of these neural stem cell factors to differentiation genes is not entirely unexpected. In vertebrates, stem cell transcription factors bind to and repress differentiation genes to maintain the stem cell state (Boyer et al, 2005; Loh et al, 2006). Additionally, it is becoming apparent that transcription factors can have roles in both activation and repression, in Drosophila (Choksi et al, 2006) and in vertebrate stem cell transcriptional networks (Boyer et al, 2005; Loh et al, 2006). The ability to either repress or activate is likely to be due to interaction with co-factors, and the ability to recruit chromatin remodelling complexes to specific loci. We showed previously that Prospero represses the expression of Asense and Deadpan in GMCs, supporting a model whereby a core set of genes involved in neuroblast self-renewal and multipotency is activated by the neuroblast transcription factors and repressed by Prospero. Here, we show that, in part, Asense acts oppositely to Prospero, promoting the expression of neuroblast genes and repressing certain differentiation genes. However, our data also indicate that Asense can promote the expression of some genes required for differentiation, including the cell-cycle inhibitor dacapo, which is a member of the p21/p27 family of cdk inhibitors (de Nooij et al, 1996). dacapo expression inititates in the GMC (Liu et al, 2002) and we observe a reduction in levels of dacapo mRNA in the asense mutant neuroblasts and GMCs, similar to what has been reported in the developing optic lobe (Wallace et al, 2000). asense mRNA is known to be expressed in at least a subset of GMCs (Brand et al, 1993) and Asense protein is present in larval GMCs (Bowman et al, 2008). This suggests that Asense has a secondary role, to promote GMC cell-cycle exit and differentiation. Asense is absent in larval PAN neuroblasts whose progeny, unlike GMCs, divide in a stem cell-like manner. (Bello et al, 2008; Bowman et al, 2008). Ectopic expression of Asense prevents formation of these daughter cells, which can undergo extra divisions (Bowman et al, 2008), possibly by the upregulation of dacapo, and other differentiation genes. The expression pattern, function and binding site specificity of Asense all correlate strongly with its vertebrate counterpart, Ascl1 (Mash1). Mash1 is expressed in neural precursors in vertebrates (Parras et al, 2004), is known to regulate genes involved in Notch signalling (Delta, Jag2, Lfng and Magi1), cell-cycle control (Cdc25b) and neuronal differentiation (Insm1) (Castro et al, 2006) and recognises the E-box sequence, CAGCTG (Hu et al, 2004; Castro et al, 2006). Furthermore, Mash1 is consistently found to promote neuronal differentiation (Sommer et al, 1995; Tomita et al, 1996), which is consistent with a pro-differentiation role for Asense. Conversely, we show that Asense activates the expression of certain neuroblast genes, such as miranda, which is expressed in all neuroblasts and repressed by Prospero. Deadpan and Snail bind to many neuroblast genes. Given that the expression of deadpan and snail is restricted to pan-neural neuroblasts, it is likely that they can also activate the expression of neuroblast genes. However, confirmation of this awaits expression profiling of deadpan and snail mutant neuroblasts and GMCs. Finally, we show that multiple transcription factor binding is associated with genes that have critical functions in neural development. We show that this relationship can be used to identify novel genes involved in neural development, including those with vertebrate counterparts. A similar gene network and data mining study, using two pair-rule genes in Drosophila, has recently been used to identify a new marker for kidney cancer (Liu et al, 2009). Therefore, large-scale analysis of gene regulatory networks, as used here, provides a powerful approach to identifying key genes involved in development and disease. Materials and methods Fly lines asense1 is a deficiency removing the entire asense gene (Gonzalez et al, 1989). Control flies used for expression profiling were w1118. UAS-Dam has been described previously (Choksi et al, 2006). DamID Full-length coding sequences from asense, deadpan and snail were PCR amplified from an embryonic cDNA library and cloned into pUASTNDam (Choksi et al, 2006) using BglII and NotI sites. Transgenic lines were generated as described previously (Choksi et al, 2006). Stage 10–11 embryos (4–7 h AEL) were collected from UAS-Dam, UAS-Dam-Asense, UAS-Dam-Deadpan and UAS-Dam-Snail. DNA isolation, processing and amplification were performed as described previously (Choksi et al, 2006). Samples were labelled and hybridised to a custom whole genome 375 000 feature tiling array, with 60-mer oligonucleotides spaced at ~300 bp intervals (Choksi et al, 2006). Arrays were scanned and intensities extracted (Nimblegen Systems). Two biological replicates for each TF were performed. Log2 mean ratios of each spot were median normalised. DamID analysis A peak finding algorithm with false discovery rate (FDR) analysis (PERL script available on request) was developed to identify significant binding sites. All peaks spanning four or more consecutive probes (>~1200 bp) over a two-fold increase were analysed and assigned an FDR value (using 100 iterations). This analysis was performed for each chromosome arm. Nimblegen genome coordinates were converted to Release 5.0 of the Drosophila genome and genes were defined as targets in which a binding event (with an FDR of 0%) occurred within up to 5 kbp of the gene structure (depending on the proximity of adjacent genes). MTLs were defined as ~1800 bp genomic regions (six consecutive probes) containing binding events from 2, 3 or 4 transcription factors. Expression profiling Using a drawn out and bevelled capillary, samples of ~100 cells were extracted in vivo from the ventral and intermedial columns of the ventral nerve cords of late stage 11 homozygous ase1 embryos (ase1/ FM7c, Kruppel-GAL4; UAS-GFP) and w1118 embryos. cDNA was amplified from the cells, labelled and hybridised to microarrays (FL002; Flychip Cambridge Microarray Facility) as described previously (Choksi et al, 2006). Statistics FDR analysis was performed for DamID determined binding sites (see above). Linear correlation regression and significance were calculated in Graphpad Prism. Significance analysis of microarrays (Tusher et al, 2001) was used to analyse the expression microarray data. Genes with less than 1% FDR were identified as significant. MICRA, conservation analysis and database construction See Supplementary data. Immunohistochemistry and in situ hybridisation See Supplementary data. Public database access of microarray data The raw and normalised data for the DamID- binding experiments and the expression profiling experiments described here are available on the GEO public database (http://www.ncbi.nlm.nih.gov/projects/geo/). The accession numbers are as follows: Four DamID data sets (Asense, Deadpan, Snail and Prospero)—GSE18270. asense mutant expression profiling—GSE18214. prospero mutant expression profiling—GSE18213. Supplementary Information Click here to view.(1.2M, doc) Supplementary Data 1 Click here to view.(1.0M, xls) Supplementary Data 2 Click here to view.(176K, xls) Supplementary Data 3 Click here to view.(71K, xls) Review Process File Click here to view.(403K, pdf) Acknowledgments We thank A Jarman for providing fly stocks; D Griffiths, M Goodwin, S Sansom and members of the Brand lab for critical reading of the paper; K Edoff and FlyChip for assisting with expression profiling experiments and A Carr for advice on bioinformatics. This work was funded by a Wellcome Trust Programme Grant and an MRC Project Grant to AHB. TDS is funded by a Herchel Smith Fellowship. Footnotes The authors declare that they have no conflict of interest. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||
Neuron. 2006 Jul 6; 51(1):13-20.
[Neuron. 2006]Development. 2008 May; 135(9):1575-87.
[Development. 2008]Philos Trans R Soc Lond B Biol Sci. 2008 Jan 12; 363(1489):39-56.
[Philos Trans R Soc Lond B Biol Sci. 2008]Curr Opin Neurobiol. 2008 Feb; 18(1):4-11.
[Curr Opin Neurobiol. 2008]Curr Opin Cell Biol. 2005 Oct; 17(5):475-81.
[Curr Opin Cell Biol. 2005]EMBO J. 1989 Dec 1; 8(12):3553-62.
[EMBO J. 1989]Development. 1993 Sep; 119(1):19-29.
[Development. 1993]Development. 1993 Sep; 119(1):1-17.
[Development. 1993]Neural Dev. 2008 Feb 19; 3():5.
[Neural Dev. 2008]Dev Cell. 2008 Apr; 14(4):535-46.
[Dev Cell. 2008]Genes Dev. 1992 Nov; 6(11):2137-51.
[Genes Dev. 1992]Genesis. 2000 Jan; 26(1):77-85.
[Genesis. 2000]Dev Neurobiol. 2008 Aug; 68(9):1185-95.
[Dev Neurobiol. 2008]Gene. 2000 Oct 17; 257(1):1-12.
[Gene. 2000]Development. 2001 Dec; 128(23):4757-67.
[Development. 2001]EMBO J. 2001 Apr 2; 20(7):1704-14.
[EMBO J. 2001]Nat Biotechnol. 2000 Apr; 18(4):424-8.
[Nat Biotechnol. 2000]Nat Genet. 2001 Mar; 27(3):304-8.
[Nat Genet. 2001]Dev Cell. 2006 Dec; 11(6):775-89.
[Dev Cell. 2006]Proc Natl Acad Sci U S A. 2003 Aug 5; 100(16):9428-33.
[Proc Natl Acad Sci U S A. 2003]Mol Cell Biol. 2004 Oct; 24(19):8790-802.
[Mol Cell Biol. 2004]J Mol Biol. 2001 Dec 14; 314(5):1041-52.
[J Mol Biol. 2001]Nature. 2007 Jun 14; 447(7146):799-816.
[Nature. 2007]Cell. 2008 Jun 13; 133(6):1106-17.
[Cell. 2008]Nat Protoc. 2007; 2(6):1467-78.
[Nat Protoc. 2007]Development. 1993 Sep; 119(1):19-29.
[Development. 1993]Dev Cell. 2006 Dec; 11(6):831-44.
[Dev Cell. 2006]BMC Bioinformatics. 2004 Nov 18; 5():178.
[BMC Bioinformatics. 2004]Dev Cell. 2006 Dec; 11(6):775-89.
[Dev Cell. 2006]Biochemistry. 1999 Apr 20; 38(16):5138-46.
[Biochemistry. 1999]Biochemistry. 2000 Aug 8; 39(31):9092-8.
[Biochemistry. 2000]Nucleic Acids Res. 1993 Aug 25; 21(17):3951-7.
[Nucleic Acids Res. 1993]Proc Natl Acad Sci U S A. 2006 Aug 8; 103(32):12027-32.
[Proc Natl Acad Sci U S A. 2006]Cell. 2008 Jun 13; 133(6):1106-17.
[Cell. 2008]Development. 2006 Jul; 133(14):2639-48.
[Development. 2006]Cell. 2006 Mar 24; 124(6):1241-53.
[Cell. 2006]Dev Cell. 2006 Apr; 10(4):441-9.
[Dev Cell. 2006]Cell. 1997 Aug 8; 90(3):449-58.
[Cell. 1997]Dev Cell. 2005 Feb; 8(2):203-13.
[Dev Cell. 2005]Genome Biol. 2007; 8(7):R145.
[Genome Biol. 2007]Neuron. 2005 Jan 20; 45(2):207-21.
[Neuron. 2005]Curr Biol. 2008 Jun 3; 18(11):831-7.
[Curr Biol. 2008]Dev Cell. 2006 Dec; 11(6):775-89.
[Dev Cell. 2006]Cell. 1997 Aug 8; 90(3):449-58.
[Cell. 1997]Dev Biol. 2000 Oct 1; 226(1):34-44.
[Dev Biol. 2000]Cell. 1990 Feb 23; 60(4):565-75.
[Cell. 1990]Neuron. 1996 Mar; 16(3):501-14.
[Neuron. 1996]Development. 1996 Feb; 122(2):589-97.
[Development. 1996]Cell. 2005 Sep 23; 122(6):947-56.
[Cell. 2005]Nat Genet. 2006 Apr; 38(4):431-40.
[Nat Genet. 2006]Dev Cell. 2006 Dec; 11(6):775-89.
[Dev Cell. 2006]Cell. 1996 Dec 27; 87(7):1237-47.
[Cell. 1996]Mech Dev. 2002 Mar; 112(1-2):25-36.
[Mech Dev. 2002]Genesis. 2000 Jan; 26(1):77-85.
[Genesis. 2000]Development. 1993 Sep; 119(1):1-17.
[Development. 1993]Dev Cell. 2008 Apr; 14(4):535-46.
[Dev Cell. 2008]EMBO J. 2004 Nov 10; 23(22):4495-505.
[EMBO J. 2004]Dev Cell. 2006 Dec; 11(6):831-44.
[Dev Cell. 2006]Proc Natl Acad Sci U S A. 2004 Apr 13; 101(15):5559-64.
[Proc Natl Acad Sci U S A. 2004]Neuron. 1995 Dec; 15(6):1245-58.
[Neuron. 1995]Genes Cells. 1996 Aug; 1(8):765-74.
[Genes Cells. 1996]EMBO J. 1989 Dec 1; 8(12):3553-62.
[EMBO J. 1989]Dev Cell. 2006 Dec; 11(6):775-89.
[Dev Cell. 2006]Dev Cell. 2006 Dec; 11(6):775-89.
[Dev Cell. 2006]Dev Cell. 2006 Dec; 11(6):775-89.
[Dev Cell. 2006]Proc Natl Acad Sci U S A. 2001 Apr 24; 98(9):5116-21.
[Proc Natl Acad Sci U S A. 2001]