Logo of plosonePLoS OneView this ArticleSubmit to PLoSGet E-mail AlertsContact UsPublic Library of Science (PLoS)
PLoS One. 2009; 4(9): e6804.
Published online Sep 3, 2009. doi:  10.1371/journal.pone.0006804
PMCID: PMC2731164

The FunGenES Database: A Genomics Resource for Mouse Embryonic Stem Cell Differentiation

Mai Har Sham, Editor

Abstract

Embryonic stem (ES) cells have high self-renewal capacity and the potential to differentiate into a large variety of cell types. To investigate gene networks operating in pluripotent ES cells and their derivatives, the “Functional Genomics in Embryonic Stem Cells” consortium (FunGenES) has analyzed the transcriptome of mouse ES cells in eleven diverse settings representing sixty-seven experimental conditions. To better illustrate gene expression profiles in mouse ES cells, we have organized the results in an interactive database with a number of features and tools. Specifically, we have generated clusters of transcripts that behave the same way under the entire spectrum of the sixty-seven experimental conditions; we have assembled genes in groups according to their time of expression during successive days of ES cell differentiation; we have included expression profiles of specific gene classes such as transcription regulatory factors and Expressed Sequence Tags; transcripts have been arranged in “Expression Waves” and juxtaposed to genes with opposite or complementary expression patterns; we have designed search engines to display the expression profile of any transcript during ES cell differentiation; gene expression data have been organized in animated graphs of KEGG signaling and metabolic pathways; and finally, we have incorporated advanced functional annotations for individual genes or gene clusters of interest and links to microarray and genomic resources. The FunGenES database provides a comprehensive resource for studies into the biology of ES cells.

Introduction

Stem cells hold great promise for tissue repair after injury or as a result of disease[1]. Studies in animal models and clinical trials indicate that stem cells and their progeny may replace damaged tissue improving organ recovery and function [2], [3]. For this reason, understanding the programs controlling self-renewal and differentiation of stem cells may facilitate the development of tools to unlock their regenerative potential. To this end, mouse embryonic stem (ES) cells offer an accessible and pertinent model system because they give rise to many different cell types in a reproducible manner, can be propagated practically indefinitely, have relatively stable karyotypes, and are easy to genetically manipulate [4][7]. Moreover, ES cell differentiation in vitro recapitulates events that take place during early embryonic development including the formation of the three germ layers of ectoderm, mesoderm and endoderm, and the emergence of endothelial, hematopoietic, cardiac, neuronal and hepatic or pancreatic cells [8], [9].

Functional studies have highlighted the critical roles of genes such as Oct4 (Pou5f1), Nanog and Sox2 in the maintenance of ES pluripotency and suppression of differentiation pathways [10][19]. Chromatin immunoprecipitation and chip analyses revealed that both active and silenced genes in ES cells are directly bound by one or more of these three proteins [17], [19]. The recent discoveries of new pluripotency factors including Klf4, Sall4, Zfp206, Esrrb, Tcl1, Tbx3 and Zfx suggest that the expansion and fate of ES cells follows a complex course requiring the coordinated action of a number of yet to be characterized genes [20][26].

The links between the many genes involved in the maintenance of pluripotency and the regulation of ES cell differentiation programs are not well characterized. Microarray studies have the potential to piece together groups of co-regulated genes and thus lead to the discovery of novel components of genetic pathways in ES cells. In recent years, a number of genome-wide approaches have identified transcripts present in mouse and human ES cells or their differentiated derivatives using a variety of gene expression profiling methods [24], [27][32]. This wealth of information also underscored a degree of variability and “biological noise” among data sets [33], [34].

The “Functional Genomics in Embryonic Stem Cells” consortium comprising 20 research groups (acronym FunGenES; http://www.fungenes.org) has analyzed the transcriptome of ES cells under a series of diverse stimuli during growth expansion and differentiation. Besides information gathered to answer specific experimental questions, as determined by the interests of individual partners [35][41], the collective data offered the opportunity to search for coordinated gene expression patterns in a systematic exploration of the mouse ES transcriptome under a battery of different experimental settings, thus minimizing possible site-specific artifacts. The results have been organized in an interactive, open-access database with a number of novel features and search tools to promote studies into the biological properties of embryonic stem cells.

Results

Coordinated analysis of the mouse ES cell transcriptome

The FunGenES consortium collected gene expression profiling data from mouse ES cells in a coordinated fashion by streamlining techniques and standardizing experimental protocols among partners. To this end, consortium members selected three ES cell lines (CGR8, E14TG2a and R1) for common use; ES cell clones were karyotyped and tested by alkaline phosphatase staining before being distributed to most of the consortium groups. A number of laboratories shared serum batches and used a common LIF source. Finally, RNA samples were prepared following the same procedure and subsequent microarray analyses were performed in a central facility using Affymetrix Mouse 430 v.2 arrays.

The configuration of each of the eleven individual experiments and the RNA samples collected are summarized in Table 1. In brief, the studies consisted of seven analyses on gene regulation in undifferentiated ES cells, focusing on LIF targets, Stat3 and PI3K regulated genes, as well as global gene expression changes through epigenetic mechanisms; and, four studies where ES cells were allowed to differentiate in monolayers or as embryoid bodies. Differentiation took place either in control culture media, or in the presence of various agents including retinoic acid (RA), Fibroblast Growth Factor-2 (FGF2) and Wnt pathway activators. Detailed descriptions of the individual experimental settings are included in the Supplemental File S1. The total number of tested conditions was 67, each performed in up to six, separate, biological replicates using a total of 258 Affymetrix arrays.

Table 1
Outline of the eleven experimental data sets in the FunGenES study.

Comparison of gene expression profiles showed a low number of differentially expressed transcripts among the three ES cell lines. Using 5% false discovery rate in ANOVA calculation in any of the 3 comparisons (CGR8 vs. E14TG2a vs. R1), there are 137 genes (0.9% of the analyzed transcripts) that show a 2-fold difference or higher in expression levels among the three lines; 34 of these genes are >2-fold higher or lower expressed in E14TG2a, 5 in CGR8 and 11 in R1 cells.

Organizational design and special features of the FunGenES database

To enhance the analytical power of the collected information, facilitate data mining and provide public access of the consortium results to the scientific community, the expression data have been organized in an open, interactive database (http://biit.cs.ut.ee/fungenes/) with a number of original features and tools (Figure 1). These include: a) Global Clusters that consist of a small, tight subset of genes that are co-expressed under the entire spectrum of experimental conditions; b) Time Series of gene expression profiles during successive days of standard ES cell differentiation; c) Specific Gene Classes based on hierarchical clustering of transcriptional factors and ESTs; d) Expression Waves of genes with characteristic expression profiles during ES cell differentiation, juxtaposed to waves of genes that behave in the exact opposite way; e) Pathway Animations that illustrate dynamic changes in the components of individual KEGG signaling and metabolic pathways viewed in time-related manner; and, f) Search Engines to display the expression pattern of any transcript, or groups of transcripts, during the course of ES cell differentiation, or to query the association of candidate genes with various FunGenES database clusters. In addition, there are cross-links to annotate and characterize these genes in the context of other relevant genomic and stem cell resources.

Figure 1
Outline of the FunGenES database.

Gene expression profiles are provided for all RNA samples combined, or separately for the CGR8 and E14TG2a ES cell lines. The list of genes belonging to a cluster together with the heatmaps of individual transcripts, appear by clicking on the corresponding cluster. The heatmaps of gene clusters or single genes can be displayed in different color codes or configured using a range of analytical parameters using the ExpressView tool. With subsequent marking of any gene, or groups of genes, it is possible to zoom in to the clustering visualization. In addition, when a subset of genes is selected, it is possible to access functional analysis and other relevant resources via the URLMAP link aggregator. This provides crosslinks to external resources such as NCBI Entrez, Ensembl, iHOP, Pubgene, MEM - Multi Experiment Matrix, and a number of genomics and stem cell databases. There is also a link to the g:Profiler tool that provides functional annotation to assess the biological classification of transcripts with specific expression patterns [42]. Terms of description in g:Profiler include GO categories [43], KEGG [44] and Reactome pathways [45], miRBase microRNA information [46], and TRANSFAC motifs [47]. In addition to functional explanations, g:Profiler provides convenient tools for dealing with different gene identifiers and finding orthologs from other organisms.

Identification of gene sets with similar expression profiles across all tested experimental conditions

The synchronized genomic analyses among consortium partners presented the opportunity to search for coordinately expressed genes, either during ES cell differentiation, or in response to various stimuli. Towards this goal, we mined the genomics data to identify sets of genes, the expression of which performed in the same way over the entire spectrum of experimental conditions.

In order to facilitate the interpretation of the bioinformatic output, and enhance the biological significance of the computational data, we pre-selected probe sets corresponding to previously characterized genes. The initial focus on known genes with common expression profiles across many conditions, allowed us to interpret differences between conditions, as well as to identify specific core groups of genes that could serve as anchor-points for mapping gene function in future analyses. Specifically, we applied exclusion criteria to screen out transcripts without annotation and of unknown origin, as well as hypothetical transcripts or proteins. This selection reduced the number of transcripts from 45,101 to 32,020. We then removed redundant probe sets, and probe sets that showed minor differences in expression levels across all tests setting the standard deviation of the log2 signal over 67 conditions to less than 0.45. The selection criteria brought the number of transcripts used for cluster analysis to 5,959.

Unsupervised hierarchical clustering of the 5,959 genes, using 100 random permutations, gave rise to 115 groups, containing a total number of 2,855 transcripts, with a probability of 95% or higher that clustering was not random (Supplemental File S2). Eighteen clusters had >20 transcripts, fifteen clusters contained between 10–19 transcripts, whereas the remaining eighty two clusters had 3–9 members. The heatmaps of the eighteen largest clusters with >20 transcripts are shown in Figure 2. The heatmaps and the complete list of genes belonging to each cluster, ordered by cluster size, are available in Supplemental File S3 and in the FunGenES database under the heading “Global Clusters”.

Figure 2
Global hierarchical clustering analysis of the FunGenES microarray data.

The functional annotations of all the clusters with ≥10 transcripts, which were obtained using the on GO classification categories of the g:Profiler tool for all the genes in each cluster, are shown in Table 2 (for downregulated genes during ES cell differentiation) and Table 3 (for upregulated genes). Inspection of the data illustrates that in many instances, hierarchical clustering grouped genes that have been functionally associated with particular developmental and/or cellular processes. For example, clusters containing genes that are upregulated during the course of ES cell differentiation (Table 3) include in order of time of expression: cluster 30 that represents genes which take part in the formation of the three embryonic germ layers during gastrulation, i.e., Goosecoid, Cerberus like 1 homolog, Wnt3, Mesp1, Mixl1, mEomes and Even-skipped 1; cluster 15 containing molecular regulators of early mesoderm development including Bmp2, Bmp5, Msx1, Msx2, Cripto, Tbx20, Hey2, Smad6, Vegfr2 (Kdr), Foxf1 and Hand1; cluster 20, which comprises regulatory and structural genes linked to hemopoiesis such as Gata1, Nfe2, Klf1, Tie1, hemoglobins (Hba-x, Hbb-b1) and Glycophorin A; cluster 12, which is rich in genes involved in cardiac development, e.g., Mef2c, Myl4, cardiac Troponin T2, Tropomodulin 1, myosin binding protein C, Bves, Angiopoietin 1 and Angiopoietin 2; and, cluster 4, which consists mostly of genes associated with neuronal development and differentiation, for example, Neurog1, Neurog2, Olig2, Nkx6.1, Neurod4, Pou3f2, Pou3f4, Cacna2d3, Cacng4, Kcnq2 and EphA5. The average expression pattern of all the genes in these clusters is depicted in Figure 3A.

Figure 3
Average expression profiles of selected Global Clusters.
Table 2
Functional annotation of Global Clusters of downregulated genesa.
Table 3
Functional annotation of Global Clusters of upregulated genesa.

Taking into account that ES cells are isolated at embryonic day 3.5 post fertilization, the sequential appearance of genes specific for gastrulation, mesoderm formation, hemopoiesis, cardiopoiesis and neurogenesis during ES cell differentiation follows the timing of comparable developmental stages in embryonic development. For example, the transient expression of cluster 30 genes at day 3 in vitro, which corresponds to embryonic day E6.5 (3.5+3), matches the expression timing of genes such as Cerberus-like 1 and Wnt3 in vivo [48], [49]. In a similar manner, the induction of hematopoietic (cluster 20, day 3.5+4 = E7.5) and cardiovascular-specific (cluster 12, 3.5+5 = E8.5) genes follows the chronological order of the appearance of blood islands and the formation of the heart tube during embryonic development [50], [51].

In contrast to the complex induction scheme of clusters representing upregulated genes, clusters containing genes that decrease upon differentiation form fewer clusters that fall mainly in two categories, of genes suppressed early, at the onset of differentiation (clusters 3 and 18), and of genes downregulated in more gradual fashion (clusters 1, 8 and 13; Figure 3B). Downregulated clusters include mostly genes that take part in cell cycle, proliferation and metabolism, as well as genes that have been implicated in the maintenance of ES cell pluripotency (Table 2). For instance, Cluster 1 contains genes such as cyclin A2, cyclin B1, cyclin E1, cyclin F, polymerase alpha 2, RNA polymerase II polypeptide H, and RNA polymerase III polypeptide G, whereas Cluster 3 genes include Nanog, Sox2, Pou5f1 (Oct4), Klf2, Zpf42 (Rex1) and Esrrb.

The validation rate of the microarray expression profiling data was 90.7%, based on results obtained independently in eleven consortium laboratories. In brief, 330 of 364 genes, tested by quantitative or conventional PCR, gave comparable expression patterns to the data obtained by microarray analysis. A representative comparison of expression profiles obtained by Q-PCR and array analysis for fifteen genes, belonging to five of the clusters depicted in Figure 3, is shown in Supplemental File S4.

Since there is a higher than 95% chance that cluster assignments are accurate (Supplemental File S2), and our validation analysis shows that 90.7% of the array expression patterns match the RNA analysis results using other techniques (e.g., Q-PCR), we estimate that more than 86% of the genes in a cluster follow the corresponding average expression profile. It is likely that these genes are components of related molecular or cellular pathways, or they might be targets of common regulatory mechanisms, or both [52][54]. Next to well-characterized genes, clusters often contain transcripts the function of which is poorly understood. Our analysis predicts that the latter participate in the same biological processes as the known genes in the corresponding clusters – thus providing a starting point to study the function of poorly characterized transcripts.

Time Series and Specific Gene Classes of the FunGenES Database

To better visualize changes in gene expression programs during differentiation, we performed k-means clustering analysis, followed by hierarchical clustering, to group genes by their timing of induction or suppression during the normal ES cell differentiation process. For this purpose, we used the data from a subset of consortium samples representing untreated states without additional stimuli (26 conditions listed in Supplemental File S5) and, we included transcripts with significant differential expression among samples (standard deviation>0.45). The resulting “Time Series”, containing 8,211 genes, have been organized in 50 Concise (Figure 4) and 200 Analytical clusters.

Figure 4
Time Series and Specific Gene Classes of the FunGenES database.

The “Time Series” clusters expanded the number of genes that follow a specific expression pattern revealed in the previous global hierarchical clustering. For example, cluster 5 of the concise “Time Series”, which consists of transiently induced genes around day 3 of differentiation, similarly to Global Cluster 30, contains the same transcripts, but in addition, it also includes T-brachyury, Axin2, Mesp2, Fgf8, Wnt8a, Sp5, Sp8, Follistatin, Mix1 and Lim1. These genes have been also implicated in the gastrulation phase of embryogenesis [55] indicating that “Time Series” clusters provide a comprehensive collection of genes expressed at specific stages of ES cell differentiation and early embryonic development.

To assist searches of interconnected circuits of gene expression regulators, we carried out clustering of genes related to transcriptional activation (Figure 4). Finally, we analyzed ESTs separately to distinguish the ones that are expressed in ES cells or during the differentiation process. From approximately 12,000 ESTs, only 1,027 show a specific expression pattern (8.6% of all ESTs present in the Affymetrix 430 2.0 microarray). This is in contrast to known genes where 21% have a particular pattern, possibly because a number of ESTs included in the microarrays are cloning artifacts. However, the remaining 1,027 ESTs might represent novel transcripts with potentially important functions in stem cell biology and embryonic development. The 1,027 ESTs have been grouped in 50 clusters based on their timing of appearance (Figure 4). About half are expressed specifically in ES cells, the rest in ES cell derivatives. Transcription factor and EST clusters can be accessed through the “Specific Gene Classes” window of the FunGenES Database.

Gene “Expression Waves”

To better illustrate and map co-regulated genes with different activation and deactivation profiles, the levels of every transcript have been assigned to graphs of “Expression Waves” that follow a particular, predetermined, expression pattern (Figure 5). The names of genes belonging to the corresponding “Expression Wave” are included below the graph. The graph and gene content representing transcripts expressed in the opposite manner is available on the same page for side-by-side comparisons. In this way, it is possible to search the database for groups of potentially interconnected genes as a starting point to decipher regulatory networks of transcription factors, signaling molecules and membrane receptors, or for indications of genes that might be co-regulated by the same genetic pathways.

Figure 5
The “Expression Waves” tool of the FunGenES database.

Search Engines of the FunGenES database and links to external databases

To maximize the analytical power of the database and integrate it with the existing genomic and stem cell resources, we included the “Study your Gene(s) of Interest” search engine. For any gene of interest, or group of genes, it provides via URLMAP links to display the expression profile across the entire spectrum of the FunGenES data. This provides electronic analysis of the expression profile of any gene(s) in mouse ES cells and during the subsequent stages of differentiation by using standard abbreviated gene names, Affymetrix probe set IDs, or any identifier supported by the Ensembl database. An example of the expression profiles for the 19 members of the Wnt protein family of morphogens in ES cells and during the first 10 days of differentiation, obtained using the FunGenES search engine, is shown in Figure 6. The search tool also provides a fast assessment of expression profiling data obtained by RT-PCR or other techniques. The design allows easily addition of future data sets to expand and update the analytical power of the search engines.

Figure 6
The FunGenES database Search Engine for gene expression profiles during ES cell differentiation.

In addition to the visualization of expression profiles during ES cell differentiation, the search engine provides links to analyze the selected genes using many publicly available tools and resources. As mentioned above, such links include external resources such as NCBI Entrez, Ensembl, iHOP, Pubgene, MEM - Multi Experiment Matrix, and a number of genomics and stem cell databases. Moreover, the g:Profiler toolset provides functional annotation.

Pathway Animations

To examine the action of individual pathways in toto during ES cell differentiation, the FunGenES database was given an additional feature called “Pathway Animations” that depict dynamic changes in specific genetic, signaling or metabolic pathways viewed in time-related animations based on the KEGG annotation [44], [56]. The resource also offers a set of tools that allow the users to reanimate the graphs by selecting specific time points and/or subsets of pathway components.

Figure 7 depicts a stationary view of the KEGG pathways for “Cell Cycle” and “Apoptosis” at three time points; it appears that ES cells (day 0) have higher numbers of expressed genes involved in cell cycle (rectangles in red color) compared to differentiated cells (day 10). Almost all of the genes expressed at day 0 have been silenced by day 10 (green) and replaced by a new set of genes. The extensive changes in the expression profile from ES cells (day 0) to differentiated cells at day 10 are suggestive of a broad overhaul of the self-renewal machinery. The “Cell Cycle” animated pathway shows that genes encoding regulators of DNA replication are expressed at high levels in pluripotent, self-renewing ES cells and are progressively down regulated during differentiation. They include genes of the origin of replication complex (orc), the minichromosome maintenance (mcm), and the cell division cycle (cdc) families. Genes involved in DNA damage control and inhibition of DNA synthesis [49], in particular Atm, Chk1 and Chk2, are also highly expressed in ES cells, but decline during differentiation. These changes are indicative of the active replication machinery and the tightly controlled replication fidelity in proliferating ES cells [57][59].

Figure 7
Snap shots from the animated Cell Cycle and Apoptosis KEGG pathways.

Undifferentiated ES cells are also characterized by elevated levels of transcripts encoding the G1/S transition-promoting complex cyclin E1:Cdk2 and, by contrast, low levels of transcripts encoding D-type cyclins and Cdk4/6 inhibitors of the INK4 family (p15, p16, p18, p19; Figure 7). Differentiation is associated with a decrease in cyclin E1 and a concomitant elevation in D-type cyclins and cdk inhibitor transcripts [57], [60]. These results are likely due to the progressive switch from a cyclin E-based autonomous cell cycle, which characterizes self-renewing ES cells, to the D-type cyclins/Retinoblastoma (Rb) protein-regulated somatic cell cycle [61].

Conversely, few pro-apoptotic genes are expressed in ES cells (day 0; most boxes appear in green), but many are gradually induced during the differentiation process showing the exact opposite pattern from the genes involved in cell proliferation (Figure 7). As observed for cell-cycle genes, there is minimal overlap between apoptosis-associated genes expressed at days 0 and 10. This strikingly complementary pattern suggests a reciprocal interrelationship between the balance of pro- and anti-apoptotic genes in ES cells and their differentiated progeny.

Discussion

Functional analyses using loss-of-function and protein-protein interaction approaches, as well as bioinformatics tools, have began to piece together the regulatory networks active in ES cells [24], [62]. Furthermore, genome-wide studies, combining chromatin immunoprecipitation (ChIP) and array hybridization (ChIP-on-chip), have revealed that both active and silenced genes are directly bound in ES cells by one or more of the core pluripotency factors Oct4, Sox2 and Nanog [17], [19], [63].

However, it appears that the actual core factor set regulating pluripotency and early differentiation in ES cells is larger and more highly interconnected than previously suspected. Kim et al. have performed a genome-wide analysis of target promoters for nine transcription factors, namely Oct4, Sox2, Klf4, c-Myc, Nanog, Dax1, Rex1, Zpf281, and Nac1 [64]. They found that target promoters bound by a single or few factors tend to be inactive or repressed, whereas promoters bound by more than four factors are active in the pluripotent state and become repressed upon differentiation. Interestingly, targets of Myc or Rex1 are implicated in protein metabolism, whereas targets of the other factors are enriched in genes involved in developmental processes. The results also established a hierarchy within the key pluripotency factors such that Klf4 serves as an upstream regulator of feed-forward circuits involving Oct4, Sox2, Nanog and Myc.

The increasing complexity of gene regulatory networks emerging from these studies, combined with the surging amount of genomics and proteomics work, underscore the need for resources that would enable the scientific community to readily mine available and prospective data. The FunGenES database provides such a template with a number of tools including Animation of KEGG Pathways, Expression Waves, Time Series, Specific Gene Classes, such as ESTs and transcription factors, and searches for the expression pattern of any gene or transcript during ES cell differentiation using standard gene names and IDs. Search results are linked to: comprehensive annotation tools using the g:Profiler tool, which includes the presence of common regulatory motifs in promoter areas and miRNA targeting information; and, to available resources such as NCBI Entrez, Ensembl, etc.

Genomic studies, which in principle group together co-regulated genes, can potentially identify new components of known regulatory pathways in ES cells that can subsequently be explored in functional studies. In addition to well-described genes, clusters often contain transcripts the function of which has not yet been associated with a specific biological process thus providing novel unexplored links to known molecular pathways.

Although the database described here was based on the gene expression profiling results of the FunGenES consortium, it can be easily adapted to incorporate available or future genomics data obtained in ES cells. Moreover, the analytical paradigms and expression pattern clusters presented here could provide a scaffold for comparative analyses with human ES cell lines. This information will be particularly important for future evaluation of ES-like induced pluripotent stem (iPS) cells reprogrammed from somatic tissues that can be potentially used to derive pancreatic cells, cardiomyocytes or neurons for organ regeneration [21], [65][67]. For example, the g:Profiler tool provides the possibility to convert mouse Affymetrix probe set numbers to any Affymetrix probe set numbers from other organisms, allowing gene profiling comparisons among data sets generated in different species. This tool also allows conversion of previous Affymetrix probe set numbers (i.e., the first generation of Affymetrix microarrays - U74v2) to the more recent microarray probe set numbers (like the MG430v2 used in this study).

During the last years, a growing number of repositories of microarray data and other forms of gene expression profiles for stem cell research have been developed [68][71]. Data presentation is heterogeneous and ranges from: simple storage of expression data and experiment information (StemDB); presentation of lists of specific regulated transcripts (HESC); specific analysis results of a closed dataset (SCDb); or storage and visualization of variable resources with correlative and mutual information about single transcripts (one to many relationship, StemBase) [68][70]. To facilitate data comparison between the FunGenES database and other resources, we have included a series of links to other Stem cell databases, i.e., to SCDb, Amazonia in the Study your Gene(s) of Interest search engine. This way it is possible to obtain and compare the expression pattern of genes in the FunGenES database to the expression profiles in other tissues, experimental settings, or different stem cells types.

In contrast to existing microarray database resources, the FunGenES database includes a state of the art tools for the interactive visualization of gene to gene relationships. It provides gene lists and hierarchical matrices using co-expression analysis by distance-base clustering (k-means, hierarchical clustering), as well as integrated gene expression analyses by mapping observed gene expression changes onto specific signaling and metabolic pathways. We expect that not only regenerative medicine applications, but also basic science studies will benefit from the resources described here, especially when compared to expression profiling data obtained from loss- and gain-of-function approaches [19], [27], [72]. Furthermore, the assignment of ESTs and genes to specific pathways provide a fresh collection of novel components that can be further explored in functional assays during embryonic development and in human diseases.

Materials and Methods

RNA isolation and microarray hybridization

Total RNA was isolated using the RNeasy Mini Kit from Qiagen and treated with RNase-free DNase I (5 units/100 µg of nucleic acids, Sigma). Biotinylated cRNA was prepared according to the standard Affymetrix protocol [73]. In brief, double-stranded cDNA was synthesized from 10 µg total RNA using the SuperScript Choice System (Invitrogen) and the Affymetrix T7-(dT)24 primer. Following phenol/chloroform extraction and ethanol precipitation, the cDNA was transcribed into biotin-labeled cRNA using T7 polymerase (Ambion MEGAScript T7). cRNA products were purified using the RNeasy kit (Qiagen) and fragmented to an average size of 30–50 bases according to Affymetrix recommendations. 15 µg of fragmented cRNA were used to hybridize to the Mouse Genome 430 version 2.0 Array for 16 hrs at 45°C. The arrays were washed and stained in the Affymetrix Fluidics Station 450 and scanned using the Affymetrix GeneChip Scanner 3000 7G. The image data were analyzed with the GeneChip® Operating Software (GCOS) 1.4 using Affymetrix default analysis settings. Arrays were normalized by the log scale robust multi-array analysis (RMA) [74].

We used 258 Affymetrix GeneChips to analyze 67 individual experimental conditions (outlined in Table 1). A detailed description of the individual experiments is provided in Supplemental File S1. The eleven microarray data sets have been annotated in a MIAME compliant manner and deposited in EBI ArrayExpress (http://www.ebi.ac.uk/microarray-as/ae/. The accession numbers are as follows: AVEF-1: E-TABM-669, CNRS-UMR-5164: E-TABM-667, CNRS-UMR-6543: E-TABM-668, IMBB-1: E-TABM-670, INS-1: E-TABM-562, INS-2: E-TABM-671, IPK-1: E-TABM-493, TUD-1: E-TABM-675, UKOE-1: E-TABM-672, UOB-1: E-TABM-673, UOB-2: E-TABM-674).

Each array was checked for general assay quality (3′-5′ ratio of Gapdh <1.5, noise (RawQ) <4 and scaling factor at a TGT value of 200 <4). The robust multi-array average (rma) normalization (background-adjustment, quantile normalization and median polish summarization) has been performed using RMAExpress version 1.0 beta 4. In addition, we assessed data integrity by calculating Pearson correlation z-values over the complete dataset of 45,101 probe sets. The difference between array to array correlation within biological replicates (z = 2.73±0.38) and between non replicates (z = 1.90±0.35) indicates that there is a sufficiently high signal to noise ratio.

Comparison of gene expression profiles in the three ES cell lines

For comparison analysis, from the 45,101 probe sets represented on the Mouse 430 version 2 array, we selected 30,526 gene-associated transcripts (eliminating transcripts without annotation and of unknown origin, as well as hypothetical transcripts or proteins). In addition, for genes represented multiple times on the microarray, we selected the transcript with the strongest average signal as representative for the respective gene. This brought the number of analyzed transcripts to n = 15,263. A 5% false discovery rate in ANOVA calculation and a 2-fold difference or higher in any of the 3 comparisons (CGR8 vs. E14TG2a vs. R1) led to a set of 137 differentially expressed genes (0.9% of the analyzed transcripts).

Data preparation for unsupervised hierarchical clustering – Global Clusters

The first step in our data analysis was to average the biological replicates for each of the 67 experimental conditions. To identify genes that cluster together under the tested conditions, we excluded probe sets with a standard deviation in expression values of < log2 (0.45) from the vector mean. We then removed redundant gene/probe sets taking into account the ENTREZ, Unigene and RefSeq gene-id annotations. Among redundant probe sets, we selected the probe set with the highest average expression signal. We also removed probe sets of unknown origin, for example RIKEN sequences, or hypothetical transcripts/proteins. These criteria led to a data set of 5,959 transcripts.

Unsupervised hierarchical clustering

Correlation of differentially expressed transcripts was detected by hierarchical clustering of expression values with the Cluster version 2.11 software [52] applying mean centering and normalization of genes and arrays before the computational clustering analysis. Average linkage hierarchical clustering of the data was carried out as described [75].

Permutations

The correlation significance of expression profiles between probe sets was assessed empirically by one hundred rounds of random permutations. For each round, the 67 values for each probe set were randomly redistributed [76] and data sets clustered as described [75]. The best similarity scores of each permuted and clustered data set was collected to evaluate the 95th percentile of significant clusters in the original data set. 5% of the permuted data sets gave rise to clusters containing no more than two genes at a similarity score >0.85071 (Supplemental File S6). Clusters with 3 or more genes (115 clusters) were documented and selected for further analysis.

Clustering of the data in Time Series

Besides the clustering described above that was based on the entire spectrum of experimental conditions, expression data were clustered according to timing of expression in a two-step strategy. First, probe sets were clustered with k-means into a small number of clusters using chord distance (Euclidean distance over vectors normalized to unit sphere). In a second step, the resulting clusters, represented by mean profiles, were clustered using average linkage hierarchical clustering with Pearson correlation distance measure, and visualized in a heatmap representation [52]. No filtering besides removing genes with low variation was applied to these data sets.

Expression Waves

We developed a method to identify all genes that have characteristic expression patterns during ES cell differentiation. In brief, transcripts were included into a particular expression wave represented by a single artificial template, if its correlation with the specific pattern was higher than a certain threshold and also highest among all other patterns. This analysis was done in two different stringent conditions with correlation thresholds of 0.8 and 0.85. The results are presented in a series of graphs, with the list of genes that belong in the depicted pattern identified below. Each graph is juxtaposed to its “mirror image”, i.e., the graph representing genes that behave exactly the opposite way.

Pathway Animations

We designed animations of pathways in the Kyoto Encyclopedia of Genes and Genomes (KEGG) [44]. The animations use diagrams available at the KEGG webpage, which portray connections between pathway components. The expression levels of relevant genes are shown in the diagrams by the standard red (high) – green (low) color codes. In case a gene family represents a particular pathway step, the corresponding box displays the expression patterns of all individual members of the family in adjacent vertical stripes. Each stripe may be further divided horizontally depicting the expression patterns of different probe sets corresponding to the same transcript.

“Study your Gene(s) of Interest”

This feature has been designed to allow investigators to search and display the expression of any probe set during ES cell differentiation based on the FunGenES data sets. The program draws clustered heatmaps with the columns annotated with relevant sample information. The search engine recognizes common gene identifiers; the conversion to Affymetrix probe set IDs is done using Ensembl BioMart [77] mappings. The heatmap representation is based on the ExressView tool, which is linked to the URLMAP, to provide further analysis options for selected genes. The organization of the various FunGenES tools is depicted in Supplemental File S7.

Supporting Information

Supplemental File S1

Detailed overview of the microarray experimental designs & Contact Information

(0.24 MB PDF)

Supplemental File S2

Yield of the unsupervised hierarchical clustering. Histogram of the number of clusters (y-axis) for each cluster size (x-axis). Clusters with more than 100 genes are listed separately on the top right corner.

(0.58 MB TIF)

Supplemental File S3

Gene content of the 115 Global Clusters

(0.40 MB XLS)

Supplemental File S4

Comparison of gene expression profiles obtained by Q-PCR (left panels) and microarray analysis (right panels). The gene name is depicted on top of the Q-PCR graph; the Affymetrix ID of the same gene marks the corresponding adjacent graph. CT: Cycle Threshold values of the Q-PCR analysis. Signal: normalized log2 signal values from the microarray analysis. Genes are organized according to the Global Cluster they belong as indicated. The results show comparable gene expression profiles between microarray and Q-PCR data.

(0.97 MB TIF)

Supplemental File S5

Experimental data sets used in Time Series

(0.05 MB PDF)

Supplemental File S6

Evaluation of significant correlations. Ranked plot of the best similarity scores (y-axis) of 100 permutated and clustered datasets (x-axis) and the evaluated 95th percentile of significant clusters (blue line). The results are given for cluster nodes with more than two (black line) or three (red line) cluster members.

(0.55 MB TIF)

Supplemental File S7

Schematic representation of the FunGenES Database depicting tools to view expression data sets and links to external resources and databases. Tools in boldface have been developed specifically for the FunGenES Database.

(0.79 MB TIF)

Footnotes

Competing Interests: The authors have declared that no competing interests exist.

Funding: The FunGenES Integrated Project was funded by a grant from the European Commission (6th Framework Programme, Thematic Priority: Life sciences, Genomics and Biotechnology for Health, Contract No. : FunGenES LSHG-CT-2003-503494; http://ec.europa.eu/grants/index_en.htm). H.B. and M.T. were also supported by the University Bordeaux 2 (http://www.u-bordeaux2.fr/index.jsp) and CNRS (http://www.cnrs.fr/); A.K.H. received NIH support, grant HL08395 (http://www.nih.gov/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Rando TA. Stem cells, ageing and the quest for immortality. Nature. 2006;441:1080–1086. [PubMed]
2. Singec I, Jandial R, Crain A, Nikkhah G, Snyder EY. The leading edge of stem cell therapeutics. Annu Rev Med. 2007;58:313–28.: 313–328. [PubMed]
3. Burt RK, Loh Y, Pearce W, Beohar N, Barr WG, et al. Clinical Applications of Blood-Derived and Marrow-Derived Stem Cells for Nonmalignant Diseases. JAMA. 2008;299:925–936. [PubMed]
4. Evans MJ, Kaufman MH. Establishment in culture of pluripotential cells from mouse embryos. Nature. 1981;292:154–156. [PubMed]
5. Martin GR. Isolation of a pluripotent cell line from early mouse embryos cultured in medium conditioned by teratocarcinoma stem cells. Proc Natl Acad Sci U S A. 1981;78:7634–7638. [PMC free article] [PubMed]
6. Thomas KR, Capecchi MR. Site-directed mutagenesis by gene targeting in mouse embryo-derived stem cells. Cell. 1987;51:503–512. [PubMed]
7. Smith AG. Embryo-derived stem cells: of mice and men. Annu Rev Cell Dev Biol. 2001;17:435–62.: 435–462. [PubMed]
8. Doetschman TC, Eistetter H, Katz M, Schmidt W, Kemler R. The in vitro development of blastocyst-derived embryonic stem cell lines: formation of visceral yolk sac, blood islands and myocardium. J Embryol Exp Morphol. 1985;87:27–45. [PubMed]
9. Keller G. Embryonic stem cell differentiation: emergence of a new era in biology and medicine. Genes Dev. 2005;19:1129–1155. [PubMed]
10. Schöler HR, Hatzopoulos AK, Balling R, Suzuki N, Gruss P. A family of octamer-specific proteins present during mouse embryogenesis: evidence for germline-specific expression of an Oct factor. EMBO J. 1989;8:2543–2550. [PMC free article] [PubMed]
11. Okamoto K, Okazawa H, Okuda A, Sakai M, Muramatsu M, et al. A novel octamer binding transcription factor is differentially expressed in mouse embryonic cells. Cell. 1990;60:461–472. [PubMed]
12. Nichols J, Zevnik B, Anastassiadis K, Niwa H, Klewe-Nebenius D, et al. Formation of pluripotent stem cells in the mammalian embryo depends on the POU transcription factor Oct4. Cell. 1998;95:379–391. [PubMed]
13. Niwa H, Miyazaki J, Smith AG. Quantitative expression of Oct-3/4 defines differentiation, dedifferentiation or self-renewal of ES cells. Nat Genet. 2000;24:372–376. [PubMed]
14. Chambers I, Colby D, Robertson M, Nichols J, Lee S, et al. Functional expression cloning of Nanog, a pluripotency sustaining factor in embryonic stem cells. Cell. 2003;113:643–655. [PubMed]
15. Mitsui K, Tokuzawa Y, Itoh H, Segawa K, Murakami M, et al. The homeoprotein Nanog is required for maintenance of pluripotency in mouse epiblast and ES cells. Cell. 2003;113:631–642. [PubMed]
16. Avilion AA, Nicolis SK, Pevny LH, Perez L, Vivian N, et al. Multipotent cell lineages in early mouse development depend on SOX2 function. Genes Dev. 2003;17:126–140. [PMC free article] [PubMed]
17. Boyer LA, Lee TI, Cole MF, Johnstone SE, Levine SS, et al. Core transcriptional regulatory circuitry in human embryonic stem cells. Cell. 2005;122:947–956. [PMC free article] [PubMed]
18. Boiani M, Scholer HR. Regulatory networks in embryo-derived pluripotent stem cells. Nat Rev Mol Cell Biol. 2005;6:872–884. [PubMed]
19. Loh YH, Wu Q, Chew JL, Vega VB, Zhang W, et al. The Oct4 and Nanog transcription network regulates pluripotency in mouse embryonic stem cells. Nat Genet. 2006;38:431–440. [PubMed]
20. Li Y, McClintick J, Zhong L, Edenberg HJ, Yoder MC, et al. Murine embryonic stem cell differentiation is promoted by SOCS-3 and inhibited by the zinc finger transcription factor Klf4. Blood. 2005;105:635–637. [PubMed]
21. Takahashi K, Yamanaka S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell. 2006;126:663–676. [PubMed]
22. Zhang J, Tam WL, Tong GQ, Wu Q, Chan HY, et al. Sall4 modulates embryonic stem cell pluripotency and early embryonic development by the transcriptional regulation of Pou5f1. Nat Cell Biol. 2006;8:1114–1123. [PubMed]
23. Wang J, Rao S, Chu J, Shen X, Levasseur DN, et al. A protein interaction network for pluripotency of embryonic stem cells. Nature. 2006;444:364–368. [PubMed]
24. Ivanova N, Dobrin R, Lu R, Kotenko I, Levorse J, et al. Dissecting self-renewal in stem cells with RNA interference. Nature. 2006;442:533–538. [PubMed]
25. Ogawa K, Shimosato D, Takahashi K, Yagi R, Toyooka Y, et al. Forced expression of Tbx3 promotes LIF-independent self-renewal of mouse ES cells. Dev Biol. 2007;306:391–392.
26. Galan-Caridad JM, Harel S, Arenzana TL, Hou ZE, Doetsch FK, et al. Zfx controls the self-renewal of embryonic and hematopoietic stem cells. Cell. 2007;129:345–357. [PMC free article] [PubMed]
27. Ivanova NB, Dimos JT, Schaniel C, Hackney JA, Moore KA, et al. A Stem Cell Molecular Signature. Science. 2002;298:601–604. [PubMed]
28. Ramalho-Santos M, Yoon S, Matsuzaki Y, Mulligan RC, Melton DA. “Stemness”: Transcriptional Profiling of Embryonic and Adult Stem Cells. Science. 2002;298:597–600. [PubMed]
29. Sato N, Sanjuan IM, Heke M, Uchida M, Naef F, et al. Molecular signature of human embryonic stem cells and its comparison with the mouse. Dev Biol. 2003;260:404–413. [PubMed]
30. Muller FJ, Laurent LC, Kostka D, Ulitsky I, Williams R, et al. Regulatory networks define phenotypic classes of human stem cell lines. Nature. 2008;455:401–405. [PMC free article] [PubMed]
31. Sekkai D, Gruel G, Herry M, Moucadel V, Constantinescu SN, et al. Microarray Analysis of LIF/Stat3 Transcriptional Targets in Embryonic Stem Cells. Stem Cells. 2005;23:1634–1642. [PubMed]
32. Cinelli P, Casanova E, Uhlig S, Lochmatter P, Matsuda T, et al. Expression profiling in transgenic FVB/N embryonic stem cells overexpressing STAT3. BMC Developmental Biology. 2008;8:57. [PMC free article] [PubMed]
33. Tu Y, Stolovitzky G, Klein U. Quantitative noise analysis for gene expression microarray experiments. PNAS. 2002;99:14031–14036. [PMC free article] [PubMed]
34. Assou S, Le Carrour T, Tondeur S, Ström S, Gabelle A, et al. A meta-analysis of human embryonic stem cells transcriptome integrated into a web-based expression atlas. Stem Cells. 2007;25:961–973. [PMC free article] [PubMed]
35. Doss MX, Winkler J, Chen S, Hippler-Altenburg R, Sotiriadou I, et al. Global transcriptome analysis of murine embryonic stem cell-derived cardiomyocytes. Genome Biol. 2007;8:R56. [PMC free article] [PubMed]
36. Doss MX, Chen S, Winkler J, Hippler-Altenburg R, Odenthal M, et al. Transcriptomic and phenotypic analysis of murine embryonic stem cell derived BMP2+ lineage cells: an insight into mesodermal patterning. Genome Biol. 2007;8:R184. [PMC free article] [PubMed]
37. Karantzali E, Schulz H, Hummel O, Huebner N, Hatzopoulos A, et al. Histone deacetylase inhibition accelerates the early events of stem cell differentiation: transcriptomic and epigenetic analysis. Genome Biol. 2008;9:R65. [PMC free article] [PubMed]
38. Potta SP, Liang H, Pfannkuche K, Winkler J, Chen S, et al. Functional characterization and transcriptome analysis of embryonic stem cell-derived contractile smooth muscle cells. Hypertension. 2009;53:196–204. [PubMed]
39. Mariappan D, Winkler J, Chen S, Schulz H, Hescheler J, et al. Transcriptional profiling of CD31(+) cells isolated from murine embryonic stem cells. Genes Cells. 2009;14:243–260. [PubMed]
40. Trouillas M, Saucourt C, Guillotin B, Gauthereau X, Ding L, et al. Three LIF-dependent signatures and gene clusters with atypical expression profiles, identified by transcriptome studies in mouse ES cells and early derivatives. BMC Genomics. 2009;10:73. [PMC free article] [PubMed]
41. Rolletschek A, Schroeder IS, Schulz H, Hummel O, Huebner N, et al. Characterization of mouse embryonic stem cell differentiation into the pancreatic lineage in vitro by transcriptional profiling, quantitative RT-PCR and immunocytochemistry. Int J Dev Biol. 2009 In press. [PubMed]
42. Reimand J, Kull M, Peterson H, Hansen J, Vilo J. g:Profiler–a web-based toolset for functional profiling of gene lists from large-scale experiments. Nucleic Acids Res. 2007;35:W193–W200. [PMC free article] [PubMed]
43. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, et al. Gene Ontology: tool for the unification of biology. Nat Genet. 2000;25:25–29. [PMC free article] [PubMed]
44. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30. [PMC free article] [PubMed]
45. Vastrik I, D'Eustachio P, Schmidt E, Joshi-Tope G, Gopinath G, et al. Reactome: a knowledge base of biologic pathways and processes. Genome Biology. 2007;8:R39. [PMC free article] [PubMed]
46. Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ. miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 2006;34:D140–D144. [PMC free article] [PubMed]
47. Wingender E, Dietze P, Karas H, Knuppel R. TRANSFAC: a database on transcription factors and their DNA binding sites. Nucleic Acids Res. 1996;24:238–241. [PMC free article] [PubMed]
48. Takaoka K, Yamamoto M, Hamada H. Origin of body axes in the mouse embryo. Current Opinion in Genetics & Development. 2007;17:344–350. [PubMed]
49. Barrow JR, Howell WD, Rule M, Hayashi S, Thomas KR, et al. Wnt3 signaling in the epiblast is required for proper orientation of the anteroposterior axis. Dev Biol. 2007;312:312–320. [PubMed]
50. Hatzopoulos A, Rosenberg RD. Embryonic development of the vascular system. In: Hare JA, Simons M, editors. Angiogenesis and Cardiovascular Disease. New York, Oxford: Oxford University Press; 1999. pp. 3–29.
51. Murry CE, Keller G. Differentiation of Embryonic Stem Cells toáClinically Relevant Populations: Lessons from Embryonic Development. Cell. 2008;132:661–680. [PubMed]
52. Eisen MB, Spellman PT, Brown PO, Botstein D. Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A. 1998;95:14863–14868. [PMC free article] [PubMed]
53. Brown MP, Grundy WN, Lin D, Cristianini N, Sugnet CW, et al. Knowledge-based analysis of microarray gene expression data by using support vector machines. Proc Natl Acad Sci U S A. 2000;97:262–267. [PMC free article] [PubMed]
54. Wu LF, Hughes TR, Davierwala AP, Robinson MD, Stoughton R, et al. Large-scale prediction of Saccharomyces cerevisiae gene function using overlapping transcriptional clusters. Nat Genet. 2002;31:255–265. [PubMed]
55. Tam PP, Loebel DA. Gene function in mouse embryogenesis: get set for gastrulation. Nat Rev Genet. 2007;8:368–381. [PubMed]
56. Adler P, Reimand J, Janes J, Kolde R, Peterson H, et al. KEGGanim: pathway animations for high-throughput data. Bioinformatics. 2008;24:588–590. [PubMed]
57. Stead E, White J, Faast R, Conn S, Goldstone S, et al. Pluripotent cell division cycles are driven by ectopic Cdk2, cyclin A/E and E2F activities. Oncogene. 2002;21:8320–8333. [PubMed]
58. Saretzki G, Armstrong L, Leake A, Lako M, von Zglinicki T. Stress defense in murine embryonic stem cells is superior to that of various differentiated murine cells. Stem Cells. 2004;22:962–971. [PubMed]
59. Hong Y, Cervantes RB, Tichy E, Tischfield JA, Stambrook PJ. Protecting genomic integrity in somatic cells and embryonic stem cells. Mutat Res. 2007;614:48–55. [PubMed]
60. Savatier P, Lapillonne H, van Grunsven LA, Rudkin BB, Samarut J. Withdrawal of differentiation inhibitory activity/leukemia inhibitory factor up-regulates D-type cyclins and cyclin-dependent kinase inhibitors in mouse embryonic stem cells. Oncogene. 1996;12:309–322. [PubMed]
61. White J, Stead E, Faast R, Conn S, Cartwright P, et al. Developmental activation of the Rb-E2F pathway and establishment of cell cycle-regulated cyclin-dependent kinase activity during embryonic stem cell differentiation. Mol Biol Cell. 2005;16:2018–2027. [PMC free article] [PubMed]
62. Zhou Q, Chipperfield H, Melton DA, Wong WH. A gene regulatory network in mouse embryonic stem cells. Proc Natl Acad Sci U S A. 2007;104:16438–16443. [PMC free article] [PubMed]
63. Wang ZX, Teh CH-L, Kueh JLL, Lufkin T, Robson P, et al. Oct4 and Sox2 directly regulate expression of another pluripotency transcription factor, Zfp206, in embryonic stem cells. J Biol Chem. 2007;282:12822–12830. [PubMed]
64. Kim J, Chu J, Shen X, Wang J, Orkin SH. An extended transcriptional network for pluripotency of embryonic stem cells. Cell. 2008;132:1049–1061. [PMC free article] [PubMed]
65. Okita K, Ichisaka T, Yamanaka S. Generation of germline-competent induced pluripotent stem cells. Nature. 2007;448:313–317. [PubMed]
66. Maherali N, Sridharan R, Xie W, Utikal J, Eminli S, et al. Directly Reprogrammed Fibroblasts Show Global Epigenetic Remodeling and Widespread Tissue Contribution. Cell Stem Cell. 2007;1:55–70. [PubMed]
67. Wernig M, Meissner A, Foreman R, Brambrink T, Ku M, et al. In vitro reprogramming of fibroblasts into a pluripotent ES-cell-like state. Nature. 2007;448:318–324. [PubMed]
68. Phillips RL, Ernst RE, Brunk B, Ivanova N, Mahan MA, et al. The genetic program of hematopoietic stem cells. Science. 2000;288:1635–1640. [PubMed]
69. Assou S, Le Carrour T, Tondeur S, Ström S, Gabelle A, et al. A meta-analysis of human embryonic stem cells transcriptome integrated into a web-based expression atlas. Stem Cells. 2007;25:961–973. [PMC free article] [PubMed]
70. Porter CJ, Palidwor GA, Sandie R, Krzyzanowski PM, Muro EM, et al. StemBase: a resource for the analysis of stem cell gene expression data. Methods Mol Biol. 2007;407:137–148. [PubMed]
71. Wei CL, Miura T, Robson P, Lim SK, Xu XQ, et al. Transcriptome profiling of human and murine ESCs identifies divergent paths required to maintain the stem cell state. Stem Cells. 2005;23:166–185. [PubMed]
72. Walker E, Ohishi M, Davey RE, Zhang W, Cassar PA, et al. Prediction and testing of novel transcriptional networks regulating embryonic stem cell self-renewal and commitment. Cell Stem Cell. 2007;1:71–86. [PubMed]
73. Affymetrix. Genechip Expression Analysis Technical Manual. 1999.
74. Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, et al. Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2003;31:e15. [PMC free article] [PubMed]
75. Gasch AP, Eisen MB. Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering. Genome Biol. 2002;3:RESEARCH0059. [PMC free article] [PubMed]
76. Yvert G, Brem RB, Whittle J, Akey JM, Foss E, et al. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat Genet. 2003;35:57–64. [PubMed]
77. The Biomart Team. BioMart Project. 2009. http://www.biomart.org/

Articles from PLoS ONE are provided here courtesy of Public Library of Science
PubReader format: click here to try

Formats:

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...

Links

  • PubMed
    PubMed
    PubMed citations for these articles

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...