Identification of Arabidopsis seed-specific mRNAs. (A) GeneChip data obtained for all stages of the life cycle (Fig. 2 and Fig. S4) were partitioned into three groups: (i) reproductive development [OV and floral buds (FBUD) (blue circle)], (ii) seed development [(24H, GLOB, COT, MG, and PMG) (red circle)], and (iii) vegetative development [SDLG, leaf, stem, and root (green circle)]. Processing and filtering of the data are outlined in SI Materials and Methods. Number in parentheses indicates number of TF mRNAs. The identities of seed-specific, seed-specific TF, reproductive-organ-specific, and vegetative-organ-specific mRNAs are listed in Table S13 of Dataset S2. GO terms that are enriched significantly in the seed-, reproductive-, and vegetative-specific mRNA sets (P < 0.01) are presented in Table S15 of Dataset S2. Seed development accumulation patterns and representation of functional groups for the 289 seed-specific mRNAs are shown in Fig. S5. (B and C) Unsupervised hierarchical clustering of mRNAs (B) and TF mRNAs (C) shared by all periods of the life cycle [i.e., intersection of mRNA sets in A] was carried out by using dChip 1.3 (56) as described in SI Materials and Methods and Fig. 3A legend. Only the top 2,000 probe sets with the most varying signals across all periods of the life cycle were included in the clustering analysis shown in B (SI Materials and Methods). All 77 TF mRNAs included in the top 2,000 most varying probe sets shared by all life cycle periods (B) were used for the clustering analysis shown in C. Blue, red, and green bars highlight mRNA clusters that are up-regulated in (i) OV and 24H seeds, (ii) GLOB, COT, MG, and PMG seeds, and (iii) SDLG and vegetative organs, respectively. The number of mRNAs in each cluster that increased significantly in prevalence ≥2-fold relative to the mean signal intensity of each cluster (P < 0.05) is listed next to the bars in B. The identities of mRNAs shared throughout the life cycle (A) and those that are present in up-regulated clusters (B and C) are listed in Table S17 of Dataset S2. GO terms that are enriched significantly in each cluster (P < 0.01) are presented in Table S18 of Dataset S2. (D) TF families and stage specificity of seed-specific TF mRNAs identified in A. Seed-specific TF mRNAs were classified into families as shown in Fig. S5. Homozygous T-DNA insertion lines for TF genes marked with a * did not produce a detectable seed phenotype (green * by us and blue * by others) (Table S20 in Dataset S2). Mutations in seed-specific TF genes marked with a # were shown previously by us (e.g., lec 1, lec1-like, lec2, mea) and by others (e.g., pei1, fus3), to produce a seed-defective phenotype (Table S20 in Dataset S2). Scale from −3 (green) to +3 (red) represents the relative number of standard deviations from the mean signal intensity for each probe set across all developmental stages.