![]() | ![]() |
Formats:
|
||||||||||||||||||||
Copyright © 2002, American Society of Plant Physiologists Sequence Relationships, Conserved Domains, and Expression Patterns for Maize Homologs of the Polycomb Group Genes E(z), esc, and E(Pc)1 Department of Agronomy, University of Wisconsin, Madison, Wisconsin 53706 (N.M.S., H.F.K., S.M.K.); Pioneer Hi-Bred International, Johnston, Iowa 50131 (O.N.D., P.H., T.G.H.); and Department of Agronomy, University of Minnesota, St. Paul, Minnesota 55108 (R.L.P.) *Corresponding author; e-mail smkaeppl/at/facstaff.wisc.edu; fax 608–262–5217. Received August 16, 2001; Revised November 28, 2001; Accepted December 21, 2001. This article has been cited by other articles in PMC.Abstract Polycomb group (PcG) proteins play an important role in developmental and epigenetic regulation of gene expression in fruit fly (Drosophila melanogaster) and mammals. Recent evidence has shown that Arabidopsis homologs of PcG proteins are also important for the regulation of plant development. The objective of this study was to characterize the PcG homologs in maize (Zea mays). The 11 cloned PcG proteins from fruit fly and the Enhancer of zeste [E(z)], extra sex combs (esc), and Enhancer of Polycomb [E(Pc)] homologs from Arabidopsis were used as queries to perform TBLASTN searches against the public maize expressed sequence tag database and the Pioneer Hi-Bred database. Maize homologs were found for E(z), esc, and E(Pc), but not for Polycomb, pleiohomeotic, Posterior sex combs, Polycomblike, Additional sex combs, Sex combs on midleg, polyhometoic, or multi sex combs. Transcripts of the three maize Enhancer of zeste-like genes, Mez1, Mez2, and Mez3, were detected in all tissues tested, and the Mez2 transcript is alternatively spliced in a tissue-dependent pattern. Zea mays fertilization independent endosperm1 (ZmFie1) expression was limited to developing embryos and endosperms, whereas ZmFie2 expression was found throughout plant development. The conservation of E(z) and esc homologs across kingdoms indicates that these genes likely play a conserved role in repressing gene expression. Gene expression patterns in eukaryotes are regulated in response to developmental and environmental cues. Changes in the patterns of gene expression are often the result of specific transcriptional regulators. In many cases, patterns of gene expression must be stably maintained through mitotic cell divisions even though the transcriptional regulator that effected the change in expression is only present transiently. The Polycomb group (PcG) genes of fruit fly (Drosophila melanogaster) stabilize repressed chromatin states during development. Recently, homologs of PcG genes have also been shown to affect developmental gene regulation in other species. Simon (1995) defined a set of 13 PcG genes in fruit fly based on a common phenotype of homeotic transformation. The homeotic transformations caused by mutations in PcG genes are the result of a failure to maintain transcriptional repression of homeotic genes through development. Biochemical and genetic evidence indicates that the 13 PcG proteins operate in at least two distinct complexes (Franke et al., 1992; Strutt and Paro, 1997; Jones et al., 1998; Sewalt et al., 1998; van Lohuizen et al., 1998; Ng et al., 2000; Tie et al., 2001). One complex includes the PcG proteins E(Z) and ESC, as well as the histone deacetylase RPD3 and the histone-binding p55 proteins (Tie et al., 2001). The second complex includes Polycomb (PC), Posterior sex combs (PSC), Polyhomeotic (PH), Sex combs on Midleg (SCM), and additional uncharacterized proteins (Franke et al., 1992; Shao et al., 1999; Poux et al., 2001). PcG proteins have also been shown to repress expression of introduced (Pal-Bhadra et al., 1997, 1999) and endogenous (Laible et al., 1997) genes in fruit fly. All examples of polycomb-based repression likely operate through formation of a repressive chromatin structure. Mammalian homologs of all of the cloned PcG proteins, except multisex combs, have been documented (Simon, 1995; Schumacher and Magnuson, 1997; Brock and van Lohuizen, 2001). As in fruit fly, mutations in mammalian PcG genes result in anterior derepression of Hox gene expression and alterations in cellular proliferation patterns (van der Lugt et al., 1994; Alkema et al., 1995; Muller et al., 1995; Akasaka et al., 1996; Core et al., 1997; Gould, 1997). In Caenorhabditis elegans, homologs of three of the 11 cloned PcG proteins, Enhancer of zeste [E(z)], extra sex combs (esc), and Enhancer of Polycomb [E(Pc)], have been reported (Holdeman et al., 1998; Korf et al., 1998; Stankunas et al., 1998). The E(z) and esc homologs (maternal effect sterile-2 [mes-2] and maternal effect sterile-6 [mes-6]) from C. elegans were identified as maternal genes required for germline immortality (Holdeman et al., 1998; Korf et al., 1998). The mes-2 and mes-6 genes are also involved in the silencing of transgenes in germline cells (Kelly and Fire, 1998). Homologs of E(z) and esc have also been documented in Arabidopsis (Goodrich et al., 1997; Grossniklaus et al., 1998; Ohad et al., 1999). Three E(z)-like genes, curly leaf (clf; Goodrich et al., 1997), Medea (Mea; Grossniklaus et al., 1998), and E(z)-likeA1 (Eza1; Preuss, 1999) and one esc-like gene, fertilization-independent endosperm (fie; Ohad et al., 1999), have been previously described. Mea (also identified as emb173, fis1, and f644) was identified in a screen for Arabidopsis gametophyte lethal mutations and autonomous endosperm development mutations (Castle et al., 1993; Chaudhury et al., 1997; Grossniklaus et al., 1998; Kiyosue et al., 1999; Luo et al., 1999). A plant heterozygous for a mea mutation will produce 50% aborted seeds that collapse and do not germinate. It has subsequently been found that Mea is imprinted such that the maternal copy of Mea is expressed endosperm tissues, whereas the paternal copy is not (Kinoshita et al., 1999; Vielle-Calzada et al., 1999). Mea mutants fail to repress endosperm development in the absence of fertilization (Kiyosue et al., 1999). These results indicate that maternal expression of Mea is required to repress endosperm development. Plants homozygous for clf mutations display curled leaves, altered maturation times, and partial homeotic transformations of floral tissues (Goodrich et al., 1997). Ectopic expression of the homeotic genes Agamous (AG) and Apetela3 (AP3) is also observed in clf homozygotes. In plants with wild-type Clf alleles, AG and AP3 are expressed in floral tissues where Clf mRNA is present. The presence of Clf RNA in cells expressing AG and AP3 indicates that CLF protein is not sufficient to organize suppressive chromatin, but that other targeting factors are also required (Goodrich et al., 1997). Overlapping expression of PcG and homeotic genes also occurs in fruit fly. The coexpression of functional PcG proteins and the genes that they can repress indicates that specific targeting factors are involved in PcG-dependent repression. A third E(z)-like gene, Eza1, is present in the Arabidopsis genome (Preuss, 1999). No phenotype for Eza1 has been reported. Mutations in the Arabidopsis esc-like gene, fie, produce phenotypes very similar to Mea (Ohad et al., 1996). A female gametophyte with a fie mutant allele will undergo replication of the central cell nucleus and partial endosperm development without a fertilization event (Ohad et al., 1999). This indicates that FIE is involved in repressing endosperm development until fertilization occurs. The similar phenotypes of fie and mea mutants suggests that these two genes may participate together in a silencing complex. This is consistent with the proven direct interaction of E(Z) and ESC in fruit fly (Jones et al., 1998), an interaction also shown to occur between FIE and MEDEA (Luo et al., 2000; Spillane et al., 2000; Yadegari et al., 2000). The objective of this research was to characterize the PcG genes (defined by Simon, 1995) in maize. Fruit fly and Arabidopsis sequences were used to identify maize sequences homologous to PcG genes. We report the full-length cDNA sequence of three E(z) homologs, two esc homologs, and one E(Pc) homolog, and we describe the likely evolution of these sequences. We also describe expression patterns of these genes in maize. RESULTS The 11 cloned (two other PcG genes have not been cloned) fruit fly proteins defined as the PcG by Simon (1995) were used as queries to identify homologs in Arabidopsis and maize (Table I). The top TBLASTN score from searches of the complete Arabidopsis genome sequence is shown for each PcG protein. The existence of E(z) and esc homologs in Arabidopsis has been previously documented (Goodrich et al., 1997; Grossniklaus et al., 1998; Ohad et al., 1999). The BLAST scores of searches with E(Z) and ESC proteins were much higher than the scores for searches with any of the other PcG proteins. In addition, two sequences with significant similarity to E(Pc) were detected. The two Arabidopsis proteins with significant similarity to E(PC) (AAG10815 and AAC17077) are much shorter than the fruit fly E(Pc), which is 2,033 amino acids. Further analysis of these sequences indicates that they contain the domains found in E(Pc) homologs from fruit fly, mouse, C. elegans, and Saccharomyces cerevisiae.
The top two BLAST hits for all Polycomb proteins were analyzed based on length, organization of the gene, and similarity of conserved domains. Based on BLAST scores, gene organization, and conservation in critical domains, we determined that only homologs of E(z), esc, and E(Pc) exist in Arabidopsis. Relatively strong BLAST hits to pleiohomeotic (pho), mxc, Pc, and psc were found, but subsequent analysis determined that they were not PcG homologs. The Arabidopsis proteins most similar to pho showed homology within a zinc finger domain, but not to any other regions of pho. Therefore, these zinc-finger proteins were determined to not be orthologs of pho. The proteins most similar to mxc contained an RNA binding motif, but were different in length and composition throughout the rest of the protein and were clearly not orthologs of mxc. The search for Pc homologs identified chromodomain-containing proteins that were members of other types of gene families. The lack of homology between Pc and these proteins outside the chromodomain clearly indicated that no Pc homologs exist in Arabidopsis. Psc candidates were found to have homology in a RING finger domain, but no homology outside this domain. Therefore, these candidates were determined to be RING finger proteins, but not Psc homologs. The 11 cloned PcG proteins from fruit fly and the E(z), esc, and E(Pc) homologs from Arabidopsis were used as queries to perform TBLASTN searches against the public maize expressed sequence tag (EST) database and the Pioneer Hi-Bred database. Maize homologs were found for E(z), esc, and E(Pc), but not for Pc, ph, Psc, Polycomblike (Pcl), Additional Sex combs (Asx), Scm, pho, or mxc. The full-length sequences of the maize E(z), esc, and E(Pc) genes were then obtained and characterized. Maize Has Three E(z) Homologs The Mez1 cDNA is 3,180 bp in length and produces a predicted protein of 933 amino acids (Fig. (Fig.1).1
The MEZ2 and MEZ3 putative proteins are 89% identical and the nucleotide sequences of the genes are 92% identical. Based on the high degree of sequence homology, Mez2 and Mez3 are likely the genome duplicates resulting from the paleotetraploid origin of maize. It is common to find two closely related sequences in the maize genome due its evolutionary history (Gaut and Doebley, 1997). The amino acid sequences of MEZ1 and MEZ2 are 42% identical and 56% similar to each other over their entire lengths. The nucleotide sequences of Mez1 and Mez2 are 52% identical. Arabidopsis contains at least three E(z)-like genes, and mutations in two of these genes, Mea and Clf, show distinct phenotypes. We attempted to determine which of the Arabidopsis E(z)-like genes that Mez1, Mez2, and Mez3 were most closely related to. The SET domain sequences of all E(z)-like proteins (indicated in Fig. Fig.1)1
Alignment of E(z) Homologs Identifies Five Conserved Domains Alignments of plant and animal E(z) homologs were used to identify conserved domains (Fig. (Fig.1).1 In addition to the domains identified by SMART, two additional regions of conservation are present in the plant and animal E(z)-like proteins. These domains were analyzed by BLASTP and are not found in any other sequences outside of E(z) homologs. Therefore, we have named these two domains Enhancer of zeste domain1 (EZD1) and Enhancer of zeste domain2 (EZD2) because they represent conserved domains specific to the E(z) family. EZD1 is a highly conserved acidic region of 70 amino acids in the N-terminal region (Figs. (Figs.11 Maize Contains Two ESC/FIE Homologs Two homologs of fruit fly esc were isolated from maize, ZmFie1 (Zea mays fertilization independent endosperm 1) and ZmFie2 (Zea mays fertilization independent endosperm 2). The ZmFIE1 and ZmFIE2 proteins are 76% identical and 85% similar over their entire lengths (Fig. (Fig.4).4
An alignment of esc-like proteins is shown in Figure Figure4A.4 Plants Contain Homologs of E(Pc) Two predicted proteins with significant similarity to E(Pc) were detected in the Arabidopsis genome. These proteins were named AtEPL1 (AAG10815) and AtEPL2 (AAC17077). AtEpl1 is located on chromosome 1 at 25cM (BAC F19K19) and is represented by a single EST (AV559019) from green silique tissue. AtEpl2 is also located on chromosome 1 at 117 cM (yeast artificial chromosome YUP8H12R) and is not represented by any Arabidopsis ESTs. The Arabidopsis proteins are 67% identical and 79% similar to each other. We searched for ESTs from maize with significant similarity to the Arabidopsis Epc genes. A single EST (BE511872) with significant similarity to the AtEpl1 and AtEpl2 genes was found. The maize E(Pc) homolog, ZmEpl101, is 52% identical and 70% similar to AtEpl2. In addition to the ZmEpl101 EST from maize, E(pc) homologous ESTs were also found in soybean (Glycine max), tomato (Lycopersicon esculentum), potato (Solanum tuberosum), wheat (Triticum aestivum), and barley (Hordeum vulgare). This indicates that E(Pc) homologs are present and expressed in a variety of plant species. E(Pc) homologs have previously been identified in mammals, yeast, and C. elegans (Stankunas et al., 1998). Alignments of the known E(Pc) homologs with the plant EPL proteins revealed significant conservation of the previously characterized EPcA domain (Stankunas et al., 1998; Fig. Fig.5A).5
Mez Genes in Maize Are Expressed throughout Plant Development, and Mez2 Shows a Tissue-Specific Alternative Splicing Pattern The expression level of the maize PcG homologs is low and difficult to detect by northern-blot analysis. In addition, it is not possible to generate specific probes for the pairs of duplicate genes, ZmFie1/ ZmFie2 and Mez2/Mez3, due to the high degree of nucleotide identity between these two sequences. Therefore, we used reverse transcriptase (RT)-PCR analysis to determine the expression pattern of the PcG genes of maize (Fig. (Fig.6).6
Multiple transcripts were observed when the entire coding sequence of Mez2 was PCR amplified (data not shown). Three transcripts were characterized from Mez2, including the full-length transcript and two alternatively spliced products (Fig. (Fig.7A).7
We tested cDNA from eight different maize tissues to determine whether the splicing of Mez2 transcripts is tissue specific (Fig. (Fig.7A).7 ZmFie2 Is Expressed throughout Development, Whereas ZmFie1 Expression Is Limited to the Early Embryo and Endosperm The presence of ZmFie1 and ZmFie2 transcripts in various plant tissues was tested by RT-PCR (Fig. (Fig.6).6 DISCUSSION PcG proteins play an important role in the maintenance of gene silencing in animals. In this study, we searched the complete Arabidopsis genome sequence and maize EST databases for homologs of PcG proteins. This exhaustive search identified homologs for only three of the 11 cloned fruit fly PcG proteins, E(z), esc, and E(Pc). Biochemical studies have found that the PcG proteins of animals are organized into at least two complexes, the E(z)/esc complex (Jones et al., 1998; Sewalt et al., 1998) and the PRC1 complex, which includes Pc, Ph, Scm, and Psc (Shao et al., 1999). The role of the E(PC) protein is not well characterized, and current evidence suggests that it operates independently of the E(Z)/ESC complex. Genetic evidence from fruit fly suggests that the functions of all PcG proteins are necessary for repression of homeotic genes (Simon, 1995). The genetic data include similar phenotypes of single mutants as well as studies of double mutants. The observations that E(z) and esc homologs function to repress gene expression in organisms that lack other PcG proteins,such as plants and C. elegans (Goodrich et al., 1997; Holdeman et al., 1998; Kelly and Fire, 1998; Korf et al., 1998), indicates that the E(z)/esc complex is capable of repression of transcription in the absence of the PRC1 complex. In our searches, we failed to find homologs of any proteins in the PRC1 complex. The fact that the E(Z)/ESC complex is capable of repressing transcription in the absence of PRC1 raises the question of what, if anything, is fulfilling the function of the PRC1 complex in plants and C. elegans. No catalytic activities have been defined for any of the fruit fly proteins found in the PRC1 complex. In vitro, the PRC1 complex of fruit fly interferes with SWI/SNF-dependent chromatin remodeling activities (Shao et al., 1999). If the primary role of the PRC1 complex is inhibition of SWI/SNF-dependent chromatin remodeling, then plants may contain a different group of proteins, targeted by E(Z)/ESC, that interfere with SWI/SNF activity. In an alternate manner, the E(Z)/ESC complex may be capable of silencing transcription independent of PRC1 function. The role of the PRC1 complex in fruit fly and mammals may be to enhance or stabilize this silencing, whereas in plants and C. elegans this supplementary function is absent. The E(Z) and ESC Proteins Can Provide Two Mechanisms for Repression We have found multiple homologs of E(z) and esc in maize, and previous studies have documented similar homologs in Arabidopsis (Goodrich et al., 1997; Grossniklaus et al., 1998; Ohad et al., 1999; Preuss, 1999). Our sequence analysis of the MEZ1, MEZ2, and MEZ3 proteins revealed the presence of multiple domains that are conserved between plant and animal E(Z)-like proteins. The most conserved domain is the SET domain, which is found near the C terminus of all E(Z)-like proteins. The SET domain of the Su(var)3-9-like proteins and mammalian G9a protein has been demonstrated to act as a histone methyltransferase (Rea et al., 2000; Lachner et al., 2001; Nakayama et al., 2001; Tachibana et al., 2001). These proteins contain Cys-rich regions on both sides of the SET domain. The presence of both Cys-rich regions was proposed to be required for targeting histone methyltransferase activity to histone H3 (Rea et al., 2000). In this same study, other proteins such as EZH2 and HRX, which contain only one Cys-rich region, did not possess detectable histone methyltransferase activity. The maize E(Z)-like proteins like fruit fly and mammalian E(Z) proteins only contain a Cys-rich region on the N-terminal side of the SET domain. Due to the high conservation of the SET domain, it is likely that the plant E(Z)-like proteins are protein methyltransferases. Further research will determine the if E(Z)-like proteins are capable of histone methylation in vivo or if they methylate other proteins. Alignments of the two ZmFIE proteins with Arabidopsis FIE, fruit fly ESC, and mammalian WAIT-1 revealed several conserved features. The maize ZmFIE1 and ZmFIE2 proteins contain seven WD repeats, the same as found in all other characterized ESC-like proteins (Ng et al., 1997; Ohad et al., 1999). The spacing of the WD repeats found in ZmFIE1 and ZmFIE2 is also consistent with that observed in other ESC-like proteins. This suggests that the ZmFIE proteins found in plants are likely to form a β-propeller structure similar to that predicted for fruit fly ESC (Ng et al., 1997). The sequence conservation of ZmFIE proteins with the fruit fly and mammalian proteins makes it reasonable to propose that these proteins are involved in similar protein-protein interactions. The ESC-like proteins have been documented to physically interact with two other proteins. A direct interaction between E(Z) and ESC homologs has been demonstrated in fruit fly, mice, C. elegans, and Arabidopsis (Jones et al., 1998; Sewalt et al., 1998; Luo et al., 2000; Spillane et al., 2000; Yadegari et al., 2000; Xu et al., 2001). The ESC-like proteins in fruit fly and mouse directly interact with Rpd3 histone deacetylase proteins (van der Vlag and Otte, 1999; Tie et al., 2001). These interactions provides a mechanism for ESC-like proteins to link the catalytic activities of protein methylation by E(Z)-like proteins with histone deacetylation by Rpd3-like proteins. The E(Z)/ESC complex could perform two distinct functions in repressing gene expression. First, the E(Z)/ESC contains a histone deacetylase protein, which can mediate a repressive chromatin structure. Histone acetylation states are relatively unstable and require constant presence of a histone deacetylase to be maintained (Jenuwein, 2001). Second, the SET domain of E(z) is predicted to be involved in protein methylation. Unlike acetylation, methylation often tends to be more stable (Jenuwein, 2001). Although the role of histone acetylation in regulating chromatin states is defined (Cheung et al., 2000), the role of E(z)-mediated protein methylation is less apparent. Protein methylation may be involved in potentiating interactions with other proteins by creating specific binding sites. For example, the SET domain protein Su(var)3-9 methylates Lys 9 of histone H3 (Rea et al., 2000). Heterochromatin protein HP1 will bind specifically to methylated histone H3, but not to unmodified histone H3 (Bannister et al., 2001; Lachner et al., 2001; Nakayama et al., 2001). In animals, the PcG proteins include a SET domain protein, E(z), and a chromodomain protein, Polycomb. Homologs of the Polycomb gene are not present in plants, but other chromodomain containing genes, such as chromomethylases, are present. Therefore, the E(Z)/ESC complex provides a combination of a reduction in histone acetylation coupled with targeted protein methylation that likely results in a stable repressive chromatin state. Plants Contain Proteins Similar to E(Pc) We have identified homologs of a third PcG protein, E(Pc), from Arabidopsis and maize. The Arabidopsis Epl) sequences that we found are located at two unlinked locations on chromosome 1. The maize ZmEpl101 gene is homologous to the Arabidopsis genes throughout the coding sequence. Unlike E(z) and esc, the size and organization of E(Pc) homologs differs between species. The fruit fly E(PC) protein is 2,033 amino acids. The homologs of E(Pc) from other species lack the C-terminal 1,400 amino acids present in the fruit fly protein and average 560 amino acids in length (Stankunas et al., 1998). All of the E(Pc) homologs contain two conserved domains, EpcA and EpcB. The EpcA domain encompasses approximately the first 200 amino acids of all E(Pc) homologs, whereas the EpcB domain is 13 amino acids. Stankunas et al. (1998) identified the EPcC domain as a region of conservation between the fruit fly and mammalian E(PC) protein sequences that is not present in C. elegans or S. cerevisiae homologs. Our alignments show that like the C. elegans or S. cerevisiae homologs, the plant E(Pc) homologs do not contain the EPcC domain identified in fruit fly and mammalian sequences (Stankunas et al., 1998). To date, E(PC) has not been found in complexes associated with any other PcG proteins. Unlike the other PcG group genes, mutations in E(Pc) do not display homeotic transformations alone (Sato et al., 1983). However, mutations in E(Pc) enhance the homeotic transformations observed when other PcG genes are mutated (Sato et al., 1984). This suggests that E(Pc) plays a role distinct from other PcG proteins. In addition, a homolog of E(Pc), Epl1, has been found in S. cerevisiae, a species that does not contain homologs of any other PcG proteins (Stankunas et al., 1998). The presence of an E(Pc) homolog in a species that lacks all other PcG proteins suggests that E(Pc) homologs may have a basic chromatin function independent of PcG proteins, but that is required for PcG silencing. In support of the idea that E(Pc) plays a broader role in chromatin regulation, the human E(Pc) homolog has been shown to repress and activate transcription (Shimono et al., 2000). In a similar manner, another study found that the yeast E(Pc) homolog Epl1 was present in the NuA4 histone acetyltransferase complex involved in transcriptional activation (Galarneau et al., 2000). The plant E(Pc) homologs Epl1, Epl2, and ZmEpl101 are likely to be proteins involved in chromatin-based regulation of gene expression, but may operate independently of E(z) and esc homologs. Expression of Multiple E(Z) and ESC Proteins May Allow Specialization of Function in Plants Plants contain multiple E(z) and esc homologs. Arabidopsis contains three homologs of E(z): Clf, Eza1, and Mea. We have documented the presence of three E(z) homologs in maize. The first gene, Mez1, does not have a closely related nucleotide sequence in the maize genome and is most likely to be the maize ortholog of Clf based on our phylogenetic analysis. The other two E(z) homologs, Mez2 and Mez3, are highly related to each other and are predicted to be the orthologs of Eza1 based on sequence phylogeny. No Mea homologs are observed in the public monocot EST databases, or rice (Oryza sativa) genomic sequences (searches done February 8, 2002). This could simply reflect the fact that Mea is not highly expressed and is therefore not represented as an EST, or it could indicate that monocots do not have a Mea homolog. If there is no Mea present in monocots, another E(z) protein is likely performing a function in monocots analogous to Mea's function in dicots. This could also reflect differences in the regulation of development in monocots and dicots. In addition to having three distinct genes encoding E(z) homologs, maize also displays alternative splicing of at least one E(z) gene, Mez2. The alternative splicing pattern of Mez2 could provide distinct functions arising from the same gene in different tissues and cells. We successfully amplified transcripts corresponding to multiple splice products from many tissues. However, the alternative splicing pattern was distinct in some tissues, indicating a developmental control of the splice pattern of Mez2. The presence of multiple transcripts in some tissues may be due to simultaneous presence of these transcripts in all cells of that tissue or may be due to cell type-specific expression of certain transcripts of Mez2. The putative proteins produced by the alternatively spliced transcripts of Mez2 are likely to provide different functions. The MEZ2as2 protein is similar in size and domain composition to MEA (with the exception of the SANT domain). The MEZ2as2 form predominates in embryonic tissues, which is the location of Mea function in Arabidopsis. It is possible that through alternative splicing, Mez2 provides Mea and Eza1 functions in maize. The MEZ2as1 protein, which is present in kernel, seedling, and immature leaf tissue, lacks the C-terminal two-thirds of the protein, including the SANT, Cys-rich, and SET domains. The truncated protein produced from Mez2as1 transcripts might negatively regulate function of the E(z)/esc complexes by binding ESC proteins in a nonfunctional complex. The production of the MEZ2as2 protein, which is missing a 270-amino acid region that includes the SANT DNA-binding domain, but still possesses a SET domain, may alter the localization or activities of the complex. The alternative splicing of Mez2 may allow for this gene to provide distinct functions in different tissues. In this study, we have also documented that maize contains multiple esc homologs, ZmFie1 and ZmFie2. In the other organisms in which the PcG proteins have been studied (fruit fly, humans, mice, C. elegans, and Arabidopsis), only one esc homolog has been found. Based on the chromosomal locations and high degree of nucleotide identity, it is likely that the duplication of these genes is due to the allotetraploid history of maize (Helentjaris, 1995; Gaut and Doebley, 1997). This duplication event may have allowed specialization of the function of ZmFie1 and ZmFie2 in expression pattern or function. We have found that ZmFie1 is only expressed in kernel tissues, whereas ZmFie2 is expressed in all tissues tested. The sequence of ZmFie1 is also different from esc homologs, containing N- and C-terminal extensions. ZmFIE1 may have evolved kernel-specific functions that are important in the development of the maize endosperm or embryo. The E(Z) and ESC proteins function as a complex. The domain of the E(Z) and MEA proteins that interacts with ESC and FIE has been mapped to the N-terminal region of the protein (Jones et al., 1998; Luo et al., 2000; Spillane et al., 2000; Yadegari et al., 2000). All three of the MEZ proteins from maize as well as the two putative proteins produced by alterative splicing of Mez2 contain the N-terminal region of the protein and therefore are predicted to interact with the maize ZmFIE proteins. Evidence from Arabidopsis indicates that MEA and EZA1 interact with FIE (Luo et al., 2000). Assuming that the three full-length MEZ proteins and the two alternative splice isoforms can interact equally with both ZmFIE proteins, up to 10 distinct E(z)/esc complexes could be formed. The multiplicity of complexes could allow for a number of specialized roles in regulating gene expression. Furthermore, the expression and splicing pattern of Mez and ZmFie genes is tissue specific, which would regulate the exact set of E(Z)/ESC complexes present in a given tissue. Epigenetic regulation of gene expression involving homologs of E(z) and esc is conserved across diverse species. These proteins offer a pathway for epigenetic regulation separate from DNA methylation. The main role of PcG proteins in plants and in animals may be to maintain the gene expression patterns determined by developmental decisions. This type of repression must be reset at meiosis each generation. In contrast, repression mediated by DNA methylation provides a meiotically heritable mechanism for gene silencing. There may be instances where PcG-mediated and DNA methylation-mediated silencing overlap, such as the regulation of AG and AP3 (Finnegan et al., 1996; Goodrich et al., 1997). However, we believe PcG-mediated silencing is primarily a source of epigenetic memory during development, whereas DNA methylation plays a central role in heritable gene silencing for genome protection. MATERIALS AND METHODS Cloning of Mez Genes Fruit fly (Drosophila melanogaster) E(z) (AAC46462) was used in a TBLASTN search of the Pioneer Hi-Bred EST database. Three contigs with significant similarity to E(z) were discovered and named Mez1, Mez2, and Mez3. Other SET domain-containing proteins were also identified, but were not included in this research because they had greater similarity to proteins other than to E(z). RACE was performed to obtain full-length cDNA sequence of Mez1 and Mez2. Full-length Mez3 was obtained from an EST clone that contained the entire coding region. RACE reactions were performed using the Marathon cDNA kit (CLONTECH, Palo Alto, CA) on cDNA produced from 1-week-old Mo17 seedlings. Advantage2 polymerase (CLONTECH) was used in the RACE reactions. The primers used in the RACE reactions were Mez1F1 (5′-GGGTGTGGTGATGGTACATTGG-3′), Mez1R2 (5′-CAGCTTGTCACCCATTCTGTATGCG-3′, Mez2R3 (5′-TGCCTCGTCCTTCTTTGATCCTTCG-3′), and Mez2F3 (5′-CTCACAAGGAAGCAGACAAACGCGG-3′). RACE products were gel purified and cloned into pGEM-T Easy (Promega, Madison, WI). All sequencing was performed using BigDye terminator cycle sequencing on an ABI sequencer (PerkinElmer Applied Biosystems, Foster City, CA). Sequencing reactions were done in a 10 μL volume with 200 to 400 ng of DNA and 10 pg of primer. The cycling conditions used were 95oC for 2 min, 70 cycles at 95o for 15 s, 55oC for 20 s, 60oC for 4 min, followed by 72oC for 7 min. Cloning of ZmFie Genes Arabidopsis FIE (AF129516) was used in a TBLASTN search of the Pioneer Hi-Bred maize (Zea mays) EST database. Two contigs with significant similarity were found and were named ZmFie1 and ZmFie2. Both contigs contained at least one full-length EST. The sequence of the ZmFie genes was obtained by sequencing the full-length clones as described above. Cloning of ZmEpl101 The Arabidopsis genome sequence was searched using fruit fly E(Pc) as a query. Two sequences were identified in this search, AtEpl1 and AtEpl2. The AtEPL1 (AC011808) protein sequence was then used to perform a TBLASTN search of the public maize EST database (http://www.zmdb.iastate.edu/). One EST (BE511872) with significant similarity was found. We obtained the remaining sequence for the ZmEpl101 gene by RACE-PCR. The primers used were Mepc1R1 (5′-GTCCGGAGAAGAGGATTCCATCGATC-3′) and Mepc1R2 (5′-CCTCGTCCAATCCTACCTCGACACC-3′). Phylogenetic Analysis The SET domains from all E(z)-like proteins were aligned using ClustalW (the exact region used is indicated in Fig. Fig.1).1 RT-PCR Analysis RT-PCR was used to assess expression patterns due to the relatively low expression of the maize PcG homologs, and to the fact that most of the genes were duplicated. Total RNA was extracted with Trizol (Invitrogen, Carlsbad, CA) from 10 tissues from the inbred B73 (endosperm [11 DAP], whole kernel [3 DAP], whole kernel [11 DAP], 10 d seedling [whole plant included], root tips, immature leaf [leaves 3–5], mature leaf [fully expanded leaf 10], and meiotic tassel). One microgram of total RNA was used to make cDNA with the SMART cDNA synthesis kit according to manufacturer's instructions (CLONTECH). PCR reactions were performed in a 25-μL total volume containing approximately 0.5 ng of cDNA, 5 pm each primer, 1 unit of Taq polymerase (Promega), 2.5 μL of 10× reaction buffer, 2 μL of 25 mm MgCl2, and 0.3 μL of 25 mm dNTPs. Primers used for the RT-PCR reactions were Ubi1F1 (5′-TAAGCTGCCGATGTGCCTGCGTCG-3′) and Ubi1R1 (5′-CTGAAAGACAGCACATAATGAGCACAGGC-3′) for Ubiquitin; Mepc1F1 (5′-CCGGCCATGGCGAAGCTG-3′) and Mepc1R2 (5′-CCTCGTCCAATCCTACCTCGACACC-3′) for ZmEpl101; Mez1F1 (5′-GGGTGTGGTGATGGTACATTGG-3′) and Mez1R1 (5′-CGGGACCTAACTCTACGGATGG-3′) for Mez1; Mez2F8 (5′-CCCCTGTTTTGCAGCCAGTCGTGA-3′) and Mez2R8 (5′-GGTGAGAGAAGGATGCCTCGTCC-3′) for Mez2; Mez3F3 (5′-AGTATGTGTTGGATGCTTATCGCAAGG-3′) and Mez3R2 (5′-GGTTGTCAGTTTGTCACCTTCCGACC-3′) for Mez3; Mesc1–5 (5′-TTTGCAAGTTGTGGCATGGA-3′) and Mesc1R2 (5′-CCCAACTTTCAACATTCGAAGCATTC-3′) for ZmFie2; and FLMesc2F1 (5′-CAACATCTGGCACAGCATGC-3′) and Mesc2R3 (5′-GTTGCCTATTGCCATCTGGTTGGAG-3′) for ZmFie1. Conditions of the PCR were as follows: 94oC for 2 min, 35 cycles at 94oC for 30 s, 63oC for 30 s, and 72oC for 2 min, followed by 72oC for 7 min. Amplified products were separated in a 1% (w/v) agarose Tris borate-EDTA gel and were visualized by ethidium bromide staining. The primers chosen flanked introns and are expected to produce different sized transcripts from genomic DNA and cDNA. The ZmEpl101 primers amplify a 450-bp cDNA product and an approximately 1,200-bp genomic product. The Mez1 primers would amplify a 717-bp cDNA product and a 1,235-bp genomic product, the Mez2 primers would amplify a 556-bp cDNA product and a 1,083-bp genomic product, the Mez3 primers would amplify a 509-bp cDNA product and a 1,211-bp genomic product, the ZmFie1 primers would amplify 1,135-bp cDNA product and a 4,008-bp genomic product, and the ZmFie2 primers would amplify 711-bp cDNA product and a 1,748-bp genomic product. Analysis of Mez2 Alternative Splicing Alternative splicing of Mez2 was indicated by the presence of multiple bands when using primers that amplified the entire coding sequence. The primers used were Mez2F6 (5′-CGCAGCTGATACGGCAAGTCCAATCG-3′) and Mez2R2 (5′-GTATCATCCGGAGCGACTCTTCAGC-3′). These primers are expected to produce a 2,594-bp cDNA product and a >9-kb genomic fragment. To characterize the alternative splice products, PCR was performed using 10 ng of B73 cDNA isolated from ear tissue and Amplitaq Gold DNA polymerase (PerkinElmer Applied Biosystems). The cycling conditions were 95oC for 5 min, 35 cycles at 95oC for 30 s, 65oC for 30 s, and 72oC for 3.5 min, followed by 72oC for 7 min. The amplified products were separated by electrophoresis in a 1% (w/v) low melting point agarose Tris borate-EDTA gel and were observed by ethidium bromide staining. Three alternative splice products were consistently observed and each band was excised and sequenced. The tissue-specific distribution of the different splice products was assayed using RT-PCR on the same tissues used for RT-PCR analysis. The primers used for these reactions were Mez2F10 (5′-CCATGTGAGAAGCAACCCTACAGC-3′) and Mez2R10 (5′-CCCAACCTGCAACATCAGATCTTCC-3′). The reaction conditions and cycling times were the same as those used for the RT-PCR reactions. Footnotes 1This research was supported by the U.S. Department of Agriculture (National Needs Fellowship no. 98–38420–5832) and by the National Science Foundation (grant no. 9975930). Article, publication date, and citation information can be found at www.plantphysiol.org/cgi/doi/10.1104/pp.010742. LITERATURE CITED
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||
Curr Opin Cell Biol. 1995 Jun; 7(3):376-85.
[Curr Opin Cell Biol. 1995]EMBO J. 1992 Aug; 11(8):2941-50.
[EMBO J. 1992]Mol Cell Biol. 1997 Dec; 17(12):6773-83.
[Mol Cell Biol. 1997]Mol Cell Biol. 1998 May; 18(5):2825-34.
[Mol Cell Biol. 1998]Mol Cell Biol. 1998 Jun; 18(6):3586-95.
[Mol Cell Biol. 1998]Curr Opin Cell Biol. 1995 Jun; 7(3):376-85.
[Curr Opin Cell Biol. 1995]Trends Genet. 1997 May; 13(5):167-70.
[Trends Genet. 1997]Curr Opin Genet Dev. 2001 Apr; 11(2):175-81.
[Curr Opin Genet Dev. 2001]Genes Dev. 1994 Apr 1; 8(7):757-69.
[Genes Dev. 1994]Nature. 1995 Apr 20; 374(6524):724-7.
[Nature. 1995]Nature. 1997 Mar 6; 386(6620):44-51.
[Nature. 1997]Science. 1998 Apr 17; 280(5362):446-50.
[Science. 1998]Plant Cell. 1999 Mar; 11(3):407-16.
[Plant Cell. 1999]Plant Cell. 1999 May; 11(5):765-8.
[Plant Cell. 1999]Mol Gen Genet. 1993 Dec; 241(5-6):504-14.
[Mol Gen Genet. 1993]Proc Natl Acad Sci U S A. 1997 Apr 15; 94(8):4223-8.
[Proc Natl Acad Sci U S A. 1997]Science. 1998 Apr 17; 280(5362):446-50.
[Science. 1998]Proc Natl Acad Sci U S A. 1999 Mar 30; 96(7):4186-91.
[Proc Natl Acad Sci U S A. 1999]Proc Natl Acad Sci U S A. 1999 Jan 5; 96(1):296-301.
[Proc Natl Acad Sci U S A. 1999]Nature. 1997 Mar 6; 386(6620):44-51.
[Nature. 1997]Plant Cell. 1999 May; 11(5):765-8.
[Plant Cell. 1999]Proc Natl Acad Sci U S A. 1996 May 28; 93(11):5319-24.
[Proc Natl Acad Sci U S A. 1996]Plant Cell. 1999 Mar; 11(3):407-16.
[Plant Cell. 1999]Mol Cell Biol. 1998 May; 18(5):2825-34.
[Mol Cell Biol. 1998]Proc Natl Acad Sci U S A. 2000 Sep 12; 97(19):10637-42.
[Proc Natl Acad Sci U S A. 2000]Curr Biol. 2000 Nov 30; 10(23):1535-8.
[Curr Biol. 2000]Curr Opin Cell Biol. 1995 Jun; 7(3):376-85.
[Curr Opin Cell Biol. 1995]Curr Opin Cell Biol. 1995 Jun; 7(3):376-85.
[Curr Opin Cell Biol. 1995]Nature. 1997 Mar 6; 386(6620):44-51.
[Nature. 1997]Science. 1998 Apr 17; 280(5362):446-50.
[Science. 1998]Plant Cell. 1999 Mar; 11(3):407-16.
[Plant Cell. 1999]Proc Natl Acad Sci U S A. 1997 Jun 24; 94(13):6809-14.
[Proc Natl Acad Sci U S A. 1997]Nucleic Acids Res. 2000 Jan 1; 28(1):231-4.
[Nucleic Acids Res. 2000]Trends Biochem Sci. 1996 Mar; 21(3):87-8.
[Trends Biochem Sci. 1996]Science. 1998 Apr 17; 280(5362):446-50.
[Science. 1998]Nature. 1997 Mar 6; 386(6620):44-51.
[Nature. 1997]Genome Res. 2001 Jan; 11(1):55-66.
[Genome Res. 2001]Mol Cell Biol. 1997 Nov; 17(11):6663-72.
[Mol Cell Biol. 1997]Development. 1998 Jul; 125(13):2469-78.
[Development. 1998]Plant Cell. 1999 Mar; 11(3):407-16.
[Plant Cell. 1999]Development. 1998 Oct; 125(20):4055-66.
[Development. 1998]Mol Cell Biol. 1998 May; 18(5):2825-34.
[Mol Cell Biol. 1998]Mol Cell Biol. 1998 Jun; 18(6):3586-95.
[Mol Cell Biol. 1998]Cell. 1999 Jul 9; 98(1):37-46.
[Cell. 1999]Curr Opin Cell Biol. 1995 Jun; 7(3):376-85.
[Curr Opin Cell Biol. 1995]Nature. 1997 Mar 6; 386(6620):44-51.
[Nature. 1997]Cell. 1999 Jul 9; 98(1):37-46.
[Cell. 1999]Nature. 1997 Mar 6; 386(6620):44-51.
[Nature. 1997]Science. 1998 Apr 17; 280(5362):446-50.
[Science. 1998]Plant Cell. 1999 Mar; 11(3):407-16.
[Plant Cell. 1999]Plant Cell. 1999 May; 11(5):765-8.
[Plant Cell. 1999]Nature. 2000 Aug 10; 406(6796):593-9.
[Nature. 2000]Mol Cell Biol. 1997 Nov; 17(11):6663-72.
[Mol Cell Biol. 1997]Plant Cell. 1999 Mar; 11(3):407-16.
[Plant Cell. 1999]Mol Cell Biol. 1998 May; 18(5):2825-34.
[Mol Cell Biol. 1998]Mol Cell Biol. 1998 Jun; 18(6):3586-95.
[Mol Cell Biol. 1998]Proc Natl Acad Sci U S A. 2000 Sep 12; 97(19):10637-42.
[Proc Natl Acad Sci U S A. 2000]Trends Cell Biol. 2001 Jun; 11(6):266-73.
[Trends Cell Biol. 2001]Curr Opin Cell Biol. 2000 Jun; 12(3):326-33.
[Curr Opin Cell Biol. 2000]Nature. 2000 Aug 10; 406(6796):593-9.
[Nature. 2000]Nature. 2001 Mar 1; 410(6824):120-4.
[Nature. 2001]Nature. 2001 Mar 1; 410(6824):116-20.
[Nature. 2001]Development. 1998 Oct; 125(20):4055-66.
[Development. 1998]Development. 1998 Oct; 125(20):4055-66.
[Development. 1998]J Biol Chem. 2000 Dec 15; 275(50):39411-9.
[J Biol Chem. 2000]Mol Cell. 2000 Jun; 5(6):927-37.
[Mol Cell. 2000]Proc Natl Acad Sci U S A. 1997 Jun 24; 94(13):6809-14.
[Proc Natl Acad Sci U S A. 1997]Mol Cell Biol. 1998 May; 18(5):2825-34.
[Mol Cell Biol. 1998]Proc Natl Acad Sci U S A. 2000 Sep 12; 97(19):10637-42.
[Proc Natl Acad Sci U S A. 2000]Curr Biol. 2000 Nov 30; 10(23):1535-8.
[Curr Biol. 2000]Plant Cell. 2000 Dec; 12(12):2367-2382.
[Plant Cell. 2000]Proc Natl Acad Sci U S A. 1996 Aug 6; 93(16):8449-54.
[Proc Natl Acad Sci U S A. 1996]Nature. 1997 Mar 6; 386(6620):44-51.
[Nature. 1997]Nature. 1997 Mar 6; 386(6620):44-51.
[Nature. 1997]Science. 1998 Apr 17; 280(5362):446-50.
[Science. 1998]Plant Cell. 1999 Mar; 11(3):407-16.
[Plant Cell. 1999]