![]() | ![]() |
Formats:
|
||||||||||||||||
Copyright Guedes Correa et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. The Role of bZIP Transcription Factors in Green Plant Evolution: Adaptive Features Emerging from Four Founder Genes 1Centro de Biologia Molecular e Engenharia Genética, Departamento de Genética e Evolução, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, Brazil 2Department of Molecular Biology, University of Potsdam, Potsdam-Golm, Germany 3Cooperative Research Group, Max-Planck Institute of Molecular Plant Physiology, Potsdam-Golm, Germany 4GabiPD Team, Bioinformatics Group, Max-Planck Institute of Molecular Plant Physiology, Potsdam-Golm, Germany 5Laboratório de Biodiversidade Molecular, Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil Shin-Han Shiu, Editor Michigan State University, United States of America #Contributed equally. * E-mail: mgavince/at/unicamp.br Conceived and designed the experiments: LGGC CGS RVRVdS MV. Performed the experiments: LGGC DMRP RVRVdS. Analyzed the data: LGGC DMRP CGS MV. Contributed reagents/materials/analysis tools: BMR. Wrote the paper: LGGC DMRP BMR MV. Received February 18, 2008; Accepted July 22, 2008. Abstract Background Transcription factors of the basic leucine zipper (bZIP) family control important processes in all eukaryotes. In plants, bZIPs are regulators of many central developmental and physiological processes including photomorphogenesis, leaf and seed formation, energy homeostasis, and abiotic and biotic stress responses. Here we performed a comprehensive phylogenetic analysis of bZIP genes from algae, mosses, ferns, gymnosperms and angiosperms. Methodology/Principal Findings We identified 13 groups of bZIP homologues in angiosperms, three more than known before, that represent 34 Possible Groups of Orthologues (PoGOs). The 34 PoGOs may correspond to the complete set of ancestral angiosperm bZIP genes that participated in the diversification of flowering plants. Homologous genes dedicated to seed-related processes and ABA-mediated stress responses originated in the common ancestor of seed plants, and three groups of homologues emerged in the angiosperm lineage, of which one group plays a role in optimizing the use of energy. Conclusions/Significance Our data suggest that the ancestor of green plants possessed four bZIP genes functionally involved in oxidative stress and unfolded protein responses that are bZIP-mediated processes in all eukaryotes, but also in light-dependent regulations. The four founder genes amplified and diverged significantly, generating traits that benefited the colonization of new environments. Introduction Growth and development of all organisms depend on proper regulation of gene expression. The control of transcription initiation rates by transcription factors (TF) represents one of the most important means of modulating gene expression [1]–[4]. TFs can be grouped into different protein families according to their primary and/or three-dimensional structure similarities in the DNA-binding and multimerization domains [4]–[6]. The interplay between the amplification of the ancestral repertoire of TFs, the emergence of new TFs, the combination of protein domains and sequence divergence constitutes an important driving force towards the evolution of organismic complexity [7]–[10]. Understanding the detailed evolutionary history of these TFs and their corresponding functions is therefore crucial to reveal the changes and/or innovations in transcriptional regulatory circuits that underlie the biological diversity found among eukaryotes. Large scale genomic comparisons revealed that angiosperm TF families undergo more intense gene expansion when compared to animals and fungi, possibly reflecting the ability of flowering plants to efficiently adapt to different and unstable environmental conditions. Moreover, gene expansion rates vary among plant TF families, indicating lineage-differential specializations [11], [12]. For instance, MADS-box and homeodomain families, which exert similar functions in developmental control, expanded preferentially in the angiosperm and human lineages, respectively [13], [14]. Contrariwise, the basic leucine zipper (bZIP) TF family apparently expanded to a similar extent in angiosperms and humans [15]. Currently we do not well understand why individual TF families underwent differential evolutionary expansions in the different eukaryotic lineages. Therefore, a deep evolutionary analysis of TF families including the identification of the founding (ancestral) gene sets in combination with functional assignments will greatly assist in addressing this issue [16], [17]. To our knowledge, however, only four families that are present in all green plants have until today been studied in a deep evolutionary scale, Dof [18], homeodomain [19], MADS-box [20], [21] and WRKY [22]. As a matter of fact, groups of orthologues, for which functional equivalence is often assumed, are rarely identified in a systematic and direct manner, with the exception of the HD-Zip class III subfamily [23], [24]. It is thus often difficult to infer ancestral functions at different time points of the evolutionary process. Here we performed a comprehensive analysis of the evolutionary relationships of TFs of the green plant bZIP family; homologous and orthologous relationships among bZIP TFs were established and ancestral functions were inferred. The bZIP TFs are characterized by a 40- to 80-amino-acid-long conserved domain (bZIP domain) that is composed of two motifs: a basic region responsible for specific binding of the TF to its target DNA, and a leucine zipper required for TF dimerization [5], [25]. Genetic, molecular and biochemical analyses indicate that bZIPs are regulators of important plant processes such as organ and tissue differentiation [26]–[30], cell elongation [31], [32], nitrogen/carbon balance control [33], [34], pathogen defence [35]–[40], energy metabolism [41], unfolded protein response [42], [43], hormone and sugar signalling [44]–[47], light response [48]–[50], osmotic control [34], [51], and seed storage protein gene regulation [52]. Initially, 50 plant bZIP proteins were classified into five families, taking into account similarities of their bZIP domain [53]. An original investigation of the complete Arabidopsis thaliana genome sequence indicated the presence of 81 putative bZIP genes [54], [55]. However, further detailed studies revealed 75 to 77 bZIP proteins to be encoded by the Arabidopsis nuclear genome, representing members of ten groups of homologues [55], [56]. The availability of the rice (Oryza sativa) [57], [58], black cottonwood (Populus trichocarpa) [59] and Arabidopsis genomic sequences [54] provides an exciting opportunity for the large-scale investigation of the genetic bases that underlies the extensive physiological and morphological diversity amongst the two main angiosperm divisions: monocots and eudicots. A possible comparative approach involves the establishment of relationships between different genomes in a homologous gene system [60]–[62], in which each group of orthologues is derived from an ancestral gene that underwent numerous modifications throughout evolution, including duplication and subsequent functional diversification. Considering that all genes of a given group of orthologues have the same ancestral origin, the establishment of this classification should allow the transfer of biochemical, structural and functional information from one protein to another, inside the same group [63]. Moreover, the relationships within a group of orthologues constitute the basis for a better understanding of the evolution of ancestral functions (conservation versus neo- or sub-functionalization through duplication) [64]–[66]. In this study, we identified the possible non-redundant complete sets of bZIPs in rice, comprising 92 proteins, and in black cottonwood, comprising 89 proteins. These collections of bZIPs together with the 77 bZIPs from Arabidopsis [56] could be divided, based on bZIP domain and other conserved motifs similarities, into 13 groups of bZIP homologues in angiosperms, three more than previously reported [55]. The identified groups constituted a backbone for a more detailed analysis of each group, to which additional bZIP sequences reported from other plants, including those deduced from expressed sequence tags (ESTs), were added. In total, we defined 34 Possible Groups of Orthologues (PoGOs), which may represent 34 ancestral functions in angiosperms. Interestingly, one PoGO was found exclusively in monocots, whereas a Possible Group of Paralogues (PoGP) appears to be restricted to Arabidopsis. To extend our bZIP analysis to all major lineages of green plants we additionally identified and incorporated bZIP sequences not only from two algal (Chlamydomonas reinhardtii [67] and Ostreococcus tauri [68]) and moss (Physcomitrella patens [69]) genomes, but also from ESTs of the ferns Selaginella moellendorffii and Adiantum capillus-veneris and the gymnosperms Pinus taeda and Picea glauca. Based on this investigation, a model for the evolution of bZIP genes in green plants, based on four founder genes representing an ancestral tool kit, was established. Its main points are discussed here. We also propose an updated classification of plant bZIP genes which should facilitate functional studies. Results and Discussion Groups of Homologues of Angiosperm bZIP Genes The Arabidopsis genome encodes for a possible complete set of 77 unique bZIP proteins, representing an update of previous results [55], [56], [70]. AtbZIP73 contains a premature stop codon and was thus not considered further in our analyses. As it appears to be a pseudogene it should be referred to as ΨAtbZIP73. Through iterated searches with tblastn and blastx algorithms, and PFAM bZIP Hidden Markov Models (HMM), we identified 92 bZIP genes in rice (Text S1a). Recently, Nijhawan et al. [71] reported the presence of 89 bZIP genes in rice and their phylogenetic relationship to the Arabidopsis bZIPs. Of the 89 bZIPs, 86 are also present in this study. Careful sequence analyses of both gene sets revealed complete sequence identity of the Os06g50480 and Os06g50830 TFs, and complete identity with TF Os06g50600 (OsbZIP14) along amino acids 1–143, indicating that these sequences were redundant in the Nijhawan et al. data set. Os03g59460 has also been identified in our studies, however, the protein it encodes contains a proline residue at the beginning of its leucine zipper, precluding dimerization [25]; thus it may not function like other known bZIPs. Despite OsbZIP24 and OsbZIP75 being classified as retrotransposons in TIGR, we included them in our analysis as they possess a standard bZIP sequence in their open reading frame. Table S1 gives a summary of this information. We identified 89 bZIP sequences in P. trichocarpa, some of which were incomplete. We therefore performed a more refined analysis of genomic data sets taking into account gene structures and conserved motifs. This allowed us to resolve the entire bZIP gene sequences in nine cases (Datasets S1 and S2). Through Neighbour-Joining (NJ) analysis of the minimum bZIP domain (44 amino acids; Text S1a) of 257 unique bZIPs from Arabidopsis, rice and black cottonwood (bZARP data set) we identified seven clusters of proteins with bootstrap support greater than or equal to 50%, defining the groups of homologous genes B, D, F, G, H, J and K. The topology of the phylogenetic tree and a bootstrap support of 50% indicate that Groups D and F are sister groups that share a common ancestor (Figures 1A
Members of Groups A and D have a bZIP domain of only 44 amino acids. To refine our analysis we created a subset-of-bZARP (sbZARP) dataset that excluded groups A and D members but included all remaining 172 proteins with a bZIP domain of 60 amino acids (53, 60 and 59 bZIPs from Arabidopsis, rice and black cottonwood, respectively). NJ analyses revealed four new groups of homologues, Groups C, E, I and L, all supported by bootstrap values of >50% (Figure S3; note that Group L members harbor an atypical basic motif; see Figure S2, and Text S1c). The overall organization into twelve groups is further supported by the presence of at least one shared intron position among the members of each group, confirming a common ancestral origin of all its members (Figures 1A
In summary, our data suggest 13 groups of homologous angiosperm bZIP genes (A, B, C, D, E, F, G, H, I, J, K, L, and S), representing a unified classification of angiosperm bZIPs (Figure 3
Possible Groups of Orthologues (PoGOs) in Angiosperms We next aimed at identifying Possible Groups of Orthologues (PoGOs) among the 13 groups of homologues. By definition, each PoGO represents a group of genes that diverged from an ancestral gene through speciation and duplication. Members of a given PoGO typically have closely related biological functions, and this allows making predictions for poorly characterized genes and rationalizes functional studies of the proteins they encode [72]. PoGOs also establish a basis for the definition of functional diversification among genes. Here, we identified PoGOs by NJ analysis of each group of homologues separately, using the criteria defined in Material and Methods. To optimize the resolution of the evolutionary relationships, alignment lengths were extended by including conserved motifs specific to each group of homologues (Figure 2 Our analysis revealed 31 PoGOs distributed among Groups A to L (Figures 3 We identified only one eudicot-monocot PoGO, S1, in Group S (Figure S16). The remaining sequences could be clustered into three PoGOs each restricted to either eudicots (SE1, SE2 and SE3) or monocots (SM1, SM2 and SM3) (Figure S16). Arabidopsis bZIP TFs of groups SE2 and SE3 are involved in energy metabolism and hypoosmolarity signaling (Table S4) further supporting the evolutionary relationship deduced from the phylogenetic analysis. Similarly, SM2 members play a role in cold signaling (Table S4), thus providing function-based support also for this group. Although further efforts to more precisely uncover the relationship between the three monocot (SM1, SM2 and SM3) and eudicot (SE1, SE2 and SE3) groups of orthologues proved unsuccessful, we propose that up to three additional eudicot-monocot PoGOs, besides S1, exist in Group S (as a minimal representation of the three possible monocot and eudicot PoGOs). The difficulty of organizing Group S bZIPs into PoGOs that comprise both eudicots and monocots sequences may reflect an increased evolutionary rate after their emergence. Rapid evolution can mainly be explained by relaxation of purifying selection or by positive selection. We used the Yang algorithm [73] to verify whether lineage-specific dN/dS ratios in Arabidopsis, black cottonwood and rice (the ω parameter, [74], [75]) of Group S were different from that of all other groups. The ω value for Group S (0.12) was found to be significantly different from the average ω calculated for all other groups (0.03, likelihood ratio test , p<0.01). Despite being under purifying selection (ω<1), the value of ω for Group S is four times higher than the average. Thus it can be concluded that purifying selection is relaxed in this group, explaining the higher rate of sequence divergence among its members. Low selective constraint (i.e., low purifying selection) is a hallmark of more recently duplicated genes and can be correlated with functional diversification [76]. The extensive amplification of Group S members in angiosperms (see below) further supports the notion that functional diversification partly related to the control of energy metabolism is operating among Group S genes.In Group G, we observed one PoGO that is restricted to monocots (PoGO G4; Figure S10). This may be explained by gene gain at an early phase of monocot radiation, or alternatively by gene loss in the ancestor of the eudicot lineage. Our analysis also revealed the existence of a Possible Group of Paralogues (PoGP) restricted to Group I in Arabidopsis (PoGP I1, Figure S12). This PoGP most probably reflects a recent duplication event followed by rapid divergence in the Arabidopsis lineage. As PoGO G4 and PoGP I1 are restricted to distinct evolutionary lineages, they probably do not play essential (common) roles in angiosperms as a whole. This conclusion is supported by the fact that EmBP from maize and wheat, both assigned to PoGO G4, control reserve protein (prolamin) production [77] which can be considered a monocot-specific function. Gene duplication is an important means of evolutionary diversification. Therefore, PoGOs that preferentially expanded during angiosperm evolution are expected to include genes that were particularly important for establishing angiosperm-specific physiological or functional characteristics. Of the 13 groups of homologous genes, Groups A, D, E, I and S contain more genes per PoGO than the average (approximately six genes per PoGO, Figure S17), indicating their preferential contribution to the evolution of adaptive characteristics in angiosperms. Interestingly, Groups A, D and S include genes for responses and adaptation to environmental factors (abiotic and biotic stresses in Groups A/S and D, respectively; Table S4) and the control of energy use (Group S; Table S4). These observations raise the possibility that genes of these groups were particularly important for the colonization of new habitats and consequently for the radiation and expansion of angiosperms (Text S1d). Additionally, some PoGOs have a conserved one-to-one gene relationship, indicating that their genes may play a pivotal role during development (Text S1e) In summary, we propose the existence of 31 monocot-eudicot PoGOs in Groups A to L, one monocot-specific PoGO (G5), one PoGP (I1) in Arabidopsis, and possibly three PoGOs in Group S. The 34 PoGOs are likely to be related to 34 possible ancestral functions of bZIPs in angiosperms (Figure 3 Tracing the Origin and Diversification of bZIP Genes in Green Plants Based on the phylogenetic analyses and the bZIP gene structures from Arabidopsis, black cottonwood and rice, we propose a model for the evolution of angiosperm bZIPs (Figure 1A To identify groups of homologues among the major eukaryotic lineages, i.e. animals, fungi, and plants, we performed a large-scale phylogenetic analysis using the conserved bZIP region of all bZIPs from Homo sapiens [78], Caenorhabditis elegans (http://www.wormbase.org/), Drosophila melanogaster [79], Saccharomyces cerevisiae (http://mips.gsf.de/genre/proj/yeast/), A. thaliana and O. sativa. This analysis revealed that bZIPs of each of these lineages share only one common ancestor (data not shown) which is in accordance with the fact that only a single bZIP sequence is present in the primitive eukaryote Giardia lamblia [80], [81], perhaps representing the bZIP gene content prior to the plant/animal/fungal separation [80]. The function of this unique ancestral gene may be related to unfolded protein (UPR) and oxidative stress responses (see below). Deep evolutionary analyses have also been performed for the homeodomain and MADS-box families and it appears that their member TFs derived from at least two genes present in the last common ancestor of the three eukaryotic kingdoms [19], [82]. It has been proposed that one of the ancestral functions of the MIKCc class of MADS-box genes is an involvement in reproductive organ development [83], [84]. Although this function appears to be conserved, it is still not clear whether it has a monophyletic origin. We identified 7, 8, and 40 bZIP genes, respectively, in the genomes of the algae Chlamydomonas reinhardtii and Ostreococcus tauri and the moss Physcomitrella patens (however, a complete bZIP domain is missing in three of the moss proteins). Additionally, we identified bZIP sequences from assembled ESTs of species representing the most relevant divisions of the green plants from which sequences are available: four bZIP genes in the bryophyte Marchantia polymorpha, one each in the ferns Selaginella moellendorffii and Adiantum capillus-veneris, and 40 and nine, respectively, in the gymnosperms Pinus taeda and Picea glauca (Table S5). Although no complete genomic sequences were available for ferns or gymnosperms, a considerable number of ESTs is available for the latter. We assembled a set of 345 bZIPs from algae to angiosperms (ViridiZIP set) for phylogenetic analyses (Figures 1B, 1C
Our study revealed that Group H is the most conserved group of bZIP homologues; members of this group are present in all green plant lineages. This observation is particularly interesting because Group H includes HY5 and HYH that are important regulators of light responses and anthocyanin biosynthesis (Table S4). We therefore propose that Hy5-like bZIPs control light-dependent processes in all green plants. Similar to bZIPs in Group H, DOF transcription factors involved in light responses (subfamily A) also appear to be well conserved, suggesting that genes involved in light-related functions are under strong selective constraints [85]. In Arabidopsis Hy5-mediated photomorphogenesis is negatively regulated by the E3 ubiquitin ligase Cop1, which ubiquitylates Hy5 protein leading to its degradation [86]. We detected Cop1-related proteins in Physcomitrella, in agreement with previous results, as well as the Cop1-interaction motif in Physcomitrella Hy5-like bZIPs, suggesting that the genetic toolkit for photomorphogenesis described in angiosperms is also present in mosses [87]. We also detected a single gene similar to COP1 in Ostreococcus (ID 30007), but while in higher plants Cop1 protein contains a RING domain at the N-terminus, followed by multiple WD40 repetitions [88], this order is reversed in the Ostreococcus protein. Moreover, a Cop1 interaction site (Table S2) was not detected in the algal HY5-orthologues OtbZIP1 or CrbZIP1, or in any other green algae bZIP. Nevertheless, we found one Cop1-related protein in the red alga Cyanidioschyzon merolae (ID CMK039C; http://merolae.biol.s.u-tokyo.ac.jp/). Cop1-like proteins are also known in animals where they promote the degradation of the bZIP transcription factor c-Jun [88], suggesting Cop1-dependent protein degradation to be a regulatory scheme conserved in most eukaryotes. Groups B, C, D, E, F, G, I and J were present in the most recent common ancestor (MRCA) of bryophytes and tracheophytes, indicating a functional connection to the colonization of the terrestrial environment (Figure 5
Group A probably first appeared in the MRCA of spermatophytes and may thus be related to seed formation (Figure 5 According to our data Groups K, L and S are angiosperm-specific (Figure 5 We also detected Group NA, a possible group of homologues exclusively present in non-angiosperm plants (Figure S18, and Text S1g). This finding is intriguing as genes conserved in mosses and gymnosperms are expected to represent general plant functions. Group NA bZIPs may thus have lineage-specific roles unimportant for angiosperms; the reduction of a dominant gametophyte during angiosperm evolution combined with a concomitant gene loss is an example for this. Alternatively, gene loss could have played a key role in the acquisition of important features in angiosperms, as seen for KNOX genes [91]; or, the roles played by bZIPs of Group NA could have been taken over by non-related but functionally analogous genes (non-orthologous gene displacement). Ancestral Relationships in Groups B and C The above analysis in combination with detailed sequential NJ analyses restricted to algal, moss and/or Arabidopsis sequences revealed two new groups, i.e. Groups Proto-B and Proto-C (Figure 1B Group Proto-B consists of Group B, which includes members from bryophytes and angiosperms, a new group of homologues (Group b) that is apparently restricted to bryophytes and gymnosperms, and the Ostreococcus gene OtbZIP3 (Figures 1B Another important result of our analysis is that Ostreococcus sequences could be included, with significant bootstrap support, into Groups Proto-C (OtbZIP5) and Proto-B (OtbZIP3; Figure 1B In conclusion, our results indicate that four Ostreococcus bZIP genes can be assigned to Groups Proto-C (OtbZIP5), Proto-B (OtbZIP3), G+J (OtbZIP2), and H (OtbZIP1), defining four orthologous relationships between algal and five groups of homologues from terrestrial plants (Figure 6
bZIP Evolution in Plants Our data show that Group C and B members are elements of the oxidative stress signaling and UPR pathways, respectively, which appear to be crucial in all eukaryotes. This observation and the likely monophyletic origin of bZIPs of the main eukaryotic lineages (plants, animals, and fungi) suggest that the common bZIP ancestor was a multifunctional regulatory factor. An important consequence of this model is that Group H, which has a central role in light-mediated control, emerged from bZIPs of the oxidative stress and UPR regulatory modules. The integration of the branch leading to Group G+J, however, remains unclear which is partially due to the fact that functional information is limited and restricted to Group G that plays a role in light and ABA signaling. From the extant algal sequences that do not cluster into any of the homologous groups of streptophytes, only a single group of homologues restricted to algae could be detected (Group AL; Figures 1C We estimated the number of bZIPs in the MRCA of all land plants (embryophytes), using the method of Hahn et al. [100]; the MRCA most likely had 64 bZIPs that expanded to 83 in the branch leading to seed plants. The rate of gene gain-loss, λ, in the seed plant lineage was found to be 2.01×10−3 per million years, which is similar to estimates for yeast (0.002) [100] and mammals (0.0016) [101]. We calculated expansions and contractions of the bZIP phylogenetic branches in the land plant lineage, using the estimated value for λ; this revealed a significant expansion (p<0.05) of the branch leading to the seed plant lineage. Finally, the evolution of the bZIP gene family is well explained by the random birth-and-death model in seed plants, i.e., no significant expansions/contractions occurred preferentially in any specific PoGO or group of homologues (Figure S19, and Text S1h). Conclusions In our analysis presented here we systematically classified bZIP TFs into PoGOs and considered existing knowledge about their biological functions to establish a robust methodology to reveal evolutionary relationships of this group of regulatory proteins. The moss Physcomitrella possesses almost five times more bZIP genes (37 genes, Table S5) than the alga Ostreococcus (8 genes), and half the number found in angiosperms (around 80 genes). Group A genes first appeared in the MRCA of spermatophytes and were recruited for seed development or germination but also to fine tune the responses to desiccation/dehydration and salt stress. Groups K, L and S are seemingly exclusive to angiosperms. Unexpectedly, Groups K and S control processes conserved in all eukaryotes, i.e. UPR and energy homeostasis. This apparent paradox can be explained by the fact that both, Groups K and S derived from the functionally related Groups Proto-B and Proto-C, respectively, that emerged early on during green plant evolution. Group S amplification likely contributed to refining the regulatory circuit controlling the organism's energy status. The most strongly conserved group of homologues in algae and angiosperms is Group H which includes light control factors HY5 and HYH. Group H is representative of one of the four green plant founder bZIP genes. Our data thus establish the hypothesis that bZIP-controlled light responses of Group H emerged (through neofunctionalization) from a multifunctional ancestral gene of the UPR and oxidative stress response pathways (UPR/oxidative stress). The UPR/oxidative stress gene is also the ancestor of two other of the four founder genes, i.e. Groups Proto-B (UPR) and Proto-C (oxidative stress), which most likely diverged through subfunctionalization processes. The fourth founder gene, represented by Groups G and J, is the sister gene of the multifunctional UPR/oxidative stress gene. More functional data for Group G- and J-related bZIPs are required to further elaborate the model of green plant bZIP evolution. Materials and Methods Datasets of bZIP Genes We generated a bZIP dataset (Angiotot) representing an updated version of the ABZ data set [56]. Plant bZIP sequences were identified as described by Riaño-Pachón et al. [102]. The whole proteomes deduced from the completely sequenced genomes of the algae Ostreococcus tauri [68] and Chlamydomonas reinhardtii [67], the bryophyte Physcomitrella patens [69], and the angiosperm Populus trichocarpa [59] were downloaded from the Joint Genome Institute/Department of Energy (JGI/DOE; http://www.jgi.doe.gov/). Protein sequences for the angiosperm Arabidopsis thaliana [54] were downloaded from The Arabidopsis Information Resource (TAIR; http://www.arabidopsis.org/), and from The J. Craig Venter Institute (http://www.tigr.org/) for the monocot Oryza sativa ssp. japonica [58]. Assembled ESTs from Marchantia polymorpha, Physcomitrella patens, Adiantum capillus-veneris, Selaginella moellendorffii, Picea glauca, Pinus taeda, Brassica napus, Glycine max, Heliathus annus, Medicago truncatula, Solanum lycopersicum, Solanum tuberosum, Hordeum vulgare, Saccharum ssp., Sorghum bicolor, Triticum aestivum and Zea mays were downloaded from the TIGR Plant Transcript Assemblies Database [103]. ESTs from Oryza sativa ssp. indica were downloaded from the Beijing Genomics Institute website (07.11.2006), and assembled into clusters using TGICL [104]. Additional rice bZIP sequences were obtained from the Full Length Rice cDNA Consortium [105]. Some sequences from completely sequenced genomes were re-annotated (Datasets S1 and S2), based on conserved protein motifs and gene structures of each family. The list of abbreviations of the organisms used is given in Table S6. The tblastn program [106] was used to search for bZIP sequences in rice nucleotide databases (Oryza sativa ssp. indica [57]; Beijing Genomics Institute, http://btn.genomics.org.cn/rice, and Oryza sativa ssp. japonica; Syngenta, http://www.syngenta.com/; IRGSP, http://www.gramene.org/) using Angiotot as query. Sequences with an e-value <10−4 were selected to form a subset (SeqZIP), from which false positive hits, corresponding mainly to low complexity regions, and hits that we initially identified using the above procedure were excluded. To identify the open reading frame and gene structure of each SeqZIP sequence, pairwise blastx analyses against their respective Angiotot best hits were performed. Gene structures were defined based on the alignments obtained, the conserved positions of introns in homologous bZIP genes, and the presence of canonical splicing sites (GT-AG). The protocol used for bZIP identification is described in Figure S20. The procedure used to identify bZIPs in EST datasets was identical to that used for genomic sequences, except that the estwisedb program of the Wise2 package [107] was included to identify the most likely reading frames and its bZIP domains in a given cluster. Phylogenetic Analyses Alignment of bZIP protein sequences was performed by ClustalX [108], using default parameters, and subsequently adjusted manually. The alignments used for the analyses within each group of homologues represent a concatenated sequence of the different conserved motifs found within each group (Figure 2 Identification of Conserved Motifs The putative complete sets of unique bZIPs from Chlamydomonas, Ostreococcus, Physcomitrella, black cottonwood, Arabidopsis and rice served as input for a conserved motif analysis performed with MEME (http://meme.sdsc.edu/meme/meme.html) [120]. Whole protein sequences were employed for this search. A given motif was allowed to appear at any number of repetitions, the maximum width of a motif was set to 80, and the maximum number of motifs was set to 20. The other parameters were used as default. In a complementary approach, each group of homologues was analyzed individually with the parameters described above. Phylogenetic Analyses and Identification of Possible Groups of Orthologoues (PoGOs) The detailed evolutionary analysis of angiosperm bZIP sequence relationships within each group allowed the identification of PoGOs. A PoGO is defined by the following criteria: (i) members of a PoGO have a monophyletic origin, indicated by a bootstrap support greater than 50%; (ii) a PoGO possesses at least one representative gene each from A. thaliana and O. sativa, assuming that the putative complete sets of bZIP genes of these organisms were identified and no selective gene loss had occurred. In case a PoGO is found to be restricted to either monocots or eudicots, the presence of sequences from at least one other species of the same lineage in this PoGO is required; and (iii) the inferred phylogeny should be consistent with the known phylogeny of plant species [56]. Identification of Pseudogenes and Genomic Duplications Search for pseudogenes in Chlamydomonas, Ostreococcus, black cottonwood, Arabidopsis and rice was performed by masking the genomic region for each identified bZIP. Blastx searches were performed against the masked sequences using the Angiotot bZIP database as query. A hit was considered as a pseudogene only if it possessed all or part of the bZIP domain; therefore all hits were compared against bZIP PFAM models [121] and manually cured, eliminating false positives. Genomic duplications in Arabidopsis were identified via “Paralogons in Arabidopsis thaliana” (http://wolfe.gen.tcd.ie/athal/dup) and ‘MATDB: Segmental Duplications’ from MIPS (Munich Information Center for Protein Sequences; http://www.mips.gsf.de/projects/plants) (Table S7). Analysis of Gene Family Expansion and Contraction The evolution of rates of bZIP gene gain and loss along the history of green plants was analyzed by the method of Hahn et al. [100], implemented in CAFÉ [122]. The method models gene family evolution as a stochastic birth-and-death process implemented as a probabilistic graphical model that allows for the inference of the most likely family sizes in the common ancestors of every branching point. In this way one can test the null hypothesis of random change in the family size. To avoid incomplete sampling, only plants with fully sequenced genomes were analyzed. The algorithm developed by Hahn et al. uses a birth-and-death parameter, λ, which was also estimated within CAFE. In addition to the parameter λ, CAFE needs divergence times to be entered along with the phylogeny of the organisms used. Since the inference of the size of gene families at deep evolutionary times is not reliable with any of the current methods available (Hahn, personal communication; [100]), we focused on land plants only. Tree topology and divergence times are shown in Figure S19. Significance of the contractions and expansions along branches was accessed by means of the three methodologies available in CAFE: branch cutting, likelihood ratio test, and Viterbi assignments [122]. Gene Expression Analysis Absolute signal intensity values from Arabidopsis ATH1_22K array (Affymetrix) was obtained through Meta-Analyzer from GENEVESTIGATOR (http://www.genevestigator.ethz.ch/) [123]. The developmental stages were as described by Boyes et al. [124]. Massively Parallel Signature Sequencing, MPSS, [125] was also verified for Arabidopsis and rice genes (Datasets S3 and S4). Figure S1 Definition of homologous gene groups A, D and F. This figure is a partial representation of the tree inferred from NJ analysis from the 258 non-redundant set of bZIPs from Arabidopsis, rice and black cottonwood using p-distance and 1000 bootstrap repetitions (indicated as percentages at the branch points). The alignment used corresponds to the minimum bZIP domain of 44 amino acids. Groups D and F are sister groups supported by a 50% bootstrap. Rice, black cottonwood and Arabidopsis sequences are represented in orange, dark blue and light blue, respectively. (1.01 MB TIF) Click here for additional data file.(989K, tif) Figure S2 Conserved intron position in the basic motif region of angiosperm bZIP transcription factors. The first leucine of the leucine zipper is highlighted in green, and the conserved asparagine of the basic motif is shown in red. According to the position of the introns, indicated by arrows, four different groups can be observed (1 to 4). bZIPs from Group L have a basic motif five amino acids shorter than that of the other bZIPs, and the conserved asparagine, shown in red, is substituted either by lysine (K) or arginine (R). In bold, the first amino acid after the intron. The bZIP genes used in this figure are: AtbZIP24 (Group F), AtbZIP45 (Group D), AtbZIP39 (Group A), AtbZIP54 (Group G), AtbZIP62 (Group J), AtbZIP63 (Group C), AtbZIP56 (Group H), AtbZIP61 (Group E), AtbZIP31 (Group I), AtbZIP60 (Group K), AtbZIP76 (Group L), AtbZIP70 (Group S), and AtbZIP49 (Group B). (1.85 MB TIF) Click here for additional data file.(1.7M, tif) Figure S3 Unrooted phylogenetic tree inferred from a NJ analysis from a subset of 173 bZIPs of Arabidopsis, rice and black cottonwood using p-distance and 1000 bootstrap repetitions (indicated as percentages at the branches). The alignment used corresponds to the minimal bZIP domain extended by two leucine repetitions, totaling 60 amino acids. Groups B, K and H, as well as Groups E and L are sister groups supported by bootstrap analysis. Rice, black cottonwood and Arabidopsis sequences are represented in orange, dark blue and light blue, respectively. (1.11 MB TIF) Click here for additional data file.(1.0M, tif) Figure S4 Phylogenetic tree of monocot and eudicot bZIPs of Group A. The unrooted tree was inferred by a NJ analysis from distances calculated with the PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motif A1 (Figure 2 (1.28 MB TIF) Click here for additional data file.(1.2M, tif) Figure S5 Phylogenetic tree of Group B bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances obtained from the PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motifs within this group (Figure 2 (0.31 MB TIF) Click here for additional data file.(307K, tif) Figure S6 Phylogenetic tree of Group C bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances calculated with the PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motif within this group (Figure 2 (2.03 MB TIF) Click here for additional data file.(1.9M, tif) Figure S7 Phylogenetic tree of Group D bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances calculated with the PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motifs within this group (Figure 2 (1.31 MB TIF) Click here for additional data file.(1.2M, tif) Figure S8 Phylogenetic tree of Group E bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances calculated with the PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motifs within this group (Figure 2 (0.31 MB TIF) Click here for additional data file.(307K, tif) Figure S9 Phylogenetic tree of Group F bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances calculated with the PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motif within this group (Figure 2 (0.83 MB TIF) Click here for additional data file.(813K, tif) Figure S10 Phylogenetic tree of Group G bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances calculated with the PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motifs within this group (Figure 2 (1.03 MB TIF) Click here for additional data file.(1009K, tif) Figure S11 Phylogenetic tree of Group H bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances obtained from a PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motif within this group (Figure 2 (0.85 MB TIF) Click here for additional data file.(829K, tif) Figure S12 Phylogenetic tree of Group I bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances obtained from a PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motifs within this group (Figure 2 (1.12 MB TIF) Click here for additional data file.(1.0M, tif) Figure S13 Phylogenetic tree of Group J bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances obtained from a PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motifs within this group (Figure 2 (0.14 MB TIF) Click here for additional data file.(139K, tif) Figure S14 Phylogenetic tree of Group K bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances obtained from a PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motif within this group (Figure 2 (0.82 MB TIF) Click here for additional data file.(801K, tif) Figure S15 Phylogenetic tree of Group L bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances obtained from a PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain plus the conserved motifs within this group (Figure 2 (0.47 MB TIF) Click here for additional data file.(459K, tif) Figure S16 Phylogenetic tree of Group S bZIPs from monocots and eudicots. An unrooted tree was inferred by a NJ analysis from distances obtained from a PAM distance matrix. The bootstrap values correspond to 1000 repetitions and are indicated as percentage in every branch. The amino acid alignment used to generate this tree corresponds to the bZIP domain. Rice, black cottonwood and Arabidopsis sequences are represented in orange, dark blue and light blue, respectively. Other eudicot and monocot sequences are show in green and red, respectively. The organism from which the remaining monocot and eudicot bZIPs originated is indicated by the last two letters in each sequence. Abbreviations are explained in Table S6. (2.04 MB TIF) Click here for additional data file.(1.9M, tif) Figure S17 Gene amplification pattern in each angiosperm group of bZIP homologues. (0.77 MB TIF) Click here for additional data file.(749K, tif) Figure S18 Identification of Groups cI and cII. Both trees are a partial representation of the whole tree obtained by NJ analyses. (A) In the initial phylogenetic analysis with the complete ViridiZIP set, we were able to identify two clusters of genes that did not posses any member from angiosperms; therefore, we called them NA (non-angiosperm). (B) Restricted analyses including bZIPs from algae and mosses uncovered the relationship of Groups NA and C; both groups share the same homologue in Ostreococcus (OtbZIP5), indicating it to be a common ancestor. Group NA was re-classified into Groups cI and cII. Their relation to members of Group NA shown in (A) is indicated by stars (* for Group cII, or ** for Group cI). Groups cI, cII, C and OtbZIP5 form the Group Proto-C. The bootstrap support of each group is shown in the figure. (2.44 MB TIF) Click here for additional data file.(2.3M, tif) Figure S19 Evolution of the bZIP family of transcription factors in land plants. We estimated the birth-and-death parameter (λ) using CAFE, as described in Materials and Methods. (A) The examined values of λ ranged from 1.0×10−4 to 6.8×10−3. The log probabilities obtained for each assayed value are shown. The shadowed region is displayed at a higher scale in the inset, where a peak at λ = 0.002011 is observed. (B) Evolutionary relationships of land plants with divergence time points (Arabidopsis - black cottonwood, 100–120 million years ago (mya) (47); monocot - eudicot, 140–150 mya (57); Physcomitrella - angiosperms, 450 mya (58)). Numbers at the branch end points indicate the numbers of bZIPs observed in the extant species. Numbers at the nodes represent the expected number of bZIPs in the ancestral species. Using the three methods available in CAFE, i.e., Viterbi assignments, branch cutting and likelihood ratio test, we identified branches deviating from the background model. According to all three methods, the branch leading to angiosperms significantly deviates from the null model (p<0.05), which implies that there was a significant increase in the number of bZIPs in the lineage leading to that group. Similarly, the Viterbi and branch cutting methods identify the branch leading to bryophytes (Physcomitrella) exhibiting a significant reduction in the number of bZIPs (p<0.05). Finally, we did not observe any significant deviation of the model for the extant group of angiosperms which can be interpreted as an even diffusion of the number of bZIPs in each branch. However, one cannot exclude the effect of natural selection in accounting for the differences that are nevertheless occurring. The increased number of bZIPs in the branch leading to angiosperms might be, at least partly, related to the several genome-wide duplication events that took place in the history of that lineage.(1.62 MB TIF) Click here for additional data file.(1.5M, tif) Figure S20 Scheme of the pipeline for bZIP identification in genomic sequences and ESTs. (I) Input genomic and EST sequences are compared by tblastn with the Angiotot protein dataset, generating a group of sequences that putatively code for bZIPs (SeqZIP). (II) Manual curation allowed subtracting sequences already present in Angiotot (redundancies) and false positives, which mainly correspond to low-complexity sequences. (III) The remaining sequences (true positives) are compared by tblastx against the best hit from Angiotot obtained in step I, allowing to identify the most probable ORF, and in the case of genomic sequences, to identify their gene structure, taking into account conserved intron positions and the presence of canonic splicing sites (GT-AG). (0.75 MB TIF) Click here for additional data file.(735K, tif) Table S1 Comparison between bZIPs reported in this manuscript and in Nijhawan et al. (2008) (0.04 MB XLS) Click here for additional data file.(41K, xls) Table S3 Accession numbers and classification into groups of homologues of non-sequenced angiosperms. (0.03 MB PDF) Click here for additional data file.(26K, pdf) Table S4 Biological functions of genes in PoGOs. (0.02 MB PDF) Click here for additional data file.(18K, pdf) Table S5 Classification of non-angiosperm bZIPs. (0.02 MB XLS) Click here for additional data file.(20K, xls) Table S7 Gene pairs resulting from segmental duplications of the Arabidopsis genome. (0.03 MB DOC) Click here for additional data file.(33K, doc) Dataset S1 Re-annotated nucleotide sequences from rice and black cottonwood. (0.02 MB TXT) Click here for additional data file.(16K, txt) Dataset S2 Re-annotated amino acid sequences from rice and black cottonwood. (0.01 MB TXT) Click here for additional data file.(6.1K, txt) Dataset S3 MPSS Expression data for bZIP genes from rice. (0.02 MB PDF) Click here for additional data file.(21K, pdf) Dataset S4 MPSS Expression data for bZIP genes from Arabidopsis. (0.01 MB PDF) Click here for additional data file.(14K, pdf) Text S1 Supporting texts including further results and discussion. (0.06 MB DOC) Click here for additional data file.(61K, doc) Acknowledgments We thank Amanda Bortolini Silveira (Universidade Estadual de Campinas, Brazil) for nuclear localisation experiments on Group L bZIPs, and Stefanie Hartmann (University of Potsdam) for critical comments on our manuscript, Liam Childs (MPI of Molecular Plant Physiology, Potsdam) for improving our English and the two reviewers for their helpful comments on the manuscript. Footnotes Competing Interests: The authors have declared that no competing interests exist. Funding: L.G.G.C. thanks the DAAD for providing a scholarship (A/04/34814). D.M.R.P. acknowledges financial support from the BMBF (FKZ 0315046). This work was supported in part by grants from the Fundação de Amparo a Ciência do Estado de São Paulo (FAPESP), Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (Capes) (to L.G.G.C and M.V.), the University of Potsdam Interdisciplinary Research Centre ‘Advanced Protein Technologies’ (to B.M.-R.), the DAAD/DFG International PhD Programme ‘Integrative Plant Science’ (DAAD D/04/01336; to B.M-R.), and the Fonds der Chemischen Industrie (N° 0164389; to B.M.-R.). References 1. Meshi T, Iwabuchi M. Plant transcription factors. Plant Cell Physiol. 1995;36:1405–1420. [PubMed] 2. Beckett D. Regulated assembly of transcription factors and control of transcription initiation. J Mol Biol. 2001;314:335–352. [PubMed] 3. Wray GA, Hahn MW, Abouheif E, Balhoff JP, Pizer M, et al. The evolution of transcriptional regulation in eukaryotes. Mol Biol Evol. 2003;20:1377–1419. [PubMed] 4. Warren AJ. Eukaryotic transcription factors. Curr Opin Struct Biol. 2002;12:107–114. [PubMed] 5. Wingender E, Chen X, Fricke E, Geffers R, Hehl R, et al. The TRANSFAC system on gene expression regulation. Nucleic Acids Res. 2001;29:281–283. [PubMed] 6. Riechmann JL, Ratcliffe OJ. A genomic perspective on plant transcription factors. Curr Opin Plant Biol. 2000;3:423–434. [PubMed] 7. Hsia CC, McGinnis W. Evolution of transcription factor function. Curr Opin Genet Dev. 2003;13:199–206. [PubMed] 8. Chen K, Rajewsky N. The evolution of gene regulation by transcription factors and microRNAs. Nat Rev Genet. 2007;8:93–103. [PubMed] 9. Lawton-Rauh A. Evolutionary dynamics of duplicated genes in plants. Mol Phylogenet Evol. 2003;29:396–409. [PubMed] 10. Levine M, Tjian R. Transcription regulation and animal diversity. Nature. 2003;424:147–151. [PubMed] 11. Shiu SH, Shih MC, Li WH. Transcription factor families have much higher expansion rates in plants than in animals. Plant Physiol. 2005;139:18–26. [PubMed] 12. Riaño-Pachón DM, Corrêa LGG, Trejos-Espinosa R, Mueller-Roeber B. Green transcription factors: a chlamydomonas overview. Genetics. 2008;179:31–39. [PubMed] 13. Irish VF. The evolution of floral homeotic gene function. Bioessays. 2003;25:637–646. [PubMed] 14. Garcia-Fernandez J. The genesis and evolution of homeobox gene clusters. Nat Rev Genet. 2005;6:881–892. [PubMed] 15. Deppmann CD, Acharya A, Rishi V, Wobbes B, Smeekens S, et al. Dimerization specificity of all 67 B-ZIP motifs in Arabidopsis thaliana: a comparison to Homo sapiens B-ZIP motifs. Nucleic Acids Res. 2004;32:3435–3445. [PubMed] 16. Floyd SK, Bowman JL. The ancestral developmental tool kit of land plants. Int J Plant Sci. 2007;168:1–35. 17. Bowman JL, Floyd SK, Sakakibara K. Green genes-comparative genomics of the green branch of life. Cell. 2007;129:229–234. [PubMed] 18. Moreno-Risueno MA, Martinez M, Vicente-Carbajosa J, Carbonero P. The family of DOF transcription factors: from green unicellular algae to vascular plants. Mol Genet Genomics. 2007;277:379–390. [PubMed] 19. Derelle R, Lopez P, Le GH, Manuel M. Homeodomain proteins belong to the ancestral molecular toolkit of eukaryotes. Evol Dev. 2007;9:212–219. [PubMed] 20. Martinez-Castilla LP, Alvarez-Buylla ER. Adaptive evolution in the Arabidopsis MADS-box gene family inferred from its complete resolved phylogeny. Proc Natl Acad Sci U S A. 2003;100:13407–13412. [PubMed] 21. Parenicova L, de Folter S, Kieffer M, Horner DS, Favalli C, et al. Molecular and phylogenetic analyses of the complete MADS-box transcription factor family in Arabidopsis: new openings to the MADS world. Plant Cell. 2003;15:1538–1551. [PubMed] 22. Zhang Y, Wang L. The WRKY transcription factor superfamily: its origin in eukaryotes and expansion in plants. BMC Evol Biol. 2005;5:1. [PubMed] 23. Prigge MJ, Clark SE. Evolution of the class III HD-Zip gene family in land plants. Evol Dev. 2006;8:350–361. [PubMed] 24. Floyd SK, Zalewski CS, Bowman JL. Evolution of class III homeodomain-leucine zipper genes in streptophytes. Genetics. 2006;173:373–388. [PubMed] 25. Hurst HC. Transcription factors 1: bZIP proteins. Protein Profile. 1995;2:101–168. [PubMed] 26. Walsh J, Waters CA, Freeling M. The maize gene liguleless2 encodes a basic leucine zipper protein involved in the establishment of the leaf blade-sheath boundary. Genes Dev. 1998;12:208–218. [PubMed] 27. Chuang CF, Running MP, Williams RW, Meyerowitz EM. The PERIANTHIA gene encodes a bZIP protein involved in the determination of floral organ number in Arabidopsis thaliana. Genes Dev. 1999;13:334–344. [PubMed] 28. Abe M, Kobayashi Y, Yamamoto S, Daimon Y, Yamaguchi A, et al. FD, a bZIP protein mediating signals from the floral pathway integrator FT at the shoot apex. Science. 2005;309:1052–1056. [PubMed] 29. Silveira AB, Gauer L, Tomaz JP, Cardoso PR, Carmello-Guerreiro S, et al. The Arabidopsis AtbZIP9 protein fused to the VP16 transcriptional activation domain alters leaf and vascular development. Plant Sci. 2007;172:1148–1156. 30. Shen H, Cao K, Wang X. A conserved proline residue in the leucine zipper region of AtbZIP34 and AtbZIP61 in Arabidopsis thaliana interferes with the formation of homodimer. Biochem Biophys Res Commun. 2007;362:425–430. [PubMed] 31. Yin Y, Zhu Q, Dai S, Lamb C, Beachy RN. RF2a, a bZIP transcriptional activator of the phloem-specific rice tungro bacilliform virus promoter, functions in vascular development. EMBO J. 1997;16:5247–5259. [PubMed] 32. Fukazawa J, Sakai T, Ishida S, Yamaguchi I, Kamiya Y, et al. Repression of shoot growth, a bZIP transcriptional activator, regulates cell elongation by controlling the level of gibberellins. Plant Cell. 2000;12:901–915. [PubMed] 33. Ciceri P, Locatelli F, Genga A, Viotti A, Schmidt RJ. The activity of the maize Opaque2 transcriptional activator is regulated diurnally. Plant Physiol. 1999;121:1321–1328. [PubMed] 34. Weltmeier F, Ehlert A, Mayer CS, Dietrich K, Wang X, et al. Combinatorial control of Arabidopsis proline dehydrogenase transcription by specific heterodimerisation of bZIP transcription factors. EMBO J. 2006;25:3133–3143. [PubMed] 35. Zhang B, Foley RC, Singh KB. Isolation and characterization of two related Arabidopsis ocs-element bZIP binding proteins. Plant J. 1993;4:711–716. [PubMed] 36. Despres C, DeLong C, Glaze S, Liu E, Fobert PR. The Arabidopsis NPR1/NIM1 protein enhances the DNA binding activity of a subgroup of the TGA family of bZIP transcription factors. Plant Cell. 2000;12:279–290. [PubMed] 37. Pontier D, Miao ZH, Lam E. Trans-dominant suppression of plant TGA factors reveals their negative and positive roles in plant defense responses. Plant J. 2001;27:529–538. [PubMed] 38. Niggeweg R, Thurow C, Kegler C, Gatz C. Tobacco transcription factor TGA2.2 is the main component of as-1-binding factor ASF-1 and is involved in salicylic acid- and auxin-inducible expression of as-1-containing target promoters. J Biol Chem. 2000;275:19897–19905. [PubMed] 39. Thurow C, Schiermeyer A, Krawczyk S, Butterbrodt T, Nickolov K, et al. Tobacco bZIP transcription factor TGA2.2 and related factor TGA2.1 have distinct roles in plant defense responses and plant development. Plant J. 2005;44:100–113. [PubMed] 40. Kaminaka H, Nake C, Epple P, Dittgen J, Schutze K, et al. bZIP10-LSD1 antagonism modulates basal defense and cell death in Arabidopsis following infection. EMBO J. 2006;25:4400–4411. [PubMed] 41. Baena-Gonzalez E, Rolland F, Thevelein JM, Sheen J. A central integrator of transcription networks in plant stress and energy signalling. Nature. 2007;448:938–943. [PubMed] 42. Liu JX, Srivastava R, Che P, Howell SH. Salt stress responses in Arabidopsis utilize a signal transduction pathway related to endoplasmic reticulum stress signaling. Plant J. 2007;51:897–909. [PubMed] 43. Iwata Y, Koizumi N. An Arabidopsis transcription factor, AtbZIP60, regulates the endoplasmic reticulum stress response in a manner unique to plants. Proc Natl Acad Sci U S A. 2005;102:5280–5285. [PubMed] 44. Finkelstein RR, Lynch TJ. Abscisic acid inhibition of radicle emergence but not seedling growth is suppressed by sugars. Plant Physiol. 2000;122:1179–1186. [PubMed] 45. Uno Y, Furihata T, Abe H, Yoshida R, Shinozaki K, et al. Arabidopsis basic leucine zipper transcription factors involved in an abscisic acid-dependent signal transduction pathway under drought and high-salinity conditions. Proc Natl Acad Sci U S A. 2000;97:11632–11637. [PubMed] 46. Niggeweg R, Thurow C, Weigel R, Pfitzner U, Gatz C. Tobacco TGA factors differ with respect to interaction with NPR1, activation potential and DNA-binding properties. Plant Mol Biol. 2000;42:775–788. [PubMed] 47. Nieva C, Busk PK, Dominguez-Puigjaner E, Lumbreras V, Testillano PS, et al. Isolation and functional characterisation of two new bZIP maize regulators of the ABA responsive gene rab28. Plant Mol Biol. 2005;58:899–914. [PubMed] 48. Wellmer F, Kircher S, Rugner A, Frohnmeyer H, Schafer E, et al. Phosphorylation of the parsley bZIP transcription factor CPRF2 is regulated by light. J Biol Chem. 1999;274:29476–29482. [PubMed] 49. Osterlund MT, Hardtke CS, Wei N, Deng XW. Targeted destabilization of HY5 during light-regulated development of Arabidopsis. Nature. 2000;405:462–466. [PubMed] 50. Ulm R, Baumann A, Oravecz A, Mate Z, Adam E, et al. Genome-wide analysis of gene expression reveals function of the bZIP transcription factor HY5 in the UV-B response of Arabidopsis. Proc Natl Acad Sci U S A. 2004;101:1397–1402. [PubMed] 51. Satoh R, Fujita Y, Nakashima K, Shinozaki K, Yamaguchi-Shinozaki K. A novel subgroup of bZIP proteins functions as transcriptional activators in hypoosmolarity-responsive expression of the ProDH gene in Arabidopsis. Plant Cell Physiol. 2004;45:309–317. [PubMed] 52. Lara P, Onate-Sanchez L, Abraham Z, Ferrandiz C, Diaz I, et al. Synergistic activation of seed storage protein gene expression in Arabidopsis by ABI3 and two bZIPs related to OPAQUE2. J Biol Chem. 2003;278:21003–21011. [PubMed] 53. Vettore AL, Yunes JA, Cord NG, da Silva MJ, Arruda P, et al. The molecular and functional characterization of an Opaque2 homologue gene from Coix and a new classification of plant bZIP proteins. Plant Mol Biol. 1998;36:249–263. [PubMed] 54. Arabidopsis Genome Iniciative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000;408:796–815. [PubMed] 55. Jakoby M, Weisshaar B, Droge-Laser W, Vicente-Carbajosa J, Tiedemann J, et al. bZIP transcription factors in Arabidopsis. Trends Plant Sci. 2002;7:106–111. [PubMed] 56. Vincentz M, Bandeira-Kobarg C, Gauer L, Schlogl P, Leite A. Evolutionary pattern of angiosperm bZIP factors homologous to the maize Opaque2 regulatory protein. J Mol Evol. 2003;56:105–116. [PubMed] 57. Yu J, Hu S, Wang J, Wong GK, Li S, et al. A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science. 2002;296:79–92. [PubMed] 58. Goff SA, Ricke D, Lan TH, Presting G, Wang R, et al. A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science. 2002;296:92–100. [PubMed] 59. Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, et al. The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006;313:1596–1604. [PubMed] 60. Bennetzen J. The rice genome. Opening the door to comparative plant biology. Science. 2002;296:60–63. [PubMed] 61. Pennacchio LA. Insights from human/mouse genome comparisons. Mamm Genome. 2003;14:429–436. [PubMed] 62. Vincentz M, Cara FA, Okura VK, da Silva FR, Pedrosa GL, et al. Evaluation of monocot and eudicot divergence using the sugarcane transcriptome. Plant Physiol. 2004;134:951–959. [PubMed] 63. Tatusov RL, Koonin EV, Lipman DJ. A genomic perspective on protein families. Science. 1997;278:631–637. [PubMed] 64. Adams KL. Evolution of duplicate gene expression in polyploid and hybrid plants. J Hered. 2007;98:136–141. [PubMed] 65. Rijpkema AS, Gerats T, Vandenbussche M. Evolutionary complexity of MADS complexes. Curr Opin Plant Biol. 2007;10:32–38. [PubMed] 66. Woolfe A, Elgar G. Comparative genomics using Fugu reveals insights into regulatory subfunctionalization. Genome Biol. 2007;8:R53. [PubMed] 67. Merchant SS, Prochnik SE, Vallon O, Harris EH, Karpowicz SJ, et al. The Chlamydomonas genome reveals the evolution of key animal and plant functions. Science. 2007;318:245–250. [PubMed] 68. Derelle E, Ferraz C, Rombauts S, Rouze P, Worden AZ, et al. Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features. Proc Natl Acad Sci U S A. 2006;103:11647–11652. [PubMed] 69. Rensing SA, Lang D, Zimmer AD, Terry A, Salamov A, et al. The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants. Science. 2008;319:64–69. [PubMed] 70. Corrêa LGG. Análise Filogenética de Fatores de Transcrição bZIP em Angiospermas. 2004. (Phylogenetic analyses of bZIP transcription factors in angiosperms) [dissertation]. Universidade Estadual de Campinas, Campinas, Brazil. 71. Nijhawan A, Jain M, Tyagi AK, Khurana JP. Genomic survey and gene expression analysis of the basic leucine zipper transcription factor family in rice. Plant Physiol. 2008;146:333–350. [PubMed] 72. Thornton JW, DeSalle R. Gene family evolution and homology: genomics meets phylogenetics. Annu Rev Genomics Hum Genet. 2000;1:41–73. [PubMed] 73. Yang Z. Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol Biol Evol. 1998;15:568–573. [PubMed] 74. Kellogg EA. Evolution of developmental traits. Curr Opin Plant Biol. 2004;7:92–98. [PubMed] 75. Nam J, Kim J, Lee S, An G, Ma H, et al. Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms. Proc Natl Acad Sci U S A. 2004;101:1910–1915. [PubMed] 76. Lynch M, Katju V. The altered evolutionary trajectories of gene duplicates. Trends Genet. 2004;20:544–549. [PubMed] 77. Carlini LE, Ketudat M, Parsons RL, Prabhakar S, Schmidt RJ, et al. The maize EmBP-1 orthologue differentially regulates opaque2-dependent gene expression in yeast and cultured maize endosperm cells. Plant Mol Biol. 1999;41:339–349. [PubMed] 78. Vinson C, Myakishev M, Acharya A, Mir AA, Moll JR, et al. Classification of human B-ZIP proteins based on dimerization properties. Mol Cell Biol. 2002;22:6321–6335. [PubMed] 79. Fassler J, Landsman D, Acharya A, Moll JR, Bonovich M, et al. B-ZIP proteins encoded by the Drosophila genome: evaluation of potential dimerization partners. Genome Res. 2002;12:1190–1200. [PubMed] 80. Deppmann CD, Alvania RS, Taparowsky EJ. Cross-species annotation of basic leucine zipper factor interactions: Insight into the evolution of closed interaction networks. Mol Biol Evol. 2006;23:1480–1492. [PubMed] 81. Best AA, Morrison HG, McArthur AG, Sogin ML, Olsen GJ. Evolution of eukaryotic transcription: insights from the genome of Giardia lamblia. Genome Res. 2004;14:1537–1547. [PubMed] 82. Alvarez-Buylla ER, Pelaz S, Liljegren SJ, Gold SE, Burgeff C, et al. An ancestral MADS-box gene duplication occurred before the divergence of plants and animals. Proc Natl Acad Sci U S A. 2000;97:5328–5333. [PubMed] 83. Singer SD, Krogan NT, Ashton NW. Clues about the ancestral roles of plant MADS-box genes from a functional analysis of moss homologues. Plant Cell Rep. 2007;26:1155–1169. [PubMed] 84. Tanabe Y, Hasebe M, Sekimoto H, Nishiyama T, Kitani M, et al. Characterization of MADS-box genes in charophycean green algae and its implication for the evolution of MADS-box genes. Proc Natl Acad Sci U S A. 2005;102:2436–2441. [PubMed] 85. Shigyo M, Tabei N, Yoneyama T, Yanagisawa S. Evolutionary processes during the formation of the plant-specific Dof transcription factor family. Plant Cell Physiol. 2007;48:179–185. [PubMed] 86. Holm M, Ma LG, Qu LJ, Deng XW. Two interacting bZIP proteins are direct targets of COP1-mediated control of light-dependent gene expression in Arabidopsis. Genes Dev. 2002;16:1247–1259. [PubMed] 87. Richardt S, Lang D, Reski R, Frank W, Rensing SA. PlanTAPDB, a phylogeny-based resource of plant transcription-associated proteins. Plant Physiol. 2007;143:1452–1466. [PubMed] 88. Yi C, Deng XW. COP1 - from plant photomorphogenesis to mammalian tumorigenesis. Trends Cell Biol. 2005;15:618–625. [PubMed] 89. Kamisugi Y, Cuming AC. The evolution of the abscisic acid-response in land plants: comparative analysis of group 1 LEA gene expression in moss and cereals. Plant Mol Biol. 2005;59:723–737. [PubMed] 90. Marella HH, Sakata Y, Quatrano RS. Characterization and functional analysis of ABSCISIC ACID INSENSITIVE3-like genes from Physcomitrella patens. Plant J. 2006;46:1032–1044. [PubMed] 91. Singer SD, Ashton NW. Revelation of ancestral roles of KNOX genes by a functional analysis of Physcomitrella homologues. Plant Cell Rep. 2007;26:2039–2054. [PubMed] 92. Vinson C, Acharya A, Taparowsky EJ. Deciphering B-ZIP transcription factor interactions in vitro and in vivo. Biochim Biophys Acta. 2006;1759:4–12. [PubMed] 93. Lawrence CL, Maekawa H, Worthington JL, Reiter W, Wilkinson CR, et al. Regulation of Schizosaccharomyces pombe Atf1 protein levels by Sty1-mediated phosphorylation and heterodimerization with Pcr1. J Biol Chem. 2007;282:5160–5170. [PubMed] 94. Rodrigues-Pousada CA, Nevitt T, Menezes R, Azevedo D, Pereira J, et al. Yeast activator proteins and stress response: an overview. FEBS Lett. 2004;567:80–85. [PubMed] 95. Jaiswal AK. Nrf2 signaling in coordinated activation of antioxidant gene expression. Free Radic Biol Med. 2004;36:1199–1207. [PubMed] 96. Warabi E, Takabe W, Minami T, Inoue K, Itoh K, et al. Shear stress stabilizes NF-E2-related factor 2 and induces antioxidant genes in endothelial cells: role of reactive oxygen/nitrogen species. Free Radic Biol Med. 2007;42:260–269. [PubMed] 97. Makino C, Sano Y, Shinagawa T, Millar JB, Ishii S. Sin1 binds to both ATF-2 and p38 and enhances ATF-2-dependent transcription in an SAPK signaling pathway. Genes Cells. 2006;11:1239–1251. [PubMed] 98. Yoshida H, Haze K, Yanagi H, Yura T, Mori K. Identification of the cis-acting endoplasmic reticulum stress response element responsible for transcriptional induction of mammalian glucose-regulated proteins. Involvement of basic leucine zipper transcription factors. J Biol Chem. 1998;273:33741–33749. [PubMed] 99. Cox JS, Walter P. A novel mechanism for regulating activity of a transcription factor that controls the unfolded protein response. Cell. 1996;87:391–404. [PubMed] 100. Hahn MW, De Bie T, Stajich JE, Nguyen C, Cristianini N. Estimating the tempo and mode of gene family evolution from comparative genomic data. Genome Res. 2005;15:1153–1160. [PubMed] 101. Demuth JP, Wade MJ. Maternal expression increases the rate of bicoid evolution by relaxing selective constraint. Genetica. 2007;129:37–43. [PubMed] 102. Riaño-Pachón DM, Ruzicic S, Dreyer I, Mueller-Roeber B. PlnTFDB: an integrative plant transcription factor database. BMC Bioinformatics. 2007;8:42. [PubMed] 103. Childs KL, Hamilton JP, Zhu W, Ly E, Cheung F, et al. The TIGR Plant Transcript Assemblies database. Nucleic Acids Res. 2007;35:D846–D851. [PubMed] 104. Pertea G, Huang X, Liang F, Antonescu V, Sultana R, et al. TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics. 2003;19:651–652. [PubMed] 105. Kikuchi S, Satoh K, Nagata T, Kawagashira N, Doi K, et al. Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice. Science. 2003;301:376–379. [PubMed] 106. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. [PubMed] 107. Birney E, Clamp M, Durbin R. GeneWise and Genomewise. Genome Res. 2004;14:988–995. [PubMed] 108. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucl Acids Res. 1997;25:4876–4882. [PubMed] 109. Kumar S, Tamura K, Nei M. MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment. Brief Bioinform. 2004;5:150–163. [PubMed] 110. Felsenstein J. PHYLIP - Phylogeny Inference Package (Version 3.2). Cladistics. 1989;5:164–166. 111. Dayhoff MO, Schwartz RC, Orcutt BC. A model of evolutionary change in proteins. In: Dayhoff MO, editor. Atlas of protein sequence and structure. Silver Spring, MD: National Biomedical Research Fundation Silver; 1978. pp. 301–310. 112. Nei M, Kumar S. Molecular Evolution and Phylogenetics. New York: Oxford University Press; 2000. 113. Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–2690. [PubMed] 114. Jones DT, Taylor WR, Thornton JM. The rapid generation of mutation data matrices from protein sequences. Comput Appl Biosci. 1992;8:275–282. [PubMed] 115. Suzuki Y, Glazko GV, Nei M. Overcredibility of molecular phylogenies obtained by Bayesian phylogenetics. Proc Natl Acad Sci U S A. 2002;99:16138–16143. [PubMed] 116. Swofford DL. 2003. PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Sinauer Associates). 117. Tamura K, Nei M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. 1993;10:512–526. [PubMed] 118. Posada D, Crandall KA. MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998;14:817–818. [PubMed] 119. Schmidt HA, Strimmer K, Vingron M, von Haeseler A. TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics. 2002;18:502–504. [PubMed] 120. Bailey TL, Elkan C. The value of prior knowledge in discovering motifs with MEME. Proc Int Conf Intell Syst Mol Biol. 1995;3:21–29. [PubMed] 121. Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, et al. Pfam: clans, web tools and services. Nucleic Acids Res. 2006;34:D247–D251. [PubMed] 122. De Bie T, Cristianini N, Demuth JP, Hahn MW. CAFE: a computational tool for the study of gene family evolution. Bioinformatics. 2006;22:1269–1271. [PubMed] 123. Zimmermann P, Hirsch-Hoffmann M, Hennig L, Gruissem W. GENEVESTIGATOR. Arabidopsis microarray database and analysis toolbox. Plant Physiol. 2004;136:2621–2632. [PubMed] 124. Boyes DC, Zayed AM, Ascenzi R, McCaskill AJ, Hoffman NE, et al. Growth stage-based phenotypic analysis of Arabidopsis: a model for high throughput functional genomics in plants. Plant Cell. 2001;13:1499–1510. [PubMed] 125. Brenner S, Johnson M, Bridgham J, Golda G, Lloyd DH, et al. Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays. Nat Biotechnol. 2000;18:630–634. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||
Plant Cell Physiol. 1995 Dec; 36(8):1405-20.
[Plant Cell Physiol. 1995]Curr Opin Struct Biol. 2002 Feb; 12(1):107-14.
[Curr Opin Struct Biol. 2002]Curr Opin Plant Biol. 2000 Oct; 3(5):423-34.
[Curr Opin Plant Biol. 2000]Curr Opin Genet Dev. 2003 Apr; 13(2):199-206.
[Curr Opin Genet Dev. 2003]Nature. 2003 Jul 10; 424(6945):147-51.
[Nature. 2003]Plant Physiol. 2005 Sep; 139(1):18-26.
[Plant Physiol. 2005]Genetics. 2008 May; 179(1):31-9.
[Genetics. 2008]Bioessays. 2003 Jul; 25(7):637-46.
[Bioessays. 2003]Nat Rev Genet. 2005 Dec; 6(12):881-92.
[Nat Rev Genet. 2005]Nucleic Acids Res. 2004; 32(11):3435-45.
[Nucleic Acids Res. 2004]Mol Genet Genomics. 2007 Apr; 277(4):379-90.
[Mol Genet Genomics. 2007]Evol Dev. 2007 May-Jun; 9(3):212-9.
[Evol Dev. 2007]Proc Natl Acad Sci U S A. 2003 Nov 11; 100(23):13407-12.
[Proc Natl Acad Sci U S A. 2003]Plant Cell. 2003 Jul; 15(7):1538-51.
[Plant Cell. 2003]BMC Evol Biol. 2005 Jan 3; 5(1):1.
[BMC Evol Biol. 2005]Nucleic Acids Res. 2001 Jan 1; 29(1):281-3.
[Nucleic Acids Res. 2001]Protein Profile. 1995; 2(2):101-68.
[Protein Profile. 1995]Genes Dev. 1998 Jan 15; 12(2):208-18.
[Genes Dev. 1998]Biochem Biophys Res Commun. 2007 Oct 19; 362(2):425-30.
[Biochem Biophys Res Commun. 2007]EMBO J. 1997 Sep 1; 16(17):5247-59.
[EMBO J. 1997]Science. 2002 Apr 5; 296(5565):79-92.
[Science. 2002]Science. 2002 Apr 5; 296(5565):92-100.
[Science. 2002]Science. 2006 Sep 15; 313(5793):1596-604.
[Science. 2006]Nature. 2000 Dec 14; 408(6814):796-815.
[Nature. 2000]Science. 2002 Apr 5; 296(5565):60-3.
[Science. 2002]J Mol Evol. 2003 Jan; 56(1):105-16.
[J Mol Evol. 2003]Trends Plant Sci. 2002 Mar; 7(3):106-11.
[Trends Plant Sci. 2002]Science. 2007 Oct 12; 318(5848):245-50.
[Science. 2007]Proc Natl Acad Sci U S A. 2006 Aug 1; 103(31):11647-52.
[Proc Natl Acad Sci U S A. 2006]Science. 2008 Jan 4; 319(5859):64-9.
[Science. 2008]Trends Plant Sci. 2002 Mar; 7(3):106-11.
[Trends Plant Sci. 2002]J Mol Evol. 2003 Jan; 56(1):105-16.
[J Mol Evol. 2003]Plant Physiol. 2008 Feb; 146(2):333-50.
[Plant Physiol. 2008]Protein Profile. 1995; 2(2):101-68.
[Protein Profile. 1995]Trends Plant Sci. 2002 Mar; 7(3):106-11.
[Trends Plant Sci. 2002]Trends Plant Sci. 2002 Mar; 7(3):106-11.
[Trends Plant Sci. 2002]Trends Plant Sci. 2002 Mar; 7(3):106-11.
[Trends Plant Sci. 2002]J Mol Evol. 2003 Jan; 56(1):105-16.
[J Mol Evol. 2003]Plant Physiol. 2008 Feb; 146(2):333-50.
[Plant Physiol. 2008]Annu Rev Genomics Hum Genet. 2000; 1():41-73.
[Annu Rev Genomics Hum Genet. 2000]Mol Biol Evol. 1998 May; 15(5):568-73.
[Mol Biol Evol. 1998]Curr Opin Plant Biol. 2004 Feb; 7(1):92-8.
[Curr Opin Plant Biol. 2004]Proc Natl Acad Sci U S A. 2004 Feb 17; 101(7):1910-5.
[Proc Natl Acad Sci U S A. 2004]Trends Genet. 2004 Nov; 20(11):544-9.
[Trends Genet. 2004]Plant Mol Biol. 1999 Oct; 41(3):339-49.
[Plant Mol Biol. 1999]Mol Cell Biol. 2002 Sep; 22(18):6321-35.
[Mol Cell Biol. 2002]Genome Res. 2002 Aug; 12(8):1190-200.
[Genome Res. 2002]Mol Biol Evol. 2006 Aug; 23(8):1480-92.
[Mol Biol Evol. 2006]Genome Res. 2004 Aug; 14(8):1537-47.
[Genome Res. 2004]Evol Dev. 2007 May-Jun; 9(3):212-9.
[Evol Dev. 2007]Plant Cell Physiol. 2007 Jan; 48(1):179-85.
[Plant Cell Physiol. 2007]Genes Dev. 2002 May 15; 16(10):1247-59.
[Genes Dev. 2002]Plant Physiol. 2007 Apr; 143(4):1452-66.
[Plant Physiol. 2007]Trends Cell Biol. 2005 Nov; 15(11):618-25.
[Trends Cell Biol. 2005]Plant Mol Biol. 2005 Nov; 59(5):723-37.
[Plant Mol Biol. 2005]Plant J. 2006 Jun; 46(6):1032-44.
[Plant J. 2006]Plant Cell Rep. 2007 Dec; 26(12):2039-54.
[Plant Cell Rep. 2007]EMBO J. 2006 Jul 12; 25(13):3133-43.
[EMBO J. 2006]Biochim Biophys Acta. 2006 Jan-Feb; 1759(1-2):4-12.
[Biochim Biophys Acta. 2006]EMBO J. 2006 Sep 20; 25(18):4400-11.
[EMBO J. 2006]Nature. 2007 Aug 23; 448(7156):938-42.
[Nature. 2007]J Biol Chem. 2007 Feb 23; 282(8):5160-70.
[J Biol Chem. 2007]Proc Natl Acad Sci U S A. 2005 Apr 5; 102(14):5280-5.
[Proc Natl Acad Sci U S A. 2005]J Biol Chem. 1998 Dec 11; 273(50):33741-9.
[J Biol Chem. 1998]Cell. 1996 Nov 1; 87(3):391-404.
[Cell. 1996]Plant J. 2007 Sep; 51(5):897-909.
[Plant J. 2007]Mol Genet Genomics. 2007 Apr; 277(4):379-90.
[Mol Genet Genomics. 2007]Proc Natl Acad Sci U S A. 2005 Feb 15; 102(7):2436-41.
[Proc Natl Acad Sci U S A. 2005]Evol Dev. 2006 Jul-Aug; 8(4):350-61.
[Evol Dev. 2006]Genetics. 2006 May; 173(1):373-88.
[Genetics. 2006]Genome Res. 2005 Aug; 15(8):1153-60.
[Genome Res. 2005]Genetica. 2007 Jan; 129(1):37-43.
[Genetica. 2007]J Mol Evol. 2003 Jan; 56(1):105-16.
[J Mol Evol. 2003]BMC Bioinformatics. 2007 Feb 7; 8():42.
[BMC Bioinformatics. 2007]Proc Natl Acad Sci U S A. 2006 Aug 1; 103(31):11647-52.
[Proc Natl Acad Sci U S A. 2006]Science. 2007 Oct 12; 318(5848):245-50.
[Science. 2007]Science. 2008 Jan 4; 319(5859):64-9.
[Science. 2008]Nucleic Acids Res. 2007 Jan; 35(Database issue):D846-51.
[Nucleic Acids Res. 2007]Bioinformatics. 2003 Mar 22; 19(5):651-2.
[Bioinformatics. 2003]Science. 2003 Jul 18; 301(5631):376-9.
[Science. 2003]J Mol Biol. 1990 Oct 5; 215(3):403-10.
[J Mol Biol. 1990]Science. 2002 Apr 5; 296(5565):79-92.
[Science. 2002]Genome Res. 2004 May; 14(5):988-95.
[Genome Res. 2004]Nucleic Acids Res. 1997 Dec 15; 25(24):4876-82.
[Nucleic Acids Res. 1997]Brief Bioinform. 2004 Jun; 5(2):150-63.
[Brief Bioinform. 2004]Bioinformatics. 2006 Nov 1; 22(21):2688-90.
[Bioinformatics. 2006]Comput Appl Biosci. 1992 Jun; 8(3):275-82.
[Comput Appl Biosci. 1992]Proc Natl Acad Sci U S A. 2002 Dec 10; 99(25):16138-43.
[Proc Natl Acad Sci U S A. 2002]Proc Int Conf Intell Syst Mol Biol. 1995; 3():21-9.
[Proc Int Conf Intell Syst Mol Biol. 1995]J Mol Evol. 2003 Jan; 56(1):105-16.
[J Mol Evol. 2003]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D247-51.
[Nucleic Acids Res. 2006]Genome Res. 2005 Aug; 15(8):1153-60.
[Genome Res. 2005]Bioinformatics. 2006 May 15; 22(10):1269-71.
[Bioinformatics. 2006]Plant Physiol. 2004 Sep; 136(1):2621-32.
[Plant Physiol. 2004]Plant Cell. 2001 Jul; 13(7):1499-510.
[Plant Cell. 2001]Nat Biotechnol. 2000 Jun; 18(6):630-4.
[Nat Biotechnol. 2000]