Amino acid sequence of the 67 A.thaliana B-ZIP domains. Proteins are arranged into families, A–T shown in bold in the first column, with similar predicted dimerization properties. The second column depicts names of the families from Jakoby et al. (7). The solid line delineates homodimerizing proteins (A–N) from proteins that have complex dimerization pattern (P–T). The leucine zipper region is divided into heptads (gabcdef) to help visualize the g↔e′ pairs. Amino acids predicted to regulate dimerization specificity are color-coded. If the g and following e positions contain charged amino acids we colored the heptad from g to the following e. We use four colors to represent g↔e′ pairs. Green is for the attractive basic-acidic pairs (R↔E and K↔E), orange is for the attractive acidic-basic pairs (E↔R, E↔K, D↔R and D↔K), red is for repulsive acidic pairs (E↔E and E↔D), and blue is for repulsive basic pairs (K↔K and R↔K). If only one of the two amino acids in the g↔e′ pair is charged, we color that residue blue for basic and red for acidic. If the a or d position is polar, it is colored black and if either is charged, it is colored purple. The prolines and glycines are colored red to indicate a potential break in the α-helical structure. The predicted C-terminal boundary of the B-ZIP leucine zipper is denoted by an asterisk that enables us to define the frequencies of amino acids in different position of the leucine zipper. The C-terminal boundary is defined by the presence hydrophobic amino acids in the a and d positions, charged amino acids in the e and g positions, the absence of proline or pairs of glycines anywhere in the structure, and the absence of charged amino acids in the a and d positions. In the majority of cases, the decision was straight forward. However, in several instances, it is more ambiguous. For example, we indicate that the DPBF3 and DPBF4 leucine zipper stops after the second heptad because the third heptad has a K and E in the a and d positions, respectively; which should prevent leucine zipper formation. For the same reason At5g44080 and GBF4 stops after the third heptad as they have a R and K in the a and d positions, respectively, in the fourth heptad. At1g06070 goes to the ninth heptad even though it has two glycines in the fourth heptad. We would normally think this would terminate a leucine zipper structure but it appears very canonical from the fifth to the ninth heptads so be propose that it continues through the fourth heptad. This type of thinking was used to define all C-terminal boundary of all the leucine zippers. We do not intend these definitions of the C-terminus to be definitive, only approximate. The natural C-terminus is denoted with circumflex accent. The protein sequence for the tenth heptad of posF21, an N family member, and At2g13150, a P family member, that are predicted to not form a coiled-coil is LTGQVAP and VLISNEK, respectively. The dot in front of a protein indicates that it has been experimentally shown to form a homodimer.