pmc logo image
Logo of pnasPNAS Home page.Reference to the article.PNAS Info for AuthorsPNAS SubscriptionsPNAS About

Formats:

Proc Natl Acad Sci U S A. 2007 July 17; 104(29): 11963–11968.
Published online 2007 July 3. doi: 10.1073/pnas.0700922104.
PMCID: PMC1906725
Biophysics
The design and characterization of two proteins with 88% sequence identity but different structure and function
Patrick A. Alexander, Yanan He, Yihong Chen, John Orban, and Philip N. Bryan*
Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, 9600 Gudelsky Drive, Rockville, MD 20850
*To whom correspondence should be addressed. E-mail: bryan/at/umbi.umd.edu
Edited by David Baker, University of Washington, Seattle, WA, and approved June 6, 2007
Author contributions: P.A.A., J.O., and P.N.B. designed research; P.A.A., Y.H., Y.C., and J.O. performed research; P.A.A., Y.H., J.O., and P.N.B. analyzed data; and P.N.B. wrote the paper.
Received January 31, 2007.
To identify a simplified code for conformational switching, we have redesigned two natural proteins to have 88% sequence identity but different tertiary structures: a 3-α helix fold and an α/β fold. We describe the design of these homologous heteromorphic proteins, their structural properties as determined by NMR, their conformational stabilities, and their affinities for their respective ligands: IgG and serum albumin. Each of these proteins is completely folded at 25°C, is monomeric, and retains the native binding activity. The complete binding epitope for both ligands is encoded within each of the proteins. The IgG-binding epitope is functional only in the α/β fold, and the albumin-binding epitope is functional only in the 3-α fold. These results demonstrate that two monomeric folds and two different functions can be encoded with only 12% of the amino acids in a protein (7 of 56). The fact that 49 aa in these proteins are compatible with both folds shows that the essential information determining a fold can be highly concentrated in a few amino acids and that a very limited subset of interactions in the protein can tip the balance from one monomer fold to another. This delicate balance helps explain why protein structure prediction is so challenging. Furthermore, because a few mutations can result in both new conformation and new function, the evolution of new folds driven by natural selection for alternative functions may be much more probable than previously recognized.
Keywords: evolution, folding, NMR, protein design, protein structure
How amino acid sequence determines protein structure and ultimately protein function is perhaps the most fundamental unresolved question in biology. The degenerate nature of folding information, the vast number of conformations available to a polypeptide chain, and the low stability of most natural proteins (ΔGunfolding between 5 and 15 kcal/mol) all present challenges to understanding the folding code. In this study we create a set of proteins in which the essential folding information for two alternative folds resides within a small number of amino acids.
The starting point in this process was protein G, the multidomain, cell wall protein from Streptococcus (Lancefield group G) (1). Protein G contains two types of domains that bind to serum proteins in blood: GA domains of 47 structured amino acids that bind to HSA (2, 3) and GB domains of 56 structured amino acids that bind to the constant (Fc) region of IgG (4, 5). The ability to bind serum proteins apparently confers selective advantage to pathogenic bacteria by allowing them to camouflage themselves with host proteins (6).
The GA and GB domains share no significant sequence homology and have different folds, 3-α and α/β, respectively. We have previously characterized 56-aa versions of the GA and GB domains. High-resolution structures of these proteins have been determined, and the energetics of their folding and ligand binding reactions have been measured by microcalorimetry and hydrogen–deuterium exchange. The GA domain used in this study (PSD1) binds to HSA with a Kd = 20 nM and has a ΔGunfolding of 6 kcal/mol (25°C, 0.1 M KPO4, pH 7.2) (7). Amino acids 1–7 and 55–56 are disordered in GA (PSD1). The remaining 47 aa are well ordered in a 3-α helix bundle (8) (Fig. 1Fig. 1.A). The GB domain used in this study (GB1) binds to the constant (Fc) region of IgG with a Kd = 100 nM and has a ΔGunfolding of 7 kcal/mol (25°C, 0.1 M KPO4, pH 7.2) (9, 10). All 56 aa of GB1 are well ordered in a four-stranded β-sheet with an α-helix connecting strands two and three (11) (Fig. 1Fig. 1.B). Both solution and crystal structures have been determined for GB (1115).
Fig. 1.
Fig. 1.
Fig. 1.
Binding epitopes of GA and GB. (A) HSA-binding epitope of GA. Shown is a cartoon depiction of the secondary structure of GA. Amino acids making direct contacts with HSA (gray) are shown in yellow. Amino acids that were mutated to create a latent IgG epitope (more ...)
GA and GB constitute a heteromorphic pair of proteins (two proteins of equal length that have two different folds). We wish to study the structural and energetic relationship between the two distinct folded states for this pair. Consider the equilibrium shown in Scheme 1Scheme. 1..
Scheme. 1.
Scheme. 1.
Scheme. 1.
Experimental design.
For the native GA sequence, only the horizontal limb of the equilibrium is detectable because the α/β fold is poorly populated. An identical equilibrium exists for the native GB sequence, but only the vertical limb of the equilibrium is populated. The 47 positions of nonidentity in this initial heteromorphic pair shift the equilibrium from 99.996% 3-α fold (6 kcal/mol) and undetectable α/β to 99.999% α/β (7 kcal/mol) and undetectable 3-α. It is well known that not every amino acid in a protein contributes equally toward specifying the native fold, however (16). Thus, not all of the nonidentities will have equal information content toward specifying one fold or the other.
To identify a simplified code for conformational switching, we have determined mutations in binary sequence space (choice of either the GA or GB amino acid) which can be introduced into each member of the heteromorphic pair while preserving its native fold. Thus, new heteromorphic pairs of increasing identity are generated. The positions of nonidentity in the heteromorphic pair of highest identity constitute an essential fold-specific folding code parsed from the overall stability code. This basic approach has been taken in a number of previous studies (1721). We have been able to build on the previous work to create heteromorphic pairs of higher identity and higher stability while preserving biological function.
Encoding Latent Binding Sites.
The first step in the process of creating homologous versions of GA and GB was mutating key amino acids in each so that the IgG- and HSA-binding epitopes are encoded in both proteins. The IgG-binding epitope is functional in the α/β fold and latent in the 3-α fold, whereas the albumin-binding epitope is functional in the 3-α fold and latent in the α/β fold (Fig. 1Fig. 1.). This was done for two reasons. First we wished to determine whether a latent binding function could be encoded without altering native activity. This finding, in itself, demonstrates that latent functions could be encoded in alternative folds of other proteins. Second, we wished to create an experimental system in which binding function can be used to detect a specific fold in future studies. Consider the equilibrium below in which binding function is thermodynamically linked to a particular conformational state (see Scheme 2Scheme. 2.).
Scheme. 2.
Scheme. 2.
Scheme. 2.
Experimental design.
If both binding epitopes are present in both members of a heteromorphic pair, we can potentially detect a poorly populated fold through its thermodynamic linkage to the bound form.
The HSA binding site in GA comprises amino acids Y29, L32, N35, K37, T38, E40, and G41 (7, 22) (Fig. 1Fig. 1.A). The IgG binding site in GB comprises amino acids E27, K28, K31, and N35 on the central helix, W43 in the β3-strand, and main chain contacts in the intervening turn (3, 23, 24) (Fig. 1Fig. 1.B). A latent IgG-binding epitope was encoded in GA with three mutations: D27E, Y28K, and E43W. K31 and N35 are already present in GA. The resulting mutant is denoted GA30.
A latent HSA binding site was encoded in GB with the mutations V29Y, Q32L, N37K, G38T, and D40E. N35 and G41 are already present in GB. The resulting mutant is denoted GB30. GA30 and GB30 are 30% identical (Fig. 2Fig. 2.).
Fig. 2.
Fig. 2.
Fig. 2.
Iterative design of heteromorphic pairs. For each heteromorphic pair, amino acid identities are shown in blue and nonidentities are shown in red. Mutations introduced in each design cycle are shown as blue spheres in the corresponding panel. The seven (more ...)
GA30 and GB30 were characterized by CD to determine secondary structure content, thermal denaturation to determine conformation stability, gel filtration to confirm that the proteins remain monomeric, and affinity chromatography on IgG and HSA Sepharose to assess binding affinities. The CD spectra of GA30 and GB30 are shown in Fig. 3Fig. 3.A. The CD spectrum of GA30 is essentially identical to its parent (PSD1), and the CD spectrum of GB30 is essentially identical to that of native GB1. Thermal denaturation profiles followed by ellipticity at 222 nm are shown in Fig. 3Fig. 3.B. The thermal denaturation midpoint is 86°C for GA30 and 65°C for GB30. The CD melting data were converted to ΔGunfolding by using the relationship ΔGunfolding = −RTln([unfolded]/[folded]).
Fig. 3.
Fig. 3.
Fig. 3.
Analysis of conformation and thermal denaturation by CD. CD spectra and thermal denaturation curves are shown for GA30 (blue dashed line), GA77 (red dashed line), GA88 (black dashed line), GB30 (blue solid line), GB77 (red solid line), and GB88 (black (more ...)
A plot of ΔG vs. temperature for GA30 and GB30 is shown in Fig. 4Fig. 4.. GA30 is similar in stability to its parent (PSD1). Fig. 4Fig. 4.A shows the stability vs. temperature graph for GA30 determined from the CD data. For reference, the figure also shows the stability profile of PSD1 determined from differential scanning calorimetry and hydrogen–deuterium exchange experiments (7, 8). The introduction of the latent IgG binding site into PSD1 does not have any significant affect on the thermodynamics of the unfolding reaction, including any variation in the change in heat capacity for unfolding (ΔCp). ΔCp is correlated with the change in solvent-exposed surface area upon unfolding and determines the curvature of the stability curve. The absence of any change in ΔCp is an indication that no major perturbations in packing of the hydrophobic core have occurred (25).
Fig. 4.
Fig. 4.
Fig. 4.
Stability profiles of natural and mutant GA and GB proteins. (A) ΔG vs. temperature plots are shown for GA30 (blue), GA77 (red), and GA88 (black). For reference, the stability curve for the parent protein PSD1 is shown in green. (B) ΔG (more ...)
Fig. 4Fig. 4.B shows the stability vs. temperature graph for GB30 determined from the CD data. For reference, the figure also shows the stability profile of GB1 determined from differential scanning calorimetry and hydrogen–deuterium exchange experiments (9, 10). GB30 is less stable than GB1 by almost 3 kcal/mol but nevertheless exhibits stability typical of some natural IgG-binding domains. Thus, latent function can be built into both proteins in a way that is relatively benign in terms of stability.
The binding of GA30 and GB30 to IgG and HSA was determined by using HSA and IgG Sepharose columns. The concentrations of the two ligands on the respective columns were similar to their concentrations in human serum (10 mg/ml). The use of the affinity columns allowed rapid assessment of binding affinity (and no binding interaction), as well as the stoichiometry of binding (two GB molecules per IgG and one GA molecule per HSA). Retention on the column after washing with 20-column volumes indicates a dissociation constant of <1 μM. Elution from a column with no tailing of the elution peak indicates a dissociation constant of >1 mM. Approximately 1 mg of GA30 was tightly bound to HSA Sepharose and required 0.5 M NaOAc (pH 3.0) for elution. GA30 eluted as a sharp peak in the flow-through fractions from IgG Sepharose. In contrast, ≈0.7 mg of GB30 was tightly bound to IgG Sepharose and required 0.5 M NaOAc (pH 3.0) for elution. GB30 eluted as a sharp peak in the flow-through fractions from HSA Sepharose. Thus, the engineering of latent binding function into the two proteins was accomplished without significant alteration of the native binding function. As expected, the IgG-binding epitope is cryptic in GA30 because its correct presentation requires the α/β fold, and the HSA-binding epitope is cryptic in GB30 because its correct presentation requires the 3-α fold.
Finally, gel filtration experiments on G25 and G75 columns were used to confirm that GA30 and GB30 had the same hydrodynamic properties as the parents, PSD1 and GB1.
Increasing Identity.
GA30 and GB30 constitute a heteromorphic pair with wild-type binding function, as well as excess functional capacity in latent form. We needed to methodically examine the binary sequence space at the 39 positions of nonidentity to determine whether highly homologous regions of that sequence space encode for both of the folds.
Previous studies with GB have shown that this is not trivial. The tolerance of any particular mutation in a fold is context-dependent and cannot be determined by making sequential single amino acid substitutions. Blanco et al. (20) performed a methodical analysis of sequence space between GB and the all-β SH3 domain of spectrin. They minimized the context-dependence issue by avoiding mutations of residues in the hydrophobic cores. Several mutants of high identity were found in their sampling that had circular dichroic spectra indicative of alternative folds and cooperative thermal denaturation transitions typical of folded proteins. None of the homologous pairs was found to have a well defined tertiary structure as judged by 1H NMR, however. Dalal and colleagues (21, 26) carefully designed hybrid sequences of GB and the all α-helical ROP homodimer. They were able to create heteromorphic pairs of up to 80% identity. The ROP-like proteins were disordered at neutral pH but acquired α-helical CD spectra below pH 5 and exhibit cooperative thermal unfolding transitions. Their limited solubility precluded detailed analysis of their tertiary structures (26). We have previously used phage display selection to explore the sequence space between GB and the α-helical IgG-binding domain of protein A. A heteromorphic pair of 59% identity was evolved (27). High-resolution NMR structures of this pair are described by He et al. (28). Above 60% identity, the folding signal degraded to the point where the stability of heteromorphic pairs dropped below 2 kcal/mol and detailed structural analysis became problematic.
To find mutation-tolerant sites in GA30 and GB30 we relied heavily on previous experiments that used random mutagenesis of topological islands and phage display to select for functional folds (7, 8, 27). These experiments allowed us to classify each position in GA and each position in GB into one of three general categories: (i) mutation-tolerant and context-independent (mutations are tolerated independent of neighboring amino acids); (ii) mutation-tolerant but context-dependent (mutations are tolerated but only with compensating mutation of neighboring amino acids); and (iii) mutation-resistant (mutations seldom or never appear in selected populations). Restricting choices to category i, we mutated positions 1–8, 10, 15, 21, 22, 23, 26, 34, 44, 47, 53, and 54 in GA30 to the corresponding amino acid in GB30 and we mutated positions 11, 14, 16, 17, 36, 42, 46, and 48 in GB30 to the corresponding amino acid in GA30. The resulting heteromorphic pair (GA77 and GB77) was 77% identical (Fig. 2Fig. 2.).
GA77 and GB77 were characterized by CD, thermal denaturation, ligand binding, and gel filtration as described for GA30 and GB30. These data are presented in Figs. 3Fig. 3. and and4.4Fig. 4.. The stability of GA77 has decreased by ≈1 kcal/mol relative to GA30. The stability of GB77 has decreased by ≈0.6 kcal/mol relative to GB30. Both GA77 and GB77 are calculated to have a ΔGunfolding of >4 kcal/mol at 25°C (Fig. 4Fig. 4.). Both proteins show no decrease in binding affinity to their respective ligands relative to PSD1 and GB1 (by affinity chromatography) and are monomeric (by gel filtration). Despite the high degree of sequence identity (77%), the stability and binding characteristics of these two proteins remain similar to naturally occurring IgG and HSA binding domains. Both proteins are highly expressed in Escherichia coli in soluble form at 37°C.
In a final iteration, mutations at two sites from category i (51 and 52) were introduced into GA77 and mutations at four sites from category ii were introduced into GB77: G9L and L12A are compensating mutations and T18K and A20L are compensating mutations. The resulting proteins (GA88 and GB88) were 88% identical (Fig. 2Fig. 2.). The CD and thermal denaturation data are presented in Figs. 3Fig. 3. and and4.4Fig. 4.. The CD spectrum of GA88 is essentially identical to PSD1, and the CD spectrum of GB88 is essentially identical to that of native GB1. The thermal denaturation midpoint is 70°C for GA88 and 44°C for GB88. The stability of GA88 has decreased by ≈1 kcal/mol relative to GA77. The stability of GB88 has decreased by ≈2.4 kcal/mol relative to GB77. GA88 is calculated to have a ΔGunfolding of ≈4 kcal/mol, and GB88 is calculated to have a ΔGunfolding of ≈2.0 kcal/mol at 25°C (Fig. 4Fig. 4.). Both mutants unfold cooperatively. The small changes observed in the curvature of the stability curves for GB77 and GB88 may reflect small changes in ΔCpunfolding.
HSA and IgG Sepharose columns were used to show that GA88 binds only to HSA and GB88 binds only to IgG. GB88 eluted as a sharp peak in the flow-through fractions of the HSA column, and GA88 eluted as a sharp peak in the flow-through fractions of the IgG column. GB88 was completely retained through loading and washing steps on the IgG column, but GA88 eluted from the HSA column in a broad peak after ≈10-column volumes. This elution profile was independent of flow rate indicating that binding is in rapid equilibrium. If the concentration of HSA on the column (0.17 mM) is taken as the free ligand concentration, then the KD of GA88 is estimated to be 20 μM. Although GA88 binding affinity toward HSA has diminished relative to GA77, it remains in the range observed for some naturally occurring GA domains (7). Gel filtration over G25 and G75 columns showed that both proteins remain monomeric.
Analysis of Folds by NMR.
Previous studies have shown that native-like CD spectra and cooperative thermal denaturation profiles, although indicative of well defined tertiary structure, are not conclusive (20). Thus, to characterize their tertiary folds, GA77, GB77, GA88, and GB88 were isotopically labeled with 15N and 13C. 15N, 1H HSQC spectra of all four proteins are well dispersed with sharp line widths, indicative of folded proteins. Sequential assignments have been completed by using standard triple-resonance methods. The assigned spectra of GA88 and GB88 are shown in Fig. 5Fig. 5.. The different chemical environments in the two folds result in little overlap of amide proton resonances in the two proteins, even though 49 of 56 aa are identical. Consensus chemical shift index comparisons [supporting information (SI) Tables 1 and 2] and analysis of diagnostic, long-range NOEs showed that GA88 is very similar in fold to PSD1 and that GB88 is very similar in fold to GB1. High-resolution structures of GA88 and GB88 are nearing completion and will be published elsewhere.
Fig. 5.
Fig. 5.
Fig. 5.
1H, 15N HSQC spectra. Main chain amide assignments are shown for GA88 (black) and GB88 (red). Forty-nine of the 56 aa in these two proteins are identical but have different chemical environments reflecting the two different folds. Amide proton signals (more ...)
Previous studies have demonstrated that conformational preferences of many amino acid sequences are marginal. The structure of a short fragment within a protein can be completely context-dependent (29). Furthermore, monomeric proteins sometimes can assume alternative conformations in a multimeric form. This is observed in prion proteins and with monomeric and dimeric forms of the chemokine lymphotactin (30) and also with proteins designed to switch conformation and quaternary structure in the presence of a transition metal (31, 32). It has also been shown that five mutations to the core of GB result in a conformational change to a stable, intertwined tetrameric state (33). It is clear from these examples that the energy of multimer formation or metal binding can drive conformational change.
We show here that the fold and function of a monomeric protein can depend on only 7 of 56 positions. The final identity of GA88 and GB88 results from nine initial identities, 17 identities created by mutations in GB, 16 identities created by mutations in GA (structured region only), and seven identities created by extension of the N terminus in GA, for a total of 49 identities.
Thus, ≈30% of the structured amino acids in each member of the initial heteromorphic pair were mutated to produce the final 88% identity. Seven unique amino acids are preserved in each fold, and hence seven mutations are sufficient to shift the equilibrium from 99.9% 3-α fold to 97% α/β. The other 49 positions influence the 3-α to U and the α/β to U equilibria, but they can be tolerated in both folds and therefore provide a relatively neutral sequence background in which to observe an underlying fold-specific folding code. Clearly the complementary set of mutations in each member of the pair have to be very carefully selected to achieve this level of identity, but previous studies indicate that this result is probably not unique to this pair of proteins (1721).
This fact has profound implications for understanding the protein folding code, as well as understanding how new folds and functions evolve. Protein stability is typically analyzed as a two-state reaction between a single folded state and a population of disordered, unfolded states. Our results suggest that the ΔG of alternative folded states may be much more favorable than generally recognized. This conclusion might explain a major difficulty in predicting the native fold by computational methods.
Our observation that a latent binding epitope can be engineered into a small protein without interfering with its native function suggests that the “folding problem” might be viewed from a new perspective. The vast array of conformations available to a polypeptide chain makes the folding problem difficult, but it also means that many latent functions could exist in this vast population of conformers. Because of the prevailing view that all folds except the native are highly improbable, the acquisition of new function by conformational switching also is judged to be improbable. Because the conformational preference for a 3-α vs. an α/β fold can depend on a delicate balance of critical interactions within a protein, however, a few mutations can result in a new conformation and the unmasking of new functionality.
The binary sequence space separating GA88 and GB88 comprises only 128 (27) different variations. In this population, most variants will be predominantly unfolded. Some of these unfolded forms will likely have significant affinity for one and possibly even both ligands, if a sequence has significant propensity for both folds. Numerous examples exist of natural proteins that are largely unfolded unless bound to a ligand. We would suggest that transitional forms in the natural evolution of new folds may be predominantly unfolded (34). More than 30% of the eukaryotic proteome is predicted to be “natively unfolded.” Innovation (unmasking of new function) could result from a few mutations that lower the propensity for the native fold but increase the propensity for an alternative one. Thus, the three characteristics of proteins that make the folding problem difficult (large conformational space, degenerate folding code, and small ΔG) may enable facile evolution of new folds and functions.
Protein Expression and Purification.
To facilitate their rapid purification, GA and GB variants were cloned into the vector pG58, which encodes an engineered subtilisin prosequence as the N terminus of the fusion protein, and were purified by using an affinity-cleavage tag system that we developed (35). The system enabled the rapid, standardized purification of mutant proteins, even of low stability. Minimal medium (10) was used for 15N and 13C labeling. Soluble cell extract of prodomain fusion protein was injected on a 5-ml S189 column at 5 ml/min to allow binding and then washed with 10-column volumes of 100 mM KPO4 (pH 7.2) to remove impurities (35). To cleave and elute the purified target protein, 6 ml of 100 mM KF/100 mM KPO4 (pH 7.2) was injected at 0.5 ml/min. The purified protein was then dialyzed into 2 mM ammonium bicarbonate buffer (pH 7.0) and lyophilized.
CD.
Lyophilized powder was resuspended in 100 mM KPO4 (pH 7.2) for analysis by CD. Under these conditions all proteins are monomeric at concentrations up to 5 mg/ml (0.8 mM). This was demonstrated by gel filtration on G25 and G75 Sephadex. CD measurements were performed with a spectropolarimeter (model J-720; Jasco, Easton, MD) using water-jacketed quartz cells with path lengths of 1 cm on protein concentrations of 5 μM. The ellipticity results were expressed as mean residue ellipticity, [θ], degrees per cm2/dmol−1. Temperature-induced unfolding was performed in the temperature range between 25°C and 100°C in 1-cm cuvettes. Ellipticities at 222 nm were continuously monitored at a scanning rate of 1° per min. The fraction native is determined by subtracting unfolded baseline from the experimental CD signal and then dividing by the total CD difference between 100% folded and 0% folded at that temperature. Reversibility of the denaturations was confirmed by comparing the CD spectra at 25°C before melting and after heating to 100°C and cooling to 25°C. The temperature unfolding profiles measured by far-UV CD for GA and GB were converted to an apparent ΔGunfolding and fit to a theoretical curve calculated by using the Gibbs–Helmholtz equation: ΔGunfolding = ΔHoTΔSo + ΔCp(TToTlnT/To), where To = 298 K and ΔCp = 0.83 kcal/°mol for GB and 0.26 kcal/°mol for GA (10, 36).
Binding to IgG and Human Serum Albumin.
IgG and HSA were immobilized on GE HT1 columns containing NHS-activated agarose resin according to the manufacturer's instructions. Binding of PSD1, GB1, and their mutants was carried out in 0.1 M KPO4 (pH 7.2) by injecting 1 ml of a 1 mg/ml solution at 0.5 ml/min. Washing was with 15 ml of 0.1 M KPO4 (pH 7.2) at 1 ml/min. Elution was with 6 ml of 0.5 M NaOAc (pH 3.0). All proteins except GA88 were completely retained in the binding and washing steps.
NMR Spectroscopy.
Lyophilized protein samples were dissolved in NMR buffer (50 mM NaPi/50 mM NaCl, pH 7.0) containing 10% D2O. The final protein concentrations were in the range of 0.4–0.6 mM. NMR spectra were acquired on a AVANCE 600-MHz spectrometer (Bruker, Billerica, MA) equipped with a z axis gradient triple resonance (1H/13C/15N) cryoprobe. Backbone resonance assignments were obtained from the following three-dimensional triple-resonance experiments recorded on 13C/15N-labeled samples: HNCACB, CBCA(CO)NH, HBHA(CO)NH, and HNCO. All experiments were collected at 298 K. NMR spectra were processed by using nmrPipe (37) and analyzed with Sparky (38). Chemical shift index analysis was carried out with Cα, Cβ, Hα, and CO assignments (39).
Supplementary Material
Supporting Tables
Acknowledgments
We thank David Rozak, John Moult, and Edward Eisenstein for their assistance and advice. This work was supported by National Institutes of Health Grant GM62154 (to J.O.).
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/cgi/content/full/0700922104/DC1.
1. Fahnestock SR, Alexander P, Nagle J, Filpula D. J Bacteriol. 1986;167:870–880. [PubMed]
2. Falkenberg C, Bjork L, Åkerstrom B. 1992;31:1451–1457.
3. Frick IM, Wikstrom M, Forsen S, Drakenberg T, Gomi H, Sjobring U, Bjorck L. Proc Natl Acad Sci USA. 1992;89:8532–8536. [PubMed]
4. Myhre EB, Kronvall G. Infect Immun. 1977;17:475–482. [PubMed]
5. Reis KJ, Ayoub EM, Boyle MDP. J Immunol. 1984;132:3098–3102. [PubMed]
6. de Chateau M, Holst E, Bjorck L. J Biol Chem. 1996;271:26609–26615. [PubMed]
7. Rozak DA, Alexander PA, He Y, Chen Y, Orban J, Bryan PN. Biochemistry. 2006;45:3263–3271. [PubMed]
8. He Y, Rozak DA, Sari N, Chen Y, Bryan P, Orban J. Biochemistry. 2006;45:10102–10109. [PubMed]
9. Orban J, Alexander P, Bryan P. Biochemistry. 1994;33:5702–5710. [PubMed]
10. Alexander P, Fahnestock S, Lee T, Orban J, Bryan P. Biochemistry. 1992;31:3597–3603. [PubMed]
11. Gallagher TD, Alexander P, Bryan P, Gilliland G. Biochemistry. 1994;33:4721–4729. [PubMed]
12. Gronenborn AM, Filpula DR, Essig NZ, Achari A, Whitlow M, Wingfield PT, Clore GM. Science. 1991;253:657–661. [PubMed]
13. Achari A, Hale S, Howard AJ, Clore GM, Gronenborn AM, Hardman KD, Whitlow M. Biochemistry. 1992;31:10449–10457. [PubMed]
14. Lian L-Y, Derrick JP, Sutcliffe MJ, Yang JC, Roberts GCK. J Mol Biol. 1992;228:1219–1234. [PubMed]
15. Derrick JP, Wigley DB. J Mol Biol. 1994;243:906–918. [PubMed]
16. Lattman EE, Rose GD. Proc Natl Acad Sci USA. 1993;90:439–441. [PubMed]
17. Jones DT, Moody CM, Uppenbrink J, Viles JH, Doyle PM, Harris CJ, Pearl LH, Sadler PJ, Thornton JM. Proteins. 1996;24:502–513. [PubMed]
18. Dalal S, Balasubramanian S, Regan L. Folding Des. 1997;2:R71–R79.
19. Yuan SM, Clarke ND. Proteins. 1998;30:136–143. [PubMed]
20. Blanco FJ, Angrand I, Serrano L. J Mol Biol. 1999;285:741–753. [PubMed]
21. Dalal S, Regan L. Protein Sci. 2000;9:1651–1659. [PubMed]
22. Lejon S, Frick IM, Bjorck L, Wikstrom M, Svensson S. J Biol Chem. 2004;279:42924–42928. [PubMed]
23. Sauer-Eriksson AE, Keywegt GJ, Uhlen M, Jones TA. Structure (London). 1995;3:265–278.
24. Sloan DJ, Hellinga HW. Protein Sci. 1999;8:1643–1648. [PubMed]
25. Livingstone JR, Spolar RS, Record TM. Biochemistry. 1991;30:4237–4244. [PubMed]
26. Dalal S, Balasubramanian S, Regan L. Nat Struct Biol. 1997;4:548–552. [PubMed]
27. Alexander PA, Rozak DA, Orban J, Bryan PN. Biochemistry. 2005;44:14045–14054. [PubMed]
28. He Y, Yeh DC, Alexander P, Bryan PN, Orban J. Biochemistry. 2005;44:14055–14061. [PubMed]
29. Minor DL, Jr, Kim PS. Nature. 1996;380:730–734. [PubMed]
30. Kuloglu ES, McCaslin DR, Markley JL, Volkman BF. J Biol Chem. 2002;277:17863–17870. [PubMed]
31. Ambroggio XI, Kuhlman B. J Am Chem Soc. 2006;128:1154–1161. [PubMed]
32. Cerasoli E, Sharpe BK, Woolfson DN. J Am Chem Soc. 2005;127:15008–15009. [PubMed]
33. Kirsten Frank M, Dyda F, Dobrodumov A, Gronenborn AM. Nat Struct Biol. 2002;9:877–885. [PubMed]
34. Ward JJ, Sodhi JS, McGuffin LJ, Buxton BF, Jones DT. J Mol Biol. 2004;337:635–645. [PubMed]
35. Ruan B, Fisher KE, Alexander PA, Doroshko V, Bryan PN. Biochemistry. 2004;43:14539–14546. [PubMed]
36. Rozak DA, Orban J, Bryan PN. Biochim Biophys Acta. 2005;1753:226–233. [PubMed]
37. Delaglio F, Grzesiek S, Vuister GW, Zhu G, Pfeifer J, Bax A. J Biomol NMR. 1995;6:277–293. [PubMed]
38. Goddard TD, Kneller DG. SPARKY. San Francisco: Univ of California; 2001. version 3.
39. Wishart DS, Sykes BD. J Biomol NMR. 1994;4:171–180. [PubMed]
40. Derrick JP, Wigley DB. Nature. 1992;359:752–754. [PubMed]

See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph