Nucleic Acids Res. 1998 May 15; 26(10): 2306–2312.
PMCID: PMC147552

Quantitative parameters for amino acid-base interaction: implications for prediction of protein-DNA binding sites.


Inspection of the amino acid-base interactions in protein-DNA complexes is essential to understanding how regulatory proteins specifically recognize their DNA target sites. The growing body of protein-DNA co-crystal structures invites the derivation of quantitative parameters for amino acid-base interaction from these data. Here we use the coordinates of 53 solved protein-DNA complexes to extract all non-homologous amino acid-base pairs that are in close contact, including hydrogen bonds and hydrophobic interactions. By comparing the frequency distribution of the different pairs to a theoretical distribution and calculating the log odds, a quantitative measure that expresses the likelihood of interaction for each amino acid-base pair can be extracted. A score that reflects the compatibility between a protein and its DNA target can be calculated by summing the individual measures of the amino acid-base pairs involved in the complex, assuming additivity in their contributions to binding. This score enables ranking of different DNA binding sites given a protein binding site, and vice versa, and can be used in molecular design protocols. We demonstrate its validity by comparing the predictions made with this score against experimental binding results for sequence variants of zif268 zinc fingers and their DNA binding sites.
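
The log-odds measure and the additive compatibility score described above can be illustrated with a minimal sketch. Several things here are assumptions and not taken from the paper: the theoretical distribution is modelled as the product of the marginal amino acid and base frequencies, a pseudocount is added to avoid taking the log of zero, and the contact list is invented toy data. The function names `log_odds_table` and `compatibility_score` are hypothetical.

```python
import math
from collections import Counter


def log_odds_table(contacts, pseudocount=1.0):
    """Return {(amino_acid, base): log odds} from observed contact pairs.

    Assumption (not from the paper): the expected frequency of a pair is
    the product of the marginal frequencies of its amino acid and base.
    """
    contacts = list(contacts)
    total = len(contacts)
    pair_counts = Counter(contacts)
    aa_counts = Counter(aa for aa, _ in contacts)
    base_counts = Counter(base for _, base in contacts)
    n_cells = len(aa_counts) * len(base_counts)

    table = {}
    for aa in aa_counts:
        for base in base_counts:
            # Observed frequency, smoothed with a pseudocount per cell.
            observed = (pair_counts[(aa, base)] + pseudocount) / (
                total + pseudocount * n_cells
            )
            # Expected frequency under independence of amino acid and base.
            expected = (aa_counts[aa] / total) * (base_counts[base] / total)
            table[(aa, base)] = math.log(observed / expected)
    return table


def compatibility_score(pairs, table):
    """Sum the per-pair log odds, assuming additive contributions."""
    return sum(table[pair] for pair in pairs)


# Invented toy contacts for illustration only; not data from the 53 complexes.
toy_contacts = (
    [("Arg", "G")] * 6
    + [("Arg", "A")] * 1
    + [("Asn", "A")] * 4
    + [("Asn", "G")] * 1
)
toy_table = log_odds_table(toy_contacts)

# Two hypothetical candidate sites, each listed as its contacting pairs.
site_a = [("Arg", "G"), ("Asn", "A")]
site_b = [("Arg", "A"), ("Asn", "G")]
ranked = sorted(
    [site_a, site_b],
    key=lambda site: compatibility_score(site, toy_table),
    reverse=True,
)
```

In this toy setting, pairs seen more often than the independence model predicts receive positive log odds, and candidate sites are ranked by the sum over their pairs. How the paper defines its theoretical distribution and handles sparse counts may differ from this sketch.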



Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

