Protein Sci. Jun 2000; 9(6): 1162–1176.
PMCID: PMC2144653

Cascaded multiple classifiers for secondary structure prediction.

Abstract

We describe a new classifier for protein secondary structure prediction, formed by cascading different types of classifiers based on neural networks and linear discrimination. The new classifier achieves an accuracy of 76.7%, assessed by a rigorous full jackknife procedure, on a new nonredundant dataset of 496 nonhomologous sequences (obtained from G.J. Barton and J.A. Cuff). This database was designed specifically to train and test protein secondary structure prediction methods, and it uses a more stringent definition of homologous sequences than previous studies. We show that it is possible to design classifiers that discriminate well among the three classes (H, E, C), with an accuracy of up to 78% for beta-strands, using only a local window and resampling techniques. This indicates that the importance of long-range interactions for the prediction of beta-strands has probably been overestimated in the past.
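The abstract names three ingredients: a local sequence window, a cascade of classifiers, and a full jackknife assessment. The sketch below illustrates how these fit together and is not the authors' method. The first stage is a simple count-based scorer standing in for the neural networks and linear discriminants, the second stage is a majority-vote filter over first-stage labels, and the jackknife holds out one protein at a time. All function names, the window widths, and the `TOY` sequences are made up for illustration.

```python
from collections import Counter, defaultdict
import math

CLASSES = "HEC"  # helix, strand, coil

def windows(seq, half):
    """Yield the local window (padded with '-') centred on each position."""
    padded = "-" * half + seq + "-" * half
    for i in range(len(seq)):
        yield padded[i:i + 2 * half + 1]

def train_stage1(proteins, half):
    """Count how often each (offset, residue) pair co-occurs with each class."""
    counts = defaultdict(Counter)  # (offset, residue) -> Counter of classes
    for seq, labels in proteins:
        for win, lab in zip(windows(seq, half), labels):
            for off, res in enumerate(win):
                counts[(off, res)][lab] += 1
    return counts

def predict_stage1(counts, seq, half):
    """Stage 1: pick the class with the highest summed log-score per residue."""
    out = []
    for win in windows(seq, half):
        scores = {}
        for c in CLASSES:
            s = 0.0
            for off, res in enumerate(win):
                cell = counts[(off, res)]
                # Add-one smoothing so unseen pairs do not give log(0).
                s += math.log((cell[c] + 1) / (sum(cell.values()) + len(CLASSES)))
            scores[c] = s
        out.append(max(CLASSES, key=scores.get))
    return "".join(out)

def predict_stage2(stage1_labels, half):
    """Stage 2: majority vote over a window of stage-1 labels.

    Ties are broken in favour of the label already at the centre.
    """
    out = []
    for win in windows(stage1_labels, half):
        votes = Counter(ch for ch in win if ch != "-")
        out.append(max(CLASSES, key=lambda c: (votes[c], c == win[half])))
    return "".join(out)

def jackknife_q3(proteins, half=2):
    """Full jackknife: hold out each protein once, train on all the others."""
    correct = total = 0
    for k, (seq, labels) in enumerate(proteins):
        training = proteins[:k] + proteins[k + 1:]
        counts = train_stage1(training, half)
        pred = predict_stage2(predict_stage1(counts, seq, half), 1)
        correct += sum(p == t for p, t in zip(pred, labels))
        total += len(labels)
    return correct / total  # fraction of residues predicted correctly

# Made-up toy data (sequence, per-residue labels); not from the 496-protein set.
TOY = [
    ("AEELLKKA", "CHHHHHHC"),
    ("VIVTVYGS", "EEEEEECC"),
    ("GPNGSPDG", "CCCCCCCC"),
    ("AELKKLAE", "HHHHHHHC"),
]

if __name__ == "__main__":
    print(f"jackknife accuracy on toy data: {jackknife_q3(TOY):.3f}")
```

Holding out whole proteins, rather than individual residues, keeps windows from the test protein out of the training counts, which is the point of a jackknife on a nonredundant set.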


Articles from Protein Science : A Publication of the Protein Society are provided here courtesy of The Protein Society