Proc Natl Acad Sci U S A. Sep 12, 1995; 92(19): 8700–8704.

Prediction of protein folding class using global description of amino acid sequence.


We present a method for predicting protein folding class based on global protein chain description and a voting process. Selection of the best descriptors was achieved by a computer-simulated neural network trained on a data base consisting of 83 folding classes. Protein-chain descriptors include overall composition, transition, and distribution of amino acid attributes, such as relative hydrophobicity, predicted secondary structure, and predicted solvent exposure. Cross-validation testing was performed on 15 of the largest classes. The test shows that proteins were assigned to the correct class (correct positive prediction) with an average accuracy of 71.7%, whereas the inverse prediction of proteins as not belonging to a particular class (correct negative prediction) was 90-95% accurate. When tested on 254 structures used in this study, the top two predictions contained the correct class in 91% of the cases.
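The three descriptor families named in the abstract (composition, transition, and distribution of an amino acid attribute) can be illustrated with a minimal sketch. The abstract does not give the exact definitions, so everything specific here is an assumption made for illustration: the three-way hydrophobicity grouping in `GROUPS`, the function names, and the choice of the first, 25%, 50%, 75%, and 100% occurrence marks for the distribution descriptor.

```python
import math

# Hypothetical three-class hydrophobicity grouping (an assumption for
# illustration; the abstract does not list the groups used).
GROUPS = {"P": "RKEDQN", "N": "GASTPHY", "H": "CVLIMFW"}
CLASS_OF = {aa: g for g, members in GROUPS.items() for aa in members}


def encode(seq):
    """Map each residue to its class label, skipping unknown letters."""
    return [CLASS_OF[aa] for aa in seq.upper() if aa in CLASS_OF]


def composition(labels):
    """Fraction of residues falling in each class."""
    n = len(labels)
    return {g: labels.count(g) / n for g in GROUPS}


def transition(labels):
    """Fraction of adjacent residue pairs that switch between two classes
    (counted in either direction)."""
    pairs = list(zip(labels, labels[1:]))
    keys = [("P", "N"), ("P", "H"), ("N", "H")]
    return {
        a + b: sum(1 for p in pairs if set(p) == {a, b}) / len(pairs)
        for a, b in keys
    }


def distribution(labels):
    """Relative chain positions at which the first, 25%, 50%, 75%, and
    100% of the residues of each class have been seen (assumed definition)."""
    n = len(labels)
    out = {}
    for g in GROUPS:
        pos = [i + 1 for i, x in enumerate(labels) if x == g]
        if not pos:
            out[g] = [0.0] * 5
            continue
        marks = [pos[0]] + [
            pos[max(1, math.ceil(f * len(pos))) - 1]
            for f in (0.25, 0.5, 0.75, 1.0)
        ]
        out[g] = [m / n for m in marks]
    return out


def describe(seq):
    """Bundle the three descriptor families for one sequence."""
    labels = encode(seq)
    if len(labels) < 2:
        raise ValueError("need at least two recognized residues")
    return {
        "composition": composition(labels),
        "transition": transition(labels),
        "distribution": distribution(labels),
    }
```

Calling `describe("RKVLGA")` would return one dictionary per descriptor family. In the method described above, vectors of this kind, computed for several attributes, would serve as inputs to the neural networks; the networks and the voting step are not sketched here.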

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences