Nucleic Acids Res. Sep 1994; 22(17): 3590–3596.
PMCID: PMC308327

PRINTS--a database of protein motif fingerprints.

Abstract

PRINTS is a compendium of protein motif 'fingerprints'. A fingerprint is defined as a group of motifs excised from conserved regions of a sequence alignment, whose diagnostic power or potency is refined by iterative database scanning (in this case of the OWL composite sequence database). Generally, the motifs do not overlap, but are separated along a sequence, though they may be contiguous in 3D-space. The use of groups of independent, linearly or spatially distinct motifs allows protein folds and functionalities to be characterised more flexibly and powerfully than with conventional single-component patterns or regular expressions. The current version of the database contains 200 entries (encoding 950 motifs), covering a wide range of globular and membrane proteins, modular polypeptides, and so on. The growth of the database is influenced by a number of factors, e.g. the use of multiple motifs, the maximisation of sequence information through iterative database scanning, and the fact that the database searched is a large composite. The information contained within PRINTS is distinct from, but complementary to, the consensus expressions stored in the widely-used PROSITE dictionary of patterns.
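The idea of a fingerprint as an ordered group of non-overlapping motifs can be illustrated with a minimal Python sketch. It is not the scanning method used to build PRINTS. The motif representation (a list of equal-length aligned segments), the per-column identity score, the 0.75 threshold, and the function names `window_score` and `match_fingerprint` are all assumptions made for illustration.

```python
from typing import List, Optional

# Hypothetical representation: a motif is a list of ungapped, equal-length
# segments excised from one conserved region of an alignment.
Motif = List[str]


def window_score(motif: Motif, window: str) -> float:
    """Fraction of positions where the window residue occurs in that motif column."""
    hits = sum(
        1
        for col, residue in enumerate(window)
        if any(segment[col] == residue for segment in motif)
    )
    return hits / len(window)


def match_fingerprint(
    fingerprint: List[Motif], sequence: str, threshold: float = 0.75
) -> Optional[List[int]]:
    """Return the start position of each motif if all motifs match in order
    without overlapping, otherwise None (toy criterion, assumed for this sketch)."""
    positions: List[int] = []
    start = 0
    for motif in fingerprint:
        width = len(motif[0])
        found = None
        # Take the leftmost window at or after `start` that reaches the threshold.
        for pos in range(start, len(sequence) - width + 1):
            if window_score(motif, sequence[pos:pos + width]) >= threshold:
                found = pos
                break
        if found is None:
            return None
        positions.append(found)
        start = found + width  # the next motif must begin after this one ends
    return positions


# Made-up two-motif fingerprint and sequences, for illustration only.
toy_fingerprint = [["GLWV", "GLFV"], ["CDEH", "CDQH"]]
print(match_fingerprint(toy_fingerprint, "MKGLFVAAACDEHTT"))
print(match_fingerprint(toy_fingerprint, "MKGLFVAAAT"))
```

Requiring every motif to match is a simplification. The point of the sketch is only that several separated motifs are tested jointly, rather than a single pattern.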


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press
