• We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
Logo of prosciprotein sciencecshl presssubscriptionsetoc alertsthe protein societyjournal home
Protein Sci. Aug 1995; 4(8): 1618–1632.
PMCID: PMC2143180

Gibbs motif sampling: detection of bacterial outer membrane protein repeats.


The detection and alignment of locally conserved regions (motifs) in multiple sequences can provide insight into protein structure, function, and evolution. A new Gibbs sampling algorithm is described that detects motif-encoding regions in sequences and optimally partitions them into distinct motif models; this is illustrated using a set of immunoglobulin fold proteins. When applied to sequences sharing a single motif, the sampler can be used to classify motif regions into related submodels, as is illustrated using helix-turn-helix DNA-binding proteins. Other statistically based procedures are described for searching a database for sequences matching motifs found by the sampler. When applied to a set of 32 very distantly related bacterial integral outer membrane proteins, the sampler revealed that they share a subtle, repetitive motif. Although BLAST (Altschul SF et al., 1990, J Mol Biol 215:403-410) fails to detect significant pairwise similarity between any of the sequences, the repeats present in these outer membrane proteins, taken as a whole, are highly significant (based on a generally applicable statistical test for motifs described here). Analysis of bacterial porins with known trimeric beta-barrel structure and related proteins reveals a similar repetitive motif corresponding to alternating membrane-spanning beta-strands. These beta-strands occur on the membrane interface (as opposed to the trimeric interface) of the beta-barrel. The broad conservation and structural location of these repeats suggests that they play important functional roles.

Full Text

The Full Text of this article is available as a PDF (6.2M).

Selected References

These references are in PubMed. This may not be the complete list of references from this article.
  • Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990 Oct 5;215(3):403–410. [PubMed]
  • Bairoch A, Boeckmann B. The SWISS-PROT protein sequence data bank. Nucleic Acids Res. 1992 May 11;20 (Suppl):2019–2022. [PMC free article] [PubMed]
  • Baldi P, Chauvin Y, Hunkapiller T, McClure MA. Hidden Markov models of biological primary sequence information. Proc Natl Acad Sci U S A. 1994 Feb 1;91(3):1059–1063. [PMC free article] [PubMed]
  • Barker WC, George DG, Mewes HW, Pfeiffer F, Tsugita A. The PIR-International databases. Nucleic Acids Res. 1993 Jul 1;21(13):3089–3092. [PMC free article] [PubMed]
  • Bennett PB, Jr, Makita N, George AL., Jr A molecular basis for gating mode transitions in human skeletal muscle Na+ channels. FEBS Lett. 1993 Jul 12;326(1-3):21–24. [PubMed]
  • Benson D, Lipman DJ, Ostell J. GenBank. Nucleic Acids Res. 1993 Jul 1;21(13):2963–2965. [PMC free article] [PubMed]
  • Bork P, Holm L, Sander C. The immunoglobulin fold. Structural classification, sequence patterns and common core. J Mol Biol. 1994 Sep 30;242(4):309–320. [PubMed]
  • Bosch D, Scholten M, Verhagen C, Tommassen J. The role of the carboxy-terminal membrane-spanning fragment in the biogenesis of Escherichia coli K12 outer membrane protein PhoE. Mol Gen Genet. 1989 Mar;216(1):144–148. [PubMed]
  • Brennan RG, Matthews BW. The helix-turn-helix DNA binding motif. J Biol Chem. 1989 Feb 5;264(4):1903–1906. [PubMed]
  • Cowan SW, Schirmer T, Rummel G, Steiert M, Ghosh R, Pauptit RA, Jansonius JN, Rosenbusch JP. Crystal structures explain functional properties of two E. coli porins. Nature. 1992 Aug 27;358(6389):727–733. [PubMed]
  • Gallegos MT, Michán C, Ramos JL. The XylS/AraC family of regulators. Nucleic Acids Res. 1993 Feb 25;21(4):807–810. [PMC free article] [PubMed]
  • Gribskov M, Lüthy R, Eisenberg D. Profile analysis. Methods Enzymol. 1990;183:146–159. [PubMed]
  • Gribskov M, McLachlan AD, Eisenberg D. Profile analysis: detection of distantly related proteins. Proc Natl Acad Sci U S A. 1987 Jul;84(13):4355–4358. [PMC free article] [PubMed]
  • Harpaz Y, Chothia C. Many of the immunoglobulin superfamily domains in cell adhesion molecules and surface receptors belong to a new structural set which is close to that containing variable domains. J Mol Biol. 1994 May 13;238(4):528–539. [PubMed]
  • Henikoff S, Henikoff JG. Automated assembly of protein blocks for database searching. Nucleic Acids Res. 1991 Dec 11;19(23):6565–6572. [PMC free article] [PubMed]
  • Henikoff S, Henikoff JG. Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A. 1992 Nov 15;89(22):10915–10919. [PMC free article] [PubMed]
  • Henikoff S, Henikoff JG. Protein family classification based on searching a database of blocks. Genomics. 1994 Jan 1;19(1):97–107. [PubMed]
  • Hunkapiller T, Hood L. The growing immunoglobulin gene superfamily. Nature. 1986 Sep 4;323(6083):15–16. [PubMed]
  • Jap BK, Walian PJ, Gehring K. Structural architecture of an outer membrane channel as determined by electron crystallography. Nature. 1991 Mar 14;350(6314):167–170. [PubMed]
  • Jeanteur D, Lakey JH, Pattus F. The bacterial porin superfamily: sequence alignment and structure prediction. Mol Microbiol. 1991 Sep;5(9):2153–2164. [PubMed]
  • Jin S, Sonenshein AL. Identification of two distinct Bacillus subtilis citrate synthase genes. J Bacteriol. 1994 Aug;176(15):4669–4679. [PMC free article] [PubMed]
  • Kaufmann A, Stierhof YD, Henning U. New outer membrane-associated protease of Escherichia coli K-12. J Bacteriol. 1994 Jan;176(2):359–367. [PMC free article] [PubMed]
  • Kreusch A, Neubüser A, Schiltz E, Weckesser J, Schulz GE. Structure of the membrane channel porin from Rhodopseudomonas blastica at 2.0 A resolution. Protein Sci. 1994 Jan;3(1):58–63. [PMC free article] [PubMed]
  • Krogh A, Brown M, Mian IS, Sjölander K, Haussler D. Hidden Markov models in computational biology. Applications to protein modeling. J Mol Biol. 1994 Feb 4;235(5):1501–1531. [PubMed]
  • Lawrence CE, Altschul SF, Boguski MS, Liu JS, Neuwald AF, Wootton JC. Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. Science. 1993 Oct 8;262(5131):208–214. [PubMed]
  • Mackett M, Conway MJ, Arrand JR, Haddad RS, Hutt-Fletcher LM. Characterization and expression of a glycoprotein encoded by the Epstein-Barr virus BamHI I fragment. J Virol. 1990 Jun;64(6):2545–2552. [PMC free article] [PubMed]
  • Morona R, Klose M, Henning U. Escherichia coli K-12 outer membrane protein (OmpA) as a bacteriophage receptor: analysis of mutant genes expressing altered proteins. J Bacteriol. 1984 Aug;159(2):570–578. [PMC free article] [PubMed]
  • Needleman SB, Wunsch CD. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 1970 Mar;48(3):443–453. [PubMed]
  • Neuwald AF, Green P. Detecting patterns in protein sequences. J Mol Biol. 1994 Jun 24;239(5):698–712. [PubMed]
  • Nikaido H. Porins and specific channels of bacterial outer membranes. Mol Microbiol. 1992 Feb;6(4):435–442. [PubMed]
  • Nikaido H. Porins and specific diffusion channels in bacterial outer membranes. J Biol Chem. 1994 Feb 11;269(6):3905–3908. [PubMed]
  • Pohlner J, Halter R, Beyreuther K, Meyer TF. Gene structure and extracellular secretion of Neisseria gonorrhoeae IgA protease. Nature. 325(6103):458–462. [PubMed]
  • Schirmer T, Cowan SW. Prediction of membrane-spanning beta-strands and its application to maltoporin. Protein Sci. 1993 Aug;2(8):1361–1363. [PMC free article] [PubMed]
  • Smith TF, Waterman MS. Identification of common molecular subsequences. J Mol Biol. 1981 Mar 25;147(1):195–197. [PubMed]
  • Staden R. Methods for calculating the probabilities of finding patterns in sequences. Comput Appl Biosci. 1989 Apr;5(2):89–96. [PubMed]
  • Stout V, Torres-Cabassa A, Maurizi MR, Gutnick D, Gottesman S. RcsA, an unstable positive regulator of capsular polysaccharide synthesis. J Bacteriol. 1991 Mar;173(5):1738–1747. [PMC free article] [PubMed]
  • Struyvé M, Moons M, Tommassen J. Carboxy-terminal phenylalanine is essential for the correct assembly of a bacterial outer membrane protein. J Mol Biol. 1991 Mar 5;218(1):141–148. [PubMed]
  • Treisman J, Harris E, Wilson D, Desplan C. The homeodomain: a new face for the helix-turn-helix? Bioessays. 1992 Mar;14(3):145–150. [PubMed]
  • Viale AM, Kobayashi H, Akazawa T, Henikoff S. rbcR [correction of rcbR], a gene coding for a member of the LysR family of transcriptional regulators, is located upstream of the expressed set of ribulose 1,5-bisphosphate carboxylase/oxygenase genes in the photosynthetic bacterium Chromatium vinosum. J Bacteriol. 1991 Aug;173(16):5224–5229. [PMC free article] [PubMed]
  • Vogel H, Jähnig F. Models for the structure of outer-membrane proteins of Escherichia coli derived from raman spectroscopy and prediction methods. J Mol Biol. 1986 Jul 20;190(2):191–199. [PubMed]
  • Weickert MJ, Adhya S. A family of bacterial regulators homologous to Gal and Lac repressors. J Biol Chem. 1992 Aug 5;267(22):15869–15874. [PubMed]
  • Weiss MS, Wacker T, Weckesser J, Welte W, Schulz GE. The three-dimensional structure of porin from Rhodobacter capsulatus at 3 A resolution. FEBS Lett. 1990 Jul 16;267(2):268–272. [PubMed]
  • Williams AF, Barclay AN. The immunoglobulin superfamily--domains for cell surface recognition. Annu Rev Immunol. 1988;6:381–405. [PubMed]

Articles from Protein Science : A Publication of the Protein Society are provided here courtesy of The Protein Society


Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


  • Cited in Books
    Cited in Books
    PubMed Central articles cited in books
  • PubMed
    PubMed citations for these articles

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...