Logo of prosciprotein sciencecshl presssubscriptionsetoc alertsthe protein societyjournal home
Protein Sci. 1999 Jun; 8(6): 1358–1361.
PMCID: PMC2144344

Simple sequence is abundant in eukaryotic proteins.


All proteins of Saccharomyces cerevisiae have been compared to determine how frequently segments from one protein are present in other proteins. Proteins that are recently evolutionarily related were excluded. The most frequently present protein segments are long, tandem repetitions of a single amino acid. For some of these segments, up to 14% of all proteins in the genome were found to have similar peptides within them. These peptide segments may not be functional protein domains. Although they are the most common shared feature of yeast proteins, their ubiquity and simplicity argue that their probable function may be to simply serve as spacers between other protein motifs.

Full Text

The Full Text of this article is available as a PDF (153K).

Selected References

These references are in PubMed. This may not be the complete list of references from this article.
  • Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990 Oct 5;215(3):403–410. [PubMed]
  • Cox GW, Taylor LS, Willis JD, Melillo G, White RL, 3rd, Anderson SK, Lin JJ. Molecular cloning and characterization of a novel mouse macrophage gene that encodes a nuclear protein comprising polyglutamine repeats and interspersing histidines. J Biol Chem. 1996 Oct 11;271(41):25515–25523. [PubMed]
  • de Souza SJ, Long M, Schoenbach L, Roy SW, Gilbert W. Intron positions correlate with module boundaries in ancient proteins. Proc Natl Acad Sci U S A. 1996 Dec 10;93(25):14632–14636. [PMC free article] [PubMed]
  • Di Como CJ, Bose R, Arndt KT. Overexpression of SIS2, which contains an extremely acidic region, increases the expression of SWI4, CLN1 and CLN2 in sit4 mutants. Genetics. 1995 Jan;139(1):95–107. [PMC free article] [PubMed]
  • Doolittle RF. The multiplicity of domains in proteins. Annu Rev Biochem. 1995;64:287–314. [PubMed]
  • Dorit RL, Schoenbach L, Gilbert W. How big is the universe of exons? Science. 1990 Dec 7;250(4986):1377–1382. [PubMed]
  • Duboule D, Haenlin M, Galliot B, Mohier E. DNA sequences homologous to the Drosophila opa repeat are present in murine mRNAs that are differentially expressed in fetuses and adult tissues. Mol Cell Biol. 1987 May;7(5):2003–2006. [PMC free article] [PubMed]
  • Gatti E, Popolo L, Vai M, Rota N, Alberghina L. O-linked oligosaccharides in yeast glycosyl phosphatidylinositol-anchored protein gp115 are clustered in a serine-rich region not essential for its function. J Biol Chem. 1994 Aug 5;269(31):19695–19700. [PubMed]
  • Gilbert W. Why genes in pieces? Nature. 1978 Feb 9;271(5645):501–501. [PubMed]
  • Gilbert W, de Souza SJ, Long M. Origin of genes. Proc Natl Acad Sci U S A. 1997 Jul 22;94(15):7698–7703. [PMC free article] [PubMed]
  • Goffeau A, Barrell BG, Bussey H, Davis RW, Dujon B, Feldmann H, Galibert F, Hoheisel JD, Jacq C, Johnston M, et al. Life with 6000 genes. Science. 1996 Oct 25;274(5287):546–567. [PubMed]
  • Heinonen TY, Pearlman RE. A germ line-specific sequence element in an intron in Tetrahymena thermophila. J Biol Chem. 1994 Jul 1;269(26):17428–17433. [PubMed]
  • Lin Y, Gross JK. Molecular cloning and characterization of winter flounder antifreeze cDNA. Proc Natl Acad Sci U S A. 1981 May;78(5):2825–2829. [PMC free article] [PubMed]
  • Mewes HW, Albermann K, Bähr M, Frishman D, Gleissner A, Hani J, Heumann K, Kleine K, Maierl A, Oliver SG, et al. Overview of the yeast genome. Nature. 1997 May 29;387(6632 Suppl):7–65. [PubMed]
  • Milbrandt J. A nerve growth factor-induced gene encodes a possible transcriptional regulatory factor. Science. 1987 Nov 6;238(4828):797–799. [PubMed]
  • O'Hara PJ, Horowitz H, Eichinger G, Young ET. The yeast ADR6 gene encodes homopolymeric amino acid sequences and a potential metal-binding domain. Nucleic Acids Res. 1988 Nov 11;16(21):10153–10169. [PMC free article] [PubMed]
  • Ohno S. Early genes that were oligomeric repeats generated a number of divergent domains on their own. Proc Natl Acad Sci U S A. 1987 Sep;84(18):6486–6490. [PMC free article] [PubMed]
  • Persengiev SP, Kilpatrick DL. Characterization of a cDNA containing trinucleotide repeat sequences that is highly enriched in spermatogenic cells. Mol Reprod Dev. 1997 Apr;46(4):476–481. [PubMed]
  • Shaw DR, Richter H, Giorda R, Ohmachi T, Ennis HL. Nucleotide sequences of Dictyostelium discoideum developmentally regulated cDNAs rich in (AAC) imply proteins that contain clusters of asparagine, glutamine, or threonine. Mol Gen Genet. 1989 Sep;218(3):453–459. [PubMed]
  • Sidén-Kiamos I, Favia G, Artiaco D, Saccone G, Furia M, Polito LC, Louis C. Opa-like repeats in the genome of the Medfly Ceratitis capitata. Genetica. 1993;92(1):43–53. [PubMed]
  • Sudo S, Fujikawa T, Nagakura T, Ohkubo T, Sakaguchi K, Tanaka M, Nakashima K, Takahashi T. Structures of mollusc shell framework proteins. Nature. 1997 Jun 5;387(6633):563–564. [PubMed]
  • Vai M, Gatti E, Lacanà E, Popolo L, Alberghina L. Isolation and deduced amino acid sequence of the gene encoding gp115, a yeast glycophospholipid-anchored protein containing a serine-rich region. J Biol Chem. 1991 Jul 5;266(19):12242–12248. [PubMed]
  • Wharton KA, Yedvobnick B, Finnerty VG, Artavanis-Tsakonas S. opa: a novel family of transcribed repeats shared by the Notch locus and other developmentally regulated loci in D. melanogaster. Cell. 1985 Jan;40(1):55–62. [PubMed]
  • White MJ, Hirsch JP, Henry SA. The OPI1 gene of Saccharomyces cerevisiae, a negative regulator of phospholipid biosynthesis, encodes a protein containing polyglutamine tracts and a leucine zipper. J Biol Chem. 1991 Jan 15;266(2):863–872. [PubMed]
  • Wolfe KH, Shields DC. Molecular evidence for an ancient duplication of the entire yeast genome. Nature. 1997 Jun 12;387(6634):708–713. [PubMed]
  • Yamamoto A, DeWald DB, Boronenkov IV, Anderson RA, Emr SD, Koshland D. Novel PI(4)P 5-kinase homologue, Fab1p, essential for normal vacuole function and morphology in yeast. Mol Biol Cell. 1995 May;6(5):525–539. [PMC free article] [PubMed]

Articles from Protein Science : A Publication of the Protein Society are provided here courtesy of The Protein Society


Save items

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...