• We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
Logo of narLink to Publisher's site
Nucleic Acids Res. Jan 1, 2003; 31(1): 216–218.
PMCID: PMC165501

Saccharomyces Genome Database (SGD) provides biochemical and structural information for budding yeast proteins


The Saccharomyces Genome Database (SGD: http://genome-www.stanford.edu/Saccharomyces/) has recently developed new resources to provide more complete information about proteins from the budding yeast Saccharomyces cerevisiae. The PDB Homologs page provides structural information from the Protein Data Bank (PDB) about yeast proteins and/or their homologs. SGD has also created a resource that utilizes the eMOTIF database for motif information about a given protein. A third new resource is the Protein Information page, which contains protein physical and chemical properties, such as molecular weight and hydropathicity scores, predicted from the translated ORF sequence.


The Saccharomyces Genome Database (SGD) collects, organizes, and presents information about the molecular biology and genetics of the budding yeast Saccharomyces cerevisiae. SGD contains diverse types of biological data and provides tools for their search and analysis. Information in SGD is generally organized around a gene; each gene in the genome has a Locus page (2), which contains basic information about that gene such as Gene Ontology (GO) annotations (1,3) and phenotype data as well as links to additional tools and resources. The protein resources described below are newly provided links located in the Protein Info and Structure Resources section of the Locus page.

The Protein Data Bank (PDB) Homologs page provides a list of structures available in PDB (7) relevant to a given S. cerevisiae protein (Fig. (Fig.1A).1A). The list is generated by comparing the sequences of the systematically defined S. cerevisiae proteins against the protein sequences within PDB using the Smith–Waterman (6) sequence comparison algorithm. All PDB sequences with a p-value of 0.01 or less are presented, regardless of species. Thus, if the structure of an S. cerevisiae protein is unknown, the structure of the homo-logous protein in another species may be available. For each PDB homolog, Smith–Waterman scores are provided, with links to PDB, other protein databases, and an alignment page. The alignment page (Fig. (Fig.1B)1B) displays the Smith–Waterman alignment of the S. cerevisiae protein against the PDB protein, a colour ribbon image of the structure, and a link to the interactive QuickPDB structure viewer provided by PDB.

Figure 1Figure 1
Protein structure information at SGD: the new PDB Homologs tool. Due to space considerations, both (A) and (B) are just a portion of the web display. (A) The PDB Homologs results page lists the results of the Smith–Waterman sequence comparison ...

SGD has recently released another protein information tool. The new Motif tool displays the motifs in a particular protein using results from the eMOTIF resource (4). The Motif results page includes an image illustrating all the motifs in the protein of interest and also lists all the other S. cerevisiae proteins that share these motifs. Links to other motif databases are also provided.

The third new protein resource in SGD is the Protein Information page. This page contains calculated data based on protein sequence, such as molecular weight, amino acid content, protein length, pI, and codon adaptation index (5). Links are provided to graphical displays of hydropathy plots, helical wheels and other tools to predict secondary structure. SGD is developing enhancements to this page and will soon be incorporating predicted transmembrane domains, signal sequences and other types of data that can be generated based on protein sequence. All of these data for the entire predicted yeast proteome are available for download on the SGD anonymous FTP site at: ftp://genome-ftp.stanford.edu/yeast/data_download/protein_info/.

From its beginning in 1994, SGD has provided biological information about yeast genes as well as search and analysis tools. We are now expanding our scope to provide more protein information by developing tools like the PDB Homolog, Motif, and Protein Information pages. SGD is also creating resources for global analyses, in addition to the gene-by-gene tools and resources currently provided. As more eukaryotic genomes are completely sequenced, comparative genomics will become an increasingly powerful method for solving biological problems; thus, SGD is exploring new ways of presenting the results of these and other types of genome-wide studies. Check the SGD home page at http://genome-www.stanford.edu/Saccharomyces/ for announcements as these new tools become available. Supplemental material for this paper can be found online at: http://genome-www.stanford.edu/Saccharomyces/help/NAR2003Supplement.html.


1. Ashburner M., Ball,C.A., Blake,J.A., Botstein,D., Butler,H., Cherry,J.M., Davis,A.P., Dolinski,K., Dwight,S.S., Eppig,J.T., Harris,M.A., Hill,D.P., Issel-Tarver,L., Kasarskis,A., Lewis,S., Matese,J.C., Richardson,J.E., Ringwald,M., Rubin,G.M. and Sherlock,G. (2000) Gene ontology: tool for the unification of biology. Nature Genet., 25, 25–29. [PMC free article] [PubMed]
2. Ball C.A., Dolinski,K., Dwight,S.S., Harris,M.A., Issel-Tarver,L., Kasarskis,A., Scafe,C.R., Sherlock,G., Binkley,G., Jin,H., Kaloper,M., Orr,S.D., Schroeder,M., Weng,S., Zhu,Y., Botstein,D. and Cherry,J.M. (2000) Integrating functional genomic information into the Saccharomyces genome database. Nucleic Acids Res., 28, 77–80. [PMC free article] [PubMed]
3. Dwight S.S., Harris,M.A., Dolinski,K., Ball,C.A., Binkley,G., Christie,K.R., Fisk,D.G., Issel-Tarver,L., Schroeder,M., Sherlock,G., Sethuraman,A., Weng,S., Botstein,D. and Cherry,J.M. (2002) Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO). Nucleic Acids Res., 30, 69–72. [PMC free article] [PubMed]
4. Huang J.Y. and Brutlag,D.L. (2001) The EMOTIF database. Nucleic Acids Res., 29, 202–204. [PMC free article] [PubMed]
5. Sharp P.M. and Li,W.H. (1987) The codon Adaptation Index—a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res., 15, 1281–1295. [PMC free article] [PubMed]
6. Smith T.F. and Waterman,M.S. (1981) Identification of common molecular subsequences. J. Mol. Biol., 147, 195–197. [PubMed]
7. Westbrook J., Feng,Z., Jain,S., Bhat,T.N., Thanki,N., Ravichandran,V., Gilliland,G.L., Bluhm,W., Weissig,H., Greer,D.S., Bourne,P.E. and Berman,H.M. (2002) The Protein Data Bank: unifying the archive. Nucleic Acids Res., 30, 245–248. [PMC free article] [PubMed]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press
PubReader format: click here to try


Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...