![]() | ![]() |
Formats:
|
||||||||||
Copyright © 2005 Oxford University Press Fungal BLAST and Model Organism BLASTP Best Hits: new comparison resources at the Saccharomyces Genome Database (SGD) Department of Genetics, School of Medicine, Stanford University, Stanford, CA 94305-5120, USA and 1Lewis-Sigler Institute for Integrative Genomics, Carl Icahn Laboratory, Princeton University, Washington Road, Princeton, NJ 08544, USA *To whom correspondence should be addressed. Tel: +1 650 723 7541; Fax: +1 650 725 1534; Email: cherry/at/genome.stanford.edu aThe online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use permissions, please contact journals.permissions/at/oupjournals.org. a© 2005, the authors Received September 15, 2004; Revised September 21, 2004; Accepted September 21, 2004. This article has been cited by other articles in PMC.Abstract The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) is a scientific database of gene, protein and genomic information for the yeast Saccharomyces cerevisiae. SGD has recently developed two new resources that facilitate nucleotide and protein sequence comparisons between S.cerevisiae and other organisms. The Fungal BLAST tool provides directed searches against all fungal nucleotide and protein sequences available from GenBank, divided into categories according to organism, status of completeness and annotation, and source. The Model Organism BLASTP Best Hits resource displays, for each S.cerevisiae protein, the single most similar protein from several model organisms and presents links to the database pages of those proteins, facilitating access to curated information about potential orthologs of yeast proteins. INTRODUCTION The Saccharomyces Genome Database (SGD) collects and organizes biological information about genes and proteins of Saccharomyces cerevisiae, and presents this information on individual Locus Pages for each yeast gene (1,2). In addition to assembling a detailed library of information about S.cerevisiae, we continually strive to develop tools and resources that allow users to identify connections between S.cerevisiae genes and proteins and those from different species. These connections may help researchers studying other organisms to glean knowledge from more extensively studied S.cerevisiae genes, or may enhance the study of S.cerevisiae genes with data from other organisms. These resources include the Gene Ontology (3) as well as comparison resources such as PSI-BLAST analyses and the Synteny and Fungal Alignment Viewers (1). The Fungal BLAST and Model Organism BLASTP Best Hits tools, described here, are two new SGD resources that extend the users' reach beyond S.cerevisiae by allowing a variety of sequence comparisons. The Fungal BLAST tool may be used for comparison of any sequence of choice with a wide range of fungal nucleotide or protein sequences, while the Model Organism BLASTP Best Hits resource specifically makes connections between each S.cerevisiae protein and its best hit in protein sets from several other model organisms. FUNGAL BLAST The numerous publicly available fungal sequences in GenBank provide a rich source of information for the identification of conserved, functionally important coding and non-coding sequences. For example, recent large-scale comparisons among related fungal species have sparked new insights into the evolution of chromosome structure and regulatory sequences (4,5). This information also initiated revisions of the S.cerevisiae genome sequence (4–8). While bioinformatics approaches to sequence comparisons have been invaluable for gaining a broad understanding of genomes, single gene comparisons across species are often useful to researchers focused on particular areas of biology. The Fungal BLAST tool is designed to put these sorts of comparisons into the hands of researchers who concentrate on single loci or gene families. The Fungal BLAST tool uses the WU-BLAST software (9) to compare any query nucleotide or protein sequence to fungal sequence datasets at GenBank. These include genome sequences from multiple Saccharomyces species (including S.cerevisiae, S.bayanus, S.castellii, S.kluyveri, S.kudriavzevii, S.mikatae, S.paradoxus) as well as sequences from genome projects, ESTs and other available sequences from all phyla in the kingdom Fungi. The sequences are updated periodically, and new fungal sequence datasets are added. The current list of species whose genomic sequences are included in Fungal BLAST analysis at SGD is provided on the Fungal BLAST help page (http://www.yeastgenome.org/help/fungal-blast.html). Although these sequences are available for searching at NCBI, the Fungal BLAST tool facilitates faster, directed searching by dividing the sequences into searchable subcategories according to organism, the status of genome sequencing and annotation (Complete Genomes, Annotated Genomes, Assembled Genomes, etc.), and type of sequence (e.g. Mitochondrial, EST, etc.). In order to accommodate different types of query and target sequence, the Fungal BLAST tool offers four BLAST programs:
The ability to analyze this broad spectrum of fungal sequence data provides a powerful means to identify sequences conserved through evolution and presumably important for the biology of the organisms. Limiting the genomes to those from fungi allows researchers to identify signatures in fungal-specific genes and gene products that might not be discernible by comparing sequences across kingdoms. MODEL ORGANISM BLASTP BEST HITS Comparison of sequences across diverse taxa is a powerful technique for finding universally conserved domains. If a curated database page exists for a protein of another organism to which a S.cerevisiae protein has similarity, the significance of the sequence conservation may become clear. To help our users find curated information concerning protein sequences conserved between S.cerevisiae and other organisms, SGD has developed the Model Organism BLASTP Best Hits page. The Model Organism BLASTP Best Hits page (Figure (Figure1)1
Out of the 6591 protein coding S.cerevisiae genes, 5368 have a hit in at least one other model organism database (MOD) while 2387 ORFs have a hit in all 5 MODs (excluding SGD). Of the 78 dubious ORFs (considered unlikely to encode a protein), five had hits in the Ashbya gossypii dataset and the rest had hits only within the S.cerevisiae protein dataset (Table 2).
The Model Organism BLASTP Best Hits results page (Figure (Figure1)1 SUMMARY SGD is continually expanding its resources to increase the ease of access to information about genes and proteins from fungi and other organisms. The Fungal BLAST and the Model Organism BLASTP Best Hits resources allow easy identification and examination of the conserved sequence regions in fungal genomes and facilitate the use of S.cerevisiae as a model organism and reference for comparison with other species. This will further aid in understanding the function and evolution of these sequences. REFERENCES 1. Christie K.R., Weng,S., Balakrishnan,R., Costanzo,M.C., Dolinski,K., Dwight,S.S., Engel,S.R., Feierbach,B., Fisk,D.G., Hirschman, et al. (2004) Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms. Nucleic Acids. Res., 32, D311–D314. [PubMed] 2. Dwight S.S., Balakrishnan,R., Christie,K.R., Costanzo,M.C., Dolinski,K., Engel,S.R., Feierbach,B., Fisk,D.G., Hirschman,J.E., Hong,E.L. et al. (2004) Saccharomyces genome database: underlying principles and organisation. Brief. Bioinformatics, 5, 9–22. [PubMed] 3. Dwight S.S., Harris,M.A., Dolinski,K., Ball,C.A., Binkley,G., Christie,K.R., Fisk,D.G., Issel-Tarver,L., Schroeder,M., Sherlock,G. et al. (2002) Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO). Nucleic Acids Res., 30, 69–72. [PubMed] 4. Kellis M., Patterson,N., Endrizzi,M., Birren,B. and Lander,E.S. (2003) Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature, 423, 241–254. [PubMed] 5. Cliften P., Sudarsanam,P., Desikan,A., Fulton,L., Fulton,B., Majors,J., Waterston,R., Cohen,B.A. and Johnston,M. (2003) Finding functional features in Saccharomyces genomes by phylogenetic footprinting. Science, 301, 71–76. [PubMed] 6. Blandin G., Durrens,P., Tekaia,F., Aigle,M., Bolotin-Fukuhara,M., Bon,E., Casaregola,S., de Montigny,J., Gaillardin,C., Lepingle,A. et al. (2000) Genomic exploration of the hemiascomycetous yeasts: 4. The genome of Saccharomyces cerevisiae revisited. FEBS Lett., 487, 31–36. [PubMed] 7. Brachat S., Dietrich,F.S., Voegeli,S., Zhang,Z., Stuart,L., Lerch,A., Gates,K., Gaffney,T. and Philippsen,P. (2003) Reinvestigation of the Saccharomyces cerevisiae genome annotation by comparison to the genome of a related fungus: Ashbya gossypii. Genome Biol., 4, R45. [PubMed] 8. Dietrich F.S., Voegeli,S., Brachat,S., Lerch,A., Gates,K., Steiner,S., Mohr,C., Pohlmann,R., Luedi,P., Choi,S. et al. (2004) The Ashbya gossypii genome as a tool for mapping the ancient Saccharomyces cerevisiae genome. Science, 304, 304–307. [PubMed] 9. Altschul S.F. and Gish,W. (1996) Local alignment statistics. Methods Enzymol., 266, 460–480. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||
Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D311-4.
[Nucleic Acids Res. 2004]Brief Bioinform. 2004 Mar; 5(1):9-22.
[Brief Bioinform. 2004]Nucleic Acids Res. 2002 Jan 1; 30(1):69-72.
[Nucleic Acids Res. 2002]Nature. 2003 May 15; 423(6937):241-54.
[Nature. 2003]Science. 2003 Jul 4; 301(5629):71-6.
[Science. 2003]Science. 2004 Apr 9; 304(5668):304-7.
[Science. 2004]Methods Enzymol. 1996; 266():460-80.
[Methods Enzymol. 1996]