NCBI LogoNCBI News Masthead



In this issue...

Model Maker

Virus Reference
Sequences


Release of the
1,000th Virus
Reference Genome


New MapViewer
Displays


Other Genomic
MapViews


Mouse Genome
BLAST


New Genomes
in GenBank


Organism-Specific
BLAST


ProtEST

Trace Archive
Expands


BLAST Lab

Find Out
“About NCBI”


New FTP
Hierarchy


Barbara Rapp
Leaves NCBI


Masthead


Virus Reference Sequences


Release of the 1,000th Virus Reference Genome!

In April 2002, NCBI released the 1,000th virus reference genome into the public databases. The release of the 1,000th viral RefSeq is the fruit of a successful collaborative effort between the NCBI RefSeq team and numerous external experts who have expended much effort in annotating the virus reference genomes. These genomes, a part of NCBI’s Reference Sequence collection, are available from Entrez Genomes.

What is the 1,000th viral genome RefSeq? Read about this interesting virus in the summer issue!

The Virus Reference Genome project aims to provide molecular standards for viral genomic research. Viral RefSeqs start with one well-studied and/or best annotated full-length genomic sequence taken from GenBank. Each virus RefSeq genome is then curated by NCBI staff with the aid of outside advisors. The curation process results in the addition of relevant biological information, taken from the literature or other sequence records, as well as the correction of taxonomy and lineages. Viral RefSeqs are searchable in Entrez Genomes, which contains over 1,100 reference sequences for over 800 viral genomes. Access Viral Genomes using the link from the Related Resources section on the Entrez Genomes home page.

The Viral Genomes page includes a general introduction to viruses, a depiction of a scheme for the replication of influenza virus A and a variety of genome retrieval tools. Additionally, there are links to a complete alphabetical list of all reference genomes and to lists in which viruses are grouped by families/genera or genome type (e.g. dsDNA viruses, ssRNA posi-tive strand viruses). In the list, each virus name shown in green italics is hyperlinked to a graphical representation of the genome— the link to the actual genome record is accessible from the genome accession number (the number that begins with the letters “NC”). The rightmost column “Nbrs” of the viruses’ list leads to Genome Neighbors, which are related complete genomes found in GenBank. When available, links to the protein table for each virus genome are also displayed.

The Viral Genomes Finder, located on the main page and linked from the Tools section, allows the selection of all viruses, or viral genomes, that fall within a particular taxonomic node. The Query Tips offer help on how to formulate the search terms. For example, to retrieve all virus genomes within a particular taxonomic node, one must capitalize the first letter in the search word (i.e., “Flaviviridae”). To go to a Taxonomy Browser display of all viruses in a node, make sure the initial letter of the query is in lowercase.

Links from the blue sidebar menu include general over-views of viruses, statistics regarding virus reference genomes, FAQs, and help documentation on searching and viewing viral genome records. There are also links to external sites dedicated to virus biology, taxonomy and nomenclature, and sequences. The Help document presents samples of graphical formats and data layouts of the virus records in the database.

Figure 1
Click on figure to view enlarged version

Figure 1:Entrez Genomes graphical view of the viral RefSeq for the acute bee paralysis virus.

The sample record of Figure 1 shows the graphical view of a virus genome, which can have a linear or circular, monopartite or segmented architecture. The record gives a link to the GenBank flatfile record, the total number of nucleotides in the genome, and one or more literature citations. Also included are taxonomic information, additional comments about the reference sequence record, a reference to the source record from which it was derived, and the status of the record; either provisional or reviewed. Links in the blue sidebar lead to a table that lists the genes and protein products of the genome (Coding Regions), and to the Microbial Genomes BLAST program, which allows users to BLAST query sequences against the finished and unfinished microbial genomes database.

Virus genome reference sequences are added to the database daily and more analytical programs will be introduced in the near future. 
—VP


Continue


NCBI News | Spring 2002 NCBI News Footer NCBI News