NCBI LogoNCBI News Masthead


In this issue...

Model Maker

Virus Reference
Sequences


Release of the
1,000th Virus
Reference Genome


New MapViewer
Displays


Other Genomic
MapViews


Mouse Genome
BLAST


New Genomes
in GenBank


Organism-Specific
BLAST


ProtEST

Trace Archive
Expands


BLAST Lab

Find Out
“About NCBI”


New FTP
Hierarchy


Barbara Rapp
Leaves NCBI


Masthead

Mouse Genome BLAST: Scan NCBI Contigs
and Three Draft Assemblies


New Genomes in GenBank!

Visit the Entrez Genomes Web page to see the latest batch of new genomes to enter GenBank. There are now over 70 complete bacterial and more than 15 complete archaeal genomes in GenBank. A couple of recent arrivals are:

Pyrococcus furiosus: An anerobic, hyperthermophilic member of the archaea that cannot live below 70 degrees Fahrenheit. Pyrococcus furiosus dwells near undersea thermal vents, where it uses peptides as a carbon source and sulfur as a final electron acceptor. The genome of Pyrococcus furiosus features 2,065 annotated genes and 105 annotated structural RNAs. The Entrez Genomes report indicates that 631 Pyrococcus furiosus proteins bear significant sequence similarity to proteins of known 3-D structure.

Schizosaccharomyces pombe: The popular fission-yeast so vital to molecular biologists and first isolated from an East African beer, called “pombe”. This genome can be viewed in the NCBI MapViewer, which displays a Gene, Genetic, and Clone map for each of the three chromosomes and integrates a total of 5,095 genes, 169 genetic markers, and 582 clones. Other features annotated on this genome include promoters, introns, and repeat regions.

BLASTing against the growing body of mouse genomic data is easier than ever using the Mouse Genome BLAST page, which supports both untranslated and translated searches of a variety of mouse genome databases, including NCBI contigs and three draft assemblies of the genome.

The default database for Mouse Genome BLAST is the database of curated NCBI contigs, called “NT_Contigs”. These are assemblies of finished mouse BAC clones that are annotated with SNPs and STSs. Other databases include HTGS, Traces, BAC-Ends, Reference mRNAs and proteins from the NCBI RefSeq project, and ESTs.

The HTGS database is comprised of draft and finished sequence as submitted by the sequencing centers. The “Traces” database contains the mouse Whole Genome Shotgun (WGS) traces. The WGS Traces database may be searched with MegaBLAST, while the other databases may be queried with either nucleotide or protein sequences using the suite of blastn, blastp, tblastn, and blastx sequence-similarity search programs.

In addition, three assemblies of the mouse genome are available for searching. The first two, Phusion™ and Arachne, are preliminary assemblies of the mouse WGS reads based on a February data freeze and were created by the Sanger Center and the Whitehead Institute respectively. The third, the MGSCv3 assembly, produced by the Mouse Genome Sequencing Consortium, is comprised of WGS scaffolds generated using the end pairing information from the WGS reads. All three assemblies should be considered to be preliminary and will change as new data is added.



Continue


NCBI News | Spring 2002 NCBI News Footer NCBI News