NCBI logo What's New Archive  
PubMed Entrez BLAST OMIM Books TaxBrowser Structure

NCBI
Back to NCBI homepage
Back to NCBI homepage
SITE MAP


What's New Archive

2001
2000
1999
1998
1997
1996
1995
1994
 
BACK

 

bullet1997

12/19 Nucleotide Sequence Databases' Feature Table Documentation
Version 2.0 of the Feature Table documentation is available through the web and FTP. The document details the format and definitions of the biological annotation used in common by the International Nucleotide Sequence Database Collaboration in the creation and maintenance of DDBJ, EMBL, and GenBank sequence records.

11/26 Malaria Genetics and Genomics A new set of Web pages devoted to Plasmodium falciparum genetics and genomics is now available. Pages include information relevant to P. falciparum chromosome mapping and sequencing, including linkage markers, physical maps, and DNA sequences. The Web site also includes graphical views and the nucleotide sequence of a chromosome 7 segment containing a candidate chloroquine resistance gene. This site is expected to grow in scope and will be integrated with other P. falciparum genome-related Web pages being developed at NCBI.

10/30 Clusters of Orthologous Groups Clusters of Orthologous Groups (COGs) were delineated by comparing protein sequences encoded in seven complete genomes, representing five major phylogenetic lineages. Each COG consists of individual proteins or groups of paralogs from at least three lineages and thus corresponds to an ancient conserved domain. See: Science 1997 Oct 24;278(5338):631-637.

10/20 HTG Sequences A new set of Web pages describes High Throughput Genome (HTG) sequences in GenBank. These pages include information for both regular GenBank users and sequencing centers about HTG sequence submission, processing, and access.

09/10 PSI-BLAST A new service called Position-Specific Iterated BLAST, or PSI-BLAST, is now available in Version 2.0 of the BLAST program suite. Currently, PSI-BLAST may be used only for comparing protein queries with protein databases. Given a query, PSI-BLAST performs an initial gapped BLAST search of the database. In subsequent iterations, it uses statistically significant alignments from the previous search to construct a position-specific score matrix for use, in place of the query and standard amino acid substitution matrix, in the next round of searching. PSI-BLAST is under development and may change substantially over time.

08/19 Gaps in BLAST Version 2.0 of BLAST allows the introduction of gaps (deletions and insertions) into alignments. With a gapped alignment tool, homologous domains do not have to be broken into several segments. Also, the scoring of gapped results tends to be more biologically meaningful than ungapped results.

The programs, blastn and blastp, offer fully gapped alignments. blastx and tblastn have "in-frame" gapped alignments and use sum statistics to link alignments from different frames. tblastx provides only ungapped alignments.

08/15 4800 EST Sequences Recalled Because of recently discovered uncertainty about the true nature of the tissue source, 4829, EST sequences originating from the "Life Tech mouse brain" cDNA library have been withdrawn from dbEST and GenBank.

08/12 Complete Genome, Helicobacter pylori The complete genome sequence and annotation of Helicobacter pylori prepared by The Institute for Genomic Research (TIGR) is now available through Entrez Genome, as well as in GenBank, where the 1.7 megabase sequence has been separated into 134 records of approximately 11,000 bp each. The graphical view (as well as a link to underlying data) of the complete genome is present in the Genomes division of Entrez.

The complete sequence is also available by anonymous FTP; see the README.

08/12 Zea mays Maps The UMC (University of Missouri-Columbia RFLP Laboratory) and BNL (Brookhaven National Laboratory) maps are now available in the Entrez Genomes division. These maps are linked to a sequence map generated using available Zea mays sequences deposited in GenBank. Further information may be obtained at the Maize Genome Database.

08/01 CGAP Announced by Vice President Gore and NCI Director Klausner Vice President Gore announced the availability of the NCI's Cancer Genome Anatomy Project (CGAP) Web site at a Capitol Hill press conference today. The CGAP site, jointly developed by NCI and NCBI, brings together data on gene expression in normal, precancerous, and malignant tissues. The data are the result of collaborations among many government, academic, and industrial laboratories.

07/03 BLAST for Newly Released Microbial Genomes from TIGR A new BLAST service for the recently released microbial genomes from The Institute for Genomic Research (TIGR) is now available. These unfinished genomes (see details) are not yet available in GenBank or Entrez. The genomes are searchable via TBLASTN (a user's protein query vs. a 6-frame translation of the microbial DNA sequences) using the new gapped BLAST algorithm ( Altschul et al., 1997, submitted).

06/26 PubMed Announced by Vice President Gore, NIH Officials The availability of PubMed was officially announced at a Capitol Hill press conference by Vice President Gore, Senators Harkin, and Specter and the Directors of NIH, NLM, and NCBI.

03/13 Cn3D
68K Macintosh Version
A 68K Macintosh version of Cn3D, a 3D structure viewer, is now available.


01/25 Complete Genome, E. coli The complete genome sequence and annotation of Escherichia coli as prepared by Fred Blattner and colleagues is now available via Entrez Genome division, GenBank, and the "nr" BLAST databases.

In Entrez Genome, the entire E. coli genome can be examined and explored at once. Links to the underlying data used to build the graphical representation are provided. In addition, entire-genome data files (GenBank flatfile, FASTA, etc.) for E. coli are available from NCBI's FTP server. See the README file for a description of the various files in the genomes FTP directory at ftp.ncbi.nih.gov.

In GenBank itself, the 4.6 megabase E. coli sequence has been split into 400 records of approximately 11,500 bp each. These sequences are available via the usual GenBank Update data files that are available from NCBI's FTP site.

The Japanese Escherichia coli database has also assembled the E. coli genome and has made their data available on their World Wide Web server as well as in a FASTA file present on the NCBI FTP server.

01/24 THC BLAST A new BLAST service for Tentative Human Consensus Sequences (THCs) is now available. THCs have been generated by The Institute for Genomic Research (TIGR) by automatic assembly of ESTs from world-wide EST projects into virtual transcripts.

Revised: June 6, 2002.