Display Settings:

Format

Send to:

Choose Destination
We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
Science. 1997 Sep 5;277(5331):1453-62.

The complete genome sequence of Escherichia coli K-12.

Author information

  • 1Laboratory of Genetics, University of Wisconsin-Madison, 445 Henry Mall, Madison, WI 53706, USA. ecoli@genetics.wisc.edu

Abstract

The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer.

Comment in

  • Laboratory workhorse decoded. [Science. 1997]
  • Science. 1998 Mar 20;279(5368):1827.
PMID:
9278503
[PubMed - indexed for MEDLINE]
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Icon for HighWire
    Loading ...
    Write to the Help Desk