Sign in to NCBI
skip to main content

Sequence Set Browser

 

AXUG00000000.1 Escherichia coli 113303

Master
# of Contigs: 149
# of Proteins: 5,191
# of Scaffolds/Chrs: 101
Total length: 5,018,165 bp
BioProject: PRJNA183801
BioSample: SAMN02436736
Keywords: WGS
Annotation: Contigs
Organism: Escherichia coli 113303show lineagehide lineage
Biosource:
/host = Homo sapiens
/mol_type = genomic
/strain = 113303
WGS: AXUG01000001:AXUG01000149
Scaffolds: KI521919:KI522019
101 scaffolds, total length is 5,022,965 bases
Submission:
Submitted (26-JUN-2013) The Genome Institute, Washington University School of Medicine, 4444 Forest Park, St. Louis, MO 63108, USA – show 18 authorshide authors
Weinstock,G., Sodergren,E., Wylie,T., Fulton,L., Fulton,R., Fronick,C., O'Laughlin,M., Godfrey,J., Miner,T., Herter,B., Appelbaum,E., Cordes,M., Lek,S., Wollam,A., Pepin,K.H., Palsikar,V.B., Mitreva,M., Wilson,R.K.

The Escherichia coli 113303 whole genome shotgun (WGS) project has the project accession AXUG00000000. This version of the project (01) has the accession number AXUG01000000, and consists of sequences AXUG01000001-AXUG01000149.

Bacteria provided by David Creely and William Dunne (BioMerieux, Inc., 595 Anglum Road, Hazelwood, MO 63042). Coding sequences were predicted using GeneMark and Glimmer3. Intergenic regions not spanned by GeneMark and Glimmer3 were blasted against NCBI's non-redundant (NR) database and predictions generated based on protein alignments. tRNA genes were determined using tRNAscan-SE and non-coding RNA genes by RNAmmer and Rfam. The final gene set is processed through several programs such as Kegg, psortB and Interproscan to determine possible function. Gene product names are determined by BER. Gene names are generated at the contig level and may not necessarily reflect any known order or orientation between contigs. This is a reference genome for the Human Microbiome Project. This project is co-owned with the Human Microbiome Project DACC. This work was funded by the National Human Genome Research Institute (NHGRI)/National Institutes of Health (NIH) grant 5U54HG00496804 for characterization of this genome.

##Genome-Assembly-Data-START##
Finishing Goal : High-Quality Draft
Current Finishing Status : High-Quality Draft
Assembly Method : Velvet v. 1.1.06
Genome Coverage : 79x
Sequencing Technology : Illumina
##Genome-Assembly-Data-END##
Contigs
Proteins
Download
GenBank:AXUG01.1.gbff.gz 3.7 Mb
FASTA:AXUG01.1.fsa_nt.gz 1.5 Mb
ASN.1:AXUG01.1.bbs.gz 2.9 Mb