Sequence Set Browser

AXTH00000000.1 Escherichia coli 907710

Master
# of Contigs: 262
# of Proteins: 4,899
# of Scaffolds/Chrs: 125
Total length: 4,767,159 bp
BioProject: PRJNA183808
BioSample: SAMN02436877
Keywords: WGS
Annotation: Contigs
Organism: Escherichia coli 907710
Biosource:
/host = Homo sapiens
/mol_type = genomic
/strain = 907710
WGS: AXTH01000001:AXTH01000262
Scaffolds: KI531601:KI531725
125 scaffolds, total length is 4,780,859 bases
Submission:
Submitted (26-JUN-2013) The Genome Institute, Washington University School of Medicine, 4444 Forest Park, St. Louis, MO 63108, USA
Weinstock,G., Sodergren,E., Wylie,T., Fulton,L., Fulton,R., Fronick,C., O'Laughlin,M., Godfrey,J., Miner,T., Herter,B., Appelbaum,E., Cordes,M., Lek,S., Wollam,A., Pepin,K.H., Palsikar,V.B., Mitreva,M., Wilson,R.K.

The Escherichia coli 907710 whole genome shotgun (WGS) project has the project accession AXTH00000000. This version of the project (01) has the accession number AXTH01000000, and consists of sequences AXTH01000001-AXTH01000262.
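The accession range above can be expanded into individual contig accessions. The following is a minimal sketch, assuming the layout seen on this page (a 4-letter project prefix, a 2-digit version, and a 6-digit contig serial); other WGS accession layouts are not handled, and the function name `expand_wgs_range` is hypothetical.

```python
import re

# Assumed layout, taken from the accessions on this page:
# 4-letter project prefix + 2-digit version + 6-digit contig serial.
_WGS_ACCESSION = re.compile(r"^([A-Z]{4})(\d{2})(\d{6})$")


def expand_wgs_range(first, last):
    """Return every accession from `first` to `last`, inclusive."""
    m_first = _WGS_ACCESSION.match(first)
    m_last = _WGS_ACCESSION.match(last)
    if not m_first or not m_last:
        raise ValueError("accession does not match the assumed WGS layout")
    if m_first.group(1, 2) != m_last.group(1, 2):
        raise ValueError("range must stay within one project and version")

    prefix, version = m_first.group(1), m_first.group(2)
    start, end = int(m_first.group(3)), int(m_last.group(3))
    if start > end:
        raise ValueError("range start is after range end")

    # Zero-pad the serial back to six digits.
    return [f"{prefix}{version}{n:06d}" for n in range(start, end + 1)]


contigs = expand_wgs_range("AXTH01000001", "AXTH01000262")
print(len(contigs), contigs[0], contigs[-1])
```

The length of the resulting list agrees with the contig count given in the record header (262).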

Bacteria provided by David Creely and William Dunne (BioMerieux, Inc., 595 Anglum Road, Hazelwood, MO 63042). Coding sequences were predicted using GeneMark and Glimmer3. Intergenic regions not spanned by GeneMark and Glimmer3 predictions were searched with BLAST against NCBI's non-redundant (NR) database, and additional predictions were generated from the protein alignments. tRNA genes were identified using tRNAscan-SE, and non-coding RNA genes using RNAmmer and Rfam. The final gene set was processed through several programs, including KEGG, PSORTb, and InterProScan, to determine possible function. Gene product names were determined by BER. Gene names are generated at the contig level and may not necessarily reflect any known order or orientation between contigs. This is a reference genome for the Human Microbiome Project. This project is co-owned with the Human Microbiome Project DACC. This work was funded by the National Human Genome Research Institute (NHGRI)/National Institutes of Health (NIH) grant 5U54HG00496804 for characterization of this genome.

##Genome-Assembly-Data-START##
Finishing Goal : High-Quality Draft
Current Finishing Status : High-Quality Draft
Assembly Method : Velvet v. 1.1.06
Genome Coverage : 83x
Sequencing Technology : Illumina
##Genome-Assembly-Data-END##
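The assembly block above is a delimited set of key/value lines, so it can be read into a dictionary. This is a rough sketch that follows the layout shown on this page (`##<name>-START##` and `##<name>-END##` markers, with a colon between key and value); the function name `parse_structured_comment` is hypothetical, and other renderings of the same record may use a different separator.

```python
def parse_structured_comment(text, name="Genome-Assembly-Data"):
    """Collect 'key : value' lines found between the START and END markers."""
    start_marker = f"##{name}-START##"
    end_marker = f"##{name}-END##"
    fields = {}
    inside = False
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if line == start_marker:
            inside = True
        elif line == end_marker:
            break
        elif inside and ":" in line:
            # Split on the first colon only, so values may contain colons.
            key, value = line.split(":", 1)
            fields[key.strip()] = value.strip()
    return fields


block = """\
##Genome-Assembly-Data-START##
Finishing Goal : High-Quality Draft
Current Finishing Status : High-Quality Draft
Assembly Method : Velvet v. 1.1.06
Genome Coverage : 83x
Sequencing Technology : Illumina
##Genome-Assembly-Data-END##
"""

assembly = parse_structured_comment(block)
print(assembly["Assembly Method"], assembly["Genome Coverage"])
```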
Download
GenBank: AXTH01.1.gbff.gz (3.7 Mb)
FASTA: AXTH01.1.fsa_nt.gz (1.4 Mb)
ASN.1: AXTH01.1.bbs.gz (2.8 Mb)
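Once the FASTA archive has been downloaded, its record count and total length could be compared with the figures in the record header. The sketch below uses only the Python standard library and assumes the file has already been saved locally; the function name `fasta_stats` and the local path in the usage note are hypothetical.

```python
import gzip


def fasta_stats(path):
    """Return (number_of_records, total_bases) for a gzip-compressed FASTA file."""
    records = 0
    bases = 0
    # "rt" opens the compressed file in text mode, one line at a time.
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                records += 1  # a header line starts a new sequence record
            else:
                bases += len(line)  # sequence lines contribute to the total
    return records, bases


# Usage, assuming the archive was saved in the working directory:
#   n_records, n_bases = fasta_stats("AXTH01.1.fsa_nt.gz")
#   print(n_records, n_bases)
```

If the archive holds the contig sequences, the two values would be expected to correspond to the contig count and total length listed at the top of the record.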