Bacterial Genome Submission Examples
Figure 1: Sample FASTA-formatted sequence
>HTE831 [organism=Oceanobacillus iheyensis] [strain=HTE831] actttcaaaaaaatcagcgtaaaaaacatactaatttgggcaaattcccacctgttttta gggacatttttctttgaattagagcctcagcagctcgtcattgctgaattttcttgaagt [etc.]
Figure 2: Sequin table format
>Feature HTE831 1830 2966 gene gene dnaN locus_tag OBB_0002 1830 2966 CDS product DNA-directed DNA polymerase III beta chain EC_number 2.7.7.7 protein_id gnl|ncbi|OBB_0002 3219 3440 gene locus_tag OBB_0003 3219 3440 CDS product hypothetical protein protein_id gnl|ncbi|OBB_0003 3443 4552 gene gene recF locus_tag OBB_0004 3443 4552 CDS product RecF function DNA repair and genetic recombination protein_id gnl|ncbi|OBB_0004 5109 7034 gene gene gyrB locus_tag OBB_0006 5109 7034 CDS product DNA gyrase subunit B EC_number 5.99.1.3 protein_id gnl|ncbi|OBB_0006 45081 44806 gene gene abrB locus_tag OBB_0045 45081 44806 CDS product AbrB protein_id gnl|ncbi|OBB_0045 function transcriptional pleiotropic regulator 64225 64758 gene locus_tag OBB_0064 64225 64758 CDS product stage V sporulation protein T function transcriptional regulator protein_id gnl|ncbi|OBB_0064 84524 85393 gene locus_tag OBB_0082 84524 85393 CDS product chaperonin product heat shock protein 33 protein_id gnl|ncbi|OBB_0082 89569 91050 gene locus_tag OBB_0088 89569 91050 CDS product lysine-tRNA ligase EC_number 6.1.1.6 protein_id gnl|ncbi|OBB_0088 91493 96462 operon operon rrnA 91493 93058 gene gene rrsA locus_tag OBB_0089 91493 93058 rRNA product 16S ribosomal RNA 93292 96213 gene gene rrlA locus_tag OBB_0090 93292 96213 rRNA product 23S ribosomal RNA 96347 96462 gene gene rrfA locus_tag OBB_0091 96347 96462 rRNA product 5S ribosomal RNA 96468 96744 operon operon trnC 96468 96543 gene gene trnV locus_tag OBB_0092 96468 96543 tRNA product tRNA-Val 96545 96620 gene gene trnT locus_tag OBB_0093 96545 96620 tRNA product tRNA-Thr 96669 96744 gene gene trnK locus_tag OBB_0094 96669 96744 tRNA product tRNA-Lys 1914923 1914066 gene gene folD locus_tag OBB_1880 1914923 1914066 CDS product bifunctional methylenetetrahydrofolate dehydrogenase (NADP+)/methenyltetrahydrofolate cyclohydrolase EC_number 1.5.1.5 EC_number 3.5.4.9 protein_id gnl|ncbi|OB1880
Figure 3: GenBank flatfile
LOCUS OB_HTE831 3630528 bp DNA circular BCT 11-DEC-2002
DEFINITION Oceanobacillus iheyensis HTE831, complete genome.
ACCESSION
VERSION
KEYWORDS .
SOURCE Oceanobacillus iheyensis HTE831
ORGANISM Oceanobacillus iheyensis HTE831
Bacteria; Firmicutes; Bacillales; Oceanobacillus.
REFERENCE 1 (bases 1 to 3630528)
AUTHORS Takami,H., Takaki,Y. and Uchiyama,I.
TITLE Genome sequence of Oceanobacillus iheyensis isolated from the Iheya
Ridge and its unexpected adaptive capabilities to extreme
environments
JOURNAL Nucleic Acids Res. 30 (18), 3927-3935 (2002)
PUBMED 12235376
REFERENCE 2 (bases 1 to 3630528)
AUTHORS Takami,H., Takaki,Y. and Chee,G.
TITLE Direct Submission
JOURNAL Submitted (26-DEC-2001) Hideto Takami, Japan Marine Science and
Technology Center, Deep-sea Microorganisms Research Group; 2-15
Natsushima-cho, Yokosuka, Kanagawa 237-0061, Japan
FEATURES Location/Qualifiers
source 1..3630528
/organism="Oceanobacillus iheyensis HTE831"
/strain="HTE831"
/db_xref="taxon:221109"
gene 1830..2966
/gene="dnaN"
/locus_tag="OBB_0002"
CDS 1830..2966
/gene="dnaN"
/locus_tag="OBB_0002"
/EC_number="2.7.7.7"
/codon_start=1
/transl_table=11
/product="DNA-directed DNA polymerase III beta chain"
/translation="MRFTIQRDKLINGVSNVMKAISARTVIPILTGMKIEVKNHGVTL
TGSDSDISIEYYIPIEEDGIVHVENIEEGTIILQAKYFPDIVRKLPESTVDIVVDDQL
NVRITSGKAEFNLNGQSAEEYPQLPKVQTENSFELPIDLLKSMIKQTVFAVSTMETRP
ILTGVNLKLVDNSLSFTATDSHRLARREIPVSNAPIEISQIVVPGKSLNELNKILGDS
EETVEISVTNNQILFRTKHLNFLSRLLDGNYPETSRLIPEQSKTKIQLKTKELLGTID
RASLLAKEERNNVVKFNAPGNSMIEISSNSPEVGNVVEEITADQMEGEDVKISFSSKY
MIDALKAIEYDEVQIEFTGAMRPFIIRPVGDDSILQLILPVRTY"
gene 3219..3440
/locus_tag="OBB_0003"
CDS 3219..3440
/locus_tag="OBB_0003"
/codon_start=1
/transl_table=11
/product="hypothetical protein"
/translation="MHEQIQIDTEYITLGQLIKLLNFLESGGMVKTFLQEEGALVNGH
LEQRRGRKLYPKDVVEIQGIGSYIVIKED"
gene 3443..4552
/gene="recF"
/locus_tag="OBB_0004"
CDS 3443..4552
/gene="recF"
/locus_tag="OBB_0004"
/function="DNA repair and genetic recombination"
/codon_start=1
/transl_table=11
/product="RecF"
/translation="MHIEKLELTNYRNYDQLEIAFDDQINVIIGENAQGKTNLMEAIY
VLSFARSHRTPREKELIQWDKDYAKIEGRITKRNQSIPLQISITSKGKKAKVNHLEQH
RLSDYIGSVNVVMFAPEDLTIVKGAPQIRRRFMDMELGQIQPTYIYHLAQYQKVLKQR
NHLLKQLQRKPNSDTTMLEVLTDQLIEHASILLERRFIYLELLRKWAQPIHRGISREL
EQLEIQYSPSIEVSEDANKEKIGNIYQMKFAEVKQKEIERGTTLAGPHRDDLIFFVNG
KDVQTYGSQGQQRTTALSIKLAEIELIYQEVGEYPILLLDDVLSELDDYRQSHLLNTI
QGKVQTFVSTTSVEGIHHETLQQAELFRVTDGVVN"
gene 5109..7034
/gene="gyrB"
/locus_tag="OBB_0006"
CDS 5109..7034
/gene="gyrB"
/locus_tag="OBB_0006"
/EC_number="5.99.1.3"
/codon_start=1
/transl_table=11
/product="DNA gyrase subunit B"
/translation="MSMEDKITENQEYGADQIQVLEGLEAVRKRPGMYIGSTSEKGLH
HLVWEIVDNSIDEALAGYCDHIQVVVEEDNSITVKDNGRGIPVDIQQKTGRPALEVIM
TVLHAGGKFGGGGYKVSGGLHGVGASVVNALSSELEVYVHRDGKVHFLSFKKGVPDGE
IKVIGDTDITGTVTHFRPDTEIFTETTEYNFDTLEQRLRELAFLNKGLKISIEDKRTD
REQVTYHYEGGISSYVEFINKNKEVLHEPFFAEGEDQGISVEVAIQYNDGFASNLYSF
ANNIHTYEGGSHEVGFRSGLTRIINDYAKKNGLIKDGDSNLSGDDVREGMTTIVSIKH
PDPQFEGQTKTKLGNSEVRAITDGVFSEAFSKFLYENPSTAKIIVEKGLMASRARLAA
KKARELTRRKSNLEISNLPGKLADCSSRDAAISELYIVEGDSAGGSAKSGRDRHFQAI
LPLRGKILNVEKARLDRILSNNEVRAMITALGSGVGEEFDISKARYHKIVIMTDADVD
GAHIRTLLLTFFYRYMRPLIEQGYIYIAQPPLYQVKQGKTVNYAYNDKELDRILNEIP
KAPKPNIQRYKGLGEMNADQLWDTTMDPDTRTLLQVELSDAIDADQVFDMLMGDKVEP
RRIFIEENAQYVKNLDI"
gene complement(44806..45081)
/gene="abrB"
/locus_tag="OBB_0045"
CDS complement(44806..45081)
/gene="abrB"
/locus_tag="OBB_0045"
/function="transcriptional pleiotropic regulator"
/codon_start=1
/transl_table=11
/product="AbrB"
/translation="MKSTGIVRKVDELGRVVIPIELRRTLDIHEKDTMEIYVDNDKIV
LKKYKPNMTCQVTGEVSDENLSIANGNLVLSPAGAQILLEEIQSRFK"
gene 64225..64758
/locus_tag="OBB_0064"
CDS 64225..64758
/locus_tag="OBB_0064"
/function="transcriptional regulator"
/codon_start=1
/transl_table=11
/product="stage V sporulation protein T"
/translation="MKATGIVRRIDDLGRVVVPKEIRRTLRIREGDPLEIFVDREGEV
ILKKYSPINELGHFAKEYAEALFQSLQTPVMITDRDDVIAVAGESKKEYLNKPISNAI
ADTIEGRSQVFEVDTKSMEIIDGQEQQLQSYCIHPVIANGDPIGCVLIFSKEEKLSKI
EQKAAETASTFLAKQME"
gene 84524..85393
/locus_tag="OBB_0082"
CDS 84524..85393
/locus_tag="OBB_0082"
/note="heat shock protein 33"
/codon_start=1
/transl_table=11
/product="chaperonin"
/translation="MKDYLIKATANNGKIRAYAVQSTNTIEEARRRQDTFATASAALG
RTITITAMMGAMLKGDDSITTKVMGNGPLGAIVADADADGHVRGYVTNPHVDFDLNDK
GKLDVARAVGTEGNISVIKDLGLKDFFTGETPIVSGEISEDFTYYYATSEQLPSAVGA
GVLVNPDHTILAAGGFIVQVMPGAEEEVINELEDQIQAIPAISSLIREGKSPEEILTQ
LFGEECLTIHEKMPIEFRCKCSKDRLAQAIIGLGNDEIQAMIEEDQGAEATCHFCNEK
YHFTEEELEDLKQ"
gene 89569..91050
/locus_tag="OBB_0088"
CDS 89569..91050
/locus_tag="OBB_0088"
/EC_number="6.1.1.6"
/codon_start=1
/transl_table=11
/product="lysine-tRNA ligase"
/translation="MSEELNEHMQVRRDKLAEHMEKGLDPFGGKFERSHQATDLIEKY
DSYSKEELEETTDEVTIAGRLMTKRGKGKAGFAHIQDLSGQIQLYVRKDMIGDDAYEV
FKSADLGDIVGVTGVMFKTNVGEISVKAKQFQLLTKSLRPLPEKYHGLKDIEQRYRQR
YLDLITNPDSRGTFVSRSKIIQSMREYLNGQGFLEVETPMMHSIPGGASARPFITHHN
ALDIELYMRIAIELHLKRLMVGGLEKVYEIGRVFRNEGVSTRHNPEFTMIELYEAYAD
YHDIMELTENLVAHIAKQVHGSTTITYGEHEINLEPKWTRLHIVDAVKDATGVDFWKE
VSDEEARALAKEHGVQVTESMSYGHVVNEFFEQKVEETLIQPTFIHGHPVEISPLAKK
NKEDERFTDRFELFIVGREHANAFSELNDPIDQRARFEAQVKERAEGNDEAHYMDEDF
LEALEYGMPPTGGLGIGVDRLVMLLTNSPSIRDVLLFPQMRTK"
operon 91493..96462
/operon="rrnA"
gene 91493..93058
/gene="rrsA"
/locus_tag="OBB_0089"
/operon="rrnA"
rRNA 91493..93058
/gene="rrsA"
/locus_tag="OBB_0089"
/operon="rrnA"
/product="16S ribosomal RNA"
gene 93292..96213
/gene="rrlA"
/locus_tag="OBB_0090"
/operon="rrnA"
rRNA 93292..96213
/gene="rrlA"
/locus_tag="OBB_0090"
/operon="rrn"
/product="23S ribosomal RNA"
gene 96347..96462
/gene="rrfA"
/locus_tag="OBB_0091"
/operon="rrnA"
>rRNA 96347..96462
/gene="rrfA"
/locus_tag="OBB_0091"
/operon="rrnA"
/product="5S ribosomal RNA"
operon 96468..96744
/operon="trnC"
gene 96468..96543
/gene="trnV"
/locus_tag="OBB_0092"
/operon="trnC"
tRNA 96468..96543
/gene="trnV"
/locus_tag="OBB_0092"
/operon="trnC"
/product="tRNA-Val"
gene 96545..96620
/gene="trnT"
/locus_tag="OBB_0093"
/operon="trnC"
tRNA 96545..96620
/gene="trnT"
/locus_tag="OBB_0093"
/operon="trnC"
/product="tRNA-Thr"
gene 96669..96744
/gene="trnK"
/locus_tag="OBB_0094"
/operon="trnC"
tRNA 96669..96744
/gene="trnK"
/locus_tag="OBB_0094"
/operon="trnC"
/product="tRNA-Lys"
gene complement(1914066..1914923)
/gene="folD"
/locus_tag="OBB_1880"
CDS complement(1914066..1914923)
/gene="folD"
/EC_number="1.5.1.5"
/EC_number="3.5.4.9"
/locus_tag="OBB_1880"
/codon_start=1
/transl_table=11
/product="bifunctional methylenetetrahydrofolate dehydrogenase (NADP+)/
methenyltetrahydrofolate cyclohydrolase"
/translation="MATLLNGKELSEELKQKMKIEVDELKEKGLTPHLTVILVGDNPA
SKSYVKGKEKACAVTGISSNLIELPENISQDELLQIIDEQNNDDSVHGILVQLPLPDQ
MDEQKIIHAISPAKDVDGFHPINVGKMMTGEDTFIPCTPYGILTMLRSKDISLEGKHA
VIIGRSNIVGKPIGLLLLQENATVTYTHSRTKNLQEITKQADILIVAIGRAHAINADY
IKEDAVVIDVGINRKDDGKLTGDVDFESAEQKASYITPVPRGVGPMTITMLLKNTIKA
AKGLNDVER"
BASE COUNT 1165552 a 648314 c 647106 g1169556 t
ORIGIN
1 actttcaaaa aaatcagcgt aaaaaacata ctaatttggg caaattccca cctgttttta
61 gggacatttt tctttgaatt agagcctcag cagctcgtca ttgctgaatt ttcttgaagt
For an additional examples see GenBank Accession Number CP000141.
Revised April 12, 2006