Optional defline information that may be included is:
Biological
- strain [strain=S288C]
- isolate [isolate=CWS1]
- chromosome [chromosome=XVI]
Other elements
- topology [topology=circular]
- location [location=mitochondrion]
- molecule [moltype=mRNA] (DNA is the default)
- technique [tech=wgs]
- protein name [protein=helicase] (if using -c)
- genetic code [gcode=4]
For a complete list of source modifiers click here.
See the Tax
Browser
for the genetic code values.
Example FASTA:
>Sc_16 [organism=Saccharomyces cerevisiae]
tataggcgaatcgagtatattattttttctcaacatatgtat
atgaacatgagaatatatttataggaatgtataaaattgtga
cctctcctgctattttagttactgattttatgtatgtagggg
gaataggggctgcctttcttaatgcagttttaattttttctt
ttaattttttcttagtaaaattatttaaagtaaagattaatg
gaataaccattgcgcttttttttacagtttttggtttttcat
tttttggaaaaaatattttaaatattttacctttttatttag
ggggtattttatatagtatctatacttcaacagatttttctg
aacatatagttcctattgctttttcaagtgcattagcccctt
ttgtaagcagtgttgctttttatggagaaatatcctatgaaa
catcatatataaatgcaattttaattggtattttaattggtt
ttatagtggttcctttgtctaaaagtctttatgactttcatg
agggatatgatttatataatttaggttttacagcaggtt
Feature table format (.tbl) |
Tbl2asn reads features from a five-column tab-delimited table called a Feature table .
The feature table specifies the location and type of each feature. Tbl2asn will
process the feature intervals and translate any CDSs into proteins.
The first line of the table should contain the following information:
>Features SeqID table_name
The SeqID must match the nucleotide sequence SeqID in the corresponding .fsa
file.
Example Feature Table:
>Feature Sc_16 Table1
69 543 gene
gene sde3p
69 543 CDS
product SDE3P
protein_id WS1030
Protein sequence format (.pep) |
- Set up as a FASTA file using the protein sequence.
- This file will substitute the automatically translated products of the CDS
features with the provided protein sequences.
- Serves as a check that the conceptual translation of the nucleotide sequence is
as predicted.
- SeqID must match protein_id in the .tbl file
Example FASTA:
>WS1030 [gene=sde3p] [protein=SDE3P]
MYKIVTSPAILVTDFMYVGGIGAAFLNAVLIFSFNFFL
VKLFKVKINGITIAAFFTVFGFSFFGKNILNILPFYLG
GILYSIYTSTDFSEHIVPIAFSSALAPFVSSVAFYGEI
SYETSYINAILIGILIGFIVVPLSKSLYDFHEGYDLYN
LGFTAG
Source table format (.src) |
For sets of sequences, a source modifier table can be placed in
a tab-delimited file with
a .src extension. The first column must be the sequence's SeqID. The first row
gives the names of
the source qualifiers being added, separated by tabs. Any additional rows list the
SeqID and
related source qualifiers for each sequence in the corresponding .fsa file.
SeqID organism strain isolate
Sc_16 Zea mays A69Y JH90.6-2x12
Quality scores table format (.qvl) |