|
The abbreviations, or gene symbols, by which a gene has been known often vary
over time. Nomenclature committees exist for a number of organisms. RefSeq
Collaborators include nomenclature committees for various genomes for
example, the Human
Gene Nomenclature Committee. The committee assign official symbols to
an organism's genes, and keeps a record of the alternate symbols
("aliases") by which the gene has been known.
Sequence records in archival databases such as
GenBank/EMBL/DDBJ do not use a controlled vocabulary, and do not
necessarily contain an official gene symbol (or in some cases, any gene symbol).
Instead, the records contain the information that was provided by the submitters
of the sequence records. Depending on whether, and when, submitters checked with
the nomenclature for their organism of study, their sequence record might
contain an old gene symbol, a current official gene symbol, or no gene symbol at
all but just the full spelling.
Therefore, if a user wants to do a comprehensive search of the sequence
databases for a gene of interest, it is often helpful to first identify the
various names by which the gene has been known. The user can then construct a
query that includes the various synonyms: full spelling of gene name, official
gene symbol, and aliases. A good place to look for that type of information
is the web site of the nomenclature committee for an organism of interest, or a
curated, gene-centered database such as Entrez
Gene.
|