|
The
Limits page of the Entrez CoreNucleotide database allows you to easily exclude certain categories
of sequence data, such as STSs, working draft high throughput
genomic sequences, TPAs, and patents. However, the Limits page does not allow you to
include, or retrieve only, those types of sequences. This exercise
shows how to do that.
While some users prefer to exclude patent sequences because they are often
short with little or no biological annotation, other users might specifically be
interested in them. To build your Entrez skills with a few
tips and tricks, several search methods are shown below. If you are short on
time, just do the first. The first three examples achieve the same search results
using different techniques. The fourth example adds a synonym to
the query, so it retrieves more records.
This exercise also deliberately focuses on patent sequences to prompt
a discussion, under additional tips, of
the limited scope of patent data in
GenBank/EMBL/DDBJ and the recommendation to consult external databases
specializing in patent sequences as needed. There is also a tip on how to search by patent number.
|