New features and datasets for the NCBI UniGene Database

Query Tips

Entrez query allows users to search for UniGene clusters. The user may look for UniGene clusters using terms derived from each of their constituent sequences. However, the summary information displayed with Entrez results might not contain the query term used. Queries can be made more specific by restricting the terms to specific fields.
General rules for searching Entrez apply.

Term Description
[ESTC] Number of ESTs in a cluster. Query using one value, or a a range of integers, for example, to retrieve UniGene clusters having only 1-10 EST sequences. 1:10[ESTC]
[MRNA] Number of sequences in cluster from GenBank divisions other than dbEST.
[SEQC] Total sequence count in cluster.
[TEXT] Text fields.
[CLUSTERID]  UniGene Cluster Identifier, usually but not always stable in successive UniGene builds, e.g., Hs.2[CLUSTERID].
[CHRO] Mapping from a variety of sources.
[CLON] cDNA clone identifier, as submitted in cDNA record. Note that the colon character is a range delimiter in all Entrez searches and must be surrounded by quote marks if a clone identifier includes a colon.
[GDSC] All words appearing in the description of a cluster corresponding to a gene with an assigned symbol, e.g., hypothetical[GDSC].
[GENE] Approved or provisional gene symbol, e.g., TTN[GENE].
[KYWD] UniGene-specific keywords including: dbSTS, LocusLink, MGC, RefSeq, ProtEST, HomoloGene, polyA, and end3prime.
[LIBR] dbEST library ID, e.g., 665[LIBR] .
[LID] Entrez Gene ID e.g., 1[LID].
[NACC] GenBank accession identifier of nucleotide sequence.
[NUID] GenBank sequence identifier of nucleotide sequence.
[PACC] The protein accession of any proteins corresponding to the cluster as provided by NCBI's ProtEST resource; restricted to eight organisms.
[PUID] The protein ID of any proteins corresponding to the cluster as provided by ProtEST.
[ORGN] Description of the organism or the NCBI Taxonomy ID of a species, e.g., Rodents[Organism].
[TEXT] Any word appearing in the description of any sequence within a cluster.
[TISS] Description of the Tissue type or Organ fields describing the cDNA library of origin of any clustered sequence.
[TITLE] Any word appearing in a cluster's title, taken from either gene descriptions, the description of a representative sequence from the cluster, or from protein similarities computed at NCBI.
Boolean operators AND, OR, NOT must be in uppercase, and "*" is used as a wild card.
Queries must not start with NOT, and wild cards must be used at the end, not within, the query string. Wild cards will only consider the first 150 matches to the string.

National Center for Biotechnology Information
U.S. National Library of Medicine
8600 Rockville Pike, Bethesda, MD 20894