Format of Sequence Record: User Question and Answer
Course Home Modules Schedule Exercises Comments Credits

Answers: NM_022817

(available live in Entrez)
Questions
  1. This record describes what kind of sequence? (DNA? mRNA? Protein?) mRNA. List several ways to find the answer. LOCUS line says mRNA and lists base pairs (bp). Record accession number is an "NM" record, in which the "M" stands for mRNA. Data at end of record is nucleotide data.

  2. This gene is involved in what processes and was sequenced from what organism? Involved in circadian rhythms (from COMMENT section). SOURCE ORGANISM is homo sapiens (human).

  3. What lab submitted the sequence record? (Trick question!) No lab submitted the sequence record. It is a reference sequence (RefSeq) record created and curated by NCBI staff. The sequence data were taken from AB002345, as well as DA226434 and AC012485, all of which are cited in the COMMENT field as being the source GenBank records. A source GenBank record is used to build a foundation for a RefSeq record. Then additional references and other annotations (such as the official gene symbol, additional biological features, summary comments, etc.) are added by NCBI staff.

  4. What is the gene symbol for this gene? PER2. Where in the record can you find it? The gene symbol, PER2, can be found in the LOCUS field and DEFINITION field. The gene symbol is also listed in the FEATURES table: /gene="PER2". (Note that this record is from the curated RefSeq database, and therefore includes the official gene symbol in the places noted above. An archival GenBank record might contain the official gene symbol, an older symbol, or no gene symbol at all. In addition, the gene symbol, when present in GenBank records, would not necessarily appear in the Locus and Definition fields. That is just the convention used for RefSeq records.)

  5. What is the base span of the coding sequence, and what is the accession number of the protein record in which the amino acid translation exists? The coding sequence (see CDS Feature) is from bp 238..4005. The associated protein record is NP_073728. The translation of the nucleotides into amino acids is listed under the CDS feature.


Sequence Record Format:
User Question
Return to Slides Revised 11/01/2007
Return to Circadian Rhythms Umbrella Page