Information Hubs

Gauge Your Current Familiarity with NCBI Resources: An Informal Pretest

 

Below is a sample human sequence, followed by twelve questions about it, to gauge your current familiarity with some of NCBI's bioinformatics resources. Answer the questions you can and don't worry about the others. By the end of the module, you should be able to answer all of the questions in 10-15 minutes. The questions apply some key concepts and skills covered in the 3-day introductory course; this Information Hubs module briefly summarizes those concepts and skills.

>gi|904118 Sample human sequence for NAWBIS/InfoHubs module
TGGGACAGGCAGCTCCGGGGTCCGCGGTTTCACATCGGAAACAAAACAGCGGCTGGTCTGGAAGGAACCT
GAGCTACGAGCCGCGGCGGCAGCGGGGCGGCGGGGAAGCGTATACCTAATCTGGGAGCCTGCAAGTGACA
ACAGCCTTTGCGGTCCTTAGACAGCTTGGCCTGGAGGAGAACACATGAAAGAAAGAACCTCAAGAGGCTT
TGTTTTCTGTGAAACAGTATTTCTATACAGTTGCTCCAATGACAGAGTTACCTGCACCGTTGTCCTACTT
CCAGAATGCACAGATGTCTGAGGACAACCACCTGAGCAATACTGTACGTAGCCAGAATGACAATAGAGAA
CGGCAGGAGCACAACGACAGACGGAGCCTTGGCCACCCTGAGCCATTATCTAATGGACGACCCCAGGGTA
ACTCCCGGCAGGTGGTGGAGCAAGATGAGGAAGAAGATGAGGAGCTGACATTGAAATATGGCGCCAAGCA
TGTGATCATGCTCTTTGTCCCTGTGACTCTCTGCATGGTGGTGGTCGTGGCTACCATTAAGTCAGTCAGC
TTTTATACCCGGAAGGATGGGCAGCTAATCTATACCCCATTCACAGAAGATACCGAGACTGTGGGCCAGA
GAGCCCTGCACTCAATTCTGAATGCTGCCATCATGATCAGTGTCATTGTTGTCATGACTATCCTCCTGGT
GGTTCTGTATAAATACAGGTGCTATAAGGTCATCCATGCCTGGCTTATTATATCATCTCTATTGTTGCTG
TTCTTTTTTTCATTCATTTACTTGGGGGAAGTGTTTAAAACCTATAACGTTGCTGTGGACTACATTACTG
TTGCACTCCTGATCTGGAATTTTGGTGTGGTGGGAATGATTTCCATTCACTGGAAAGGTCCACTTCGACT
CCAGCAGGCATATCTCATTATGATTAGTGCCCTCATGGCCCTGGTGTTTATCAAGTACCTCCCTGAATGG
ACTGCGTGGCTCATCTTGGCTGTGATTTCAGTATATGATTTAGTGGCTGTTTTGTGTCCGAAAGGTCCAC
TTCGTATGCTGGTTGAAACAGCTCAGGAGAGAAATGAAACGCTTTTTCCAGCTCTCATTTACTCCTCAAC
AATGGTGTGGTTGGTGAATATGGCAGAAGGAGACCCGGAAGCTCAAAGGAGAGTATCCAAAAATTCCAAG
TATAATGCAGAAAGCACAGAAAGGGAGTCACAAGACACTGTTGCAGAGAATGATGATGGCGGGTTCAGTG
AGGAATGGGAAGCCCAGAGGGACAGTCATCTAGGGCCTCATCGCTCTACACCTGAGTCACGAGCTGCTGT
CCAGGAACTTTCCAGCAGTATCCTCGCTGGTGAAGACCCAGAGGAAAGGGGAGTAAAACTTGGATTGGGA
GATTTCATTTTCTACAGTGTTCTGGTTGGTAAAGCCTCAGCAACAGCCAGTGGAGACTGGAACACAACCA
TAGCCTGTTTCGTAGCCATATTAATTGGTTTGTGCCTTACATTATTACTCCTTGCCATTTTCAAGAAAGC
ATTGCCAGCTCTTCCAATCTCCATCACCTTTGGGCTTGTTTTCTACTTTGCCACAGATTATCTTGTACAG
CCTTTTATGGACCAATTAGCATTCCATCAATTTTATATCTAGCATATTTGCGGTTAGAATCCCATGGATG
TTTCTTCTTTGACTATAACCAAATCTGGGGAGGACAAAGGTGATTTTCCTGTGTCCACATCTAACAAAGT
CAAGATTCCCGGCTGGACTTTTGCAGCTTCCTTCCAAGTCTTCCTGACCACCTTGCACTATTGGACTTTG
GAAGGAGGTGCCTATAGAAAACGATTTTGAACATACTTCATCGCAGTGGACTGTGTCCCTCGGTGCAGAA
ACTACCAGATTTGAGGGACGAGGTCAAGGAGATATGATAGGCCCGGAAGTTGCTGTGCCCCATCAGCAGC
TTGACGCGTGGTCACAGGACGATTTCACTGACACTGCGAACTCTCAGGACTACCGGTTACCAAGAGGTTA
GGTGAAGTGGTTTAAACCAAACGGAACTCTTCATCTTAAACTACACGTTGAAAATCAACCCAATAATTCT
GTATTAACTGAATTCTGAACTTTTCAGGAGGTACTGTGAGGAAGAGCAGGCACCAGCAGCAGAATGGGGA
ATGGAGAGGTGGGCAGGGGTTCCAGCTTCCCTTTGATTTTTTGCTGCAGACTCATCCTTTTTAAATGAGA
CTTGTTTTCCCCTCTCTTTGAGTCAAGTCAAATATGTAGATTGCCTTTGGCAATTCTTCTTCTCAAGCAC
TGACACTCATTACCGTCTGTGATTGCCATTTCTTCCCAAGGCCAGTCTGAACCTGAGGTTGCTTTATCCT
AAAAGTTTTAACCTCAGGTTCCAAATTCAGTAAATTTTGGAAACAGTACAGCTATTTCTCATCAATTCTC
TATCATGTTGAAGTCAAATTTGGATTTTCCACCAAATTCTGAATTTGTAGACATACTTGTACGCTCACTT
GCCCCCAGATGCCTCCTCTGTCCTCATTCTTCTCTCCCACACAAGCAGTCTTTTTCTACAGCCAGTAAGG
CAGCTCTGTCRTGGTAGCAGATGGTCCCATTATTCTAGGGTCTTACTCTTTGTATGATGAAAAGAATGTG
TTATGAATCGGTGCTGTCAGCCCTGCTGTCAGACCTTCTTCCACAGCAAATGAGATGTATGCCCAAAGCG
GTAGAATTAAAGAAGAGTAAAATGGCTGTTGAAGC
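
Before taking a sequence like the one above to a database search, it can help to confirm what the record contains. The following is a minimal Python sketch, not part of the course materials, that reads a single FASTA record and reports its length, GC fraction, and any characters other than A, C, G, and T (nucleotide records may carry IUPAC ambiguity codes such as R, meaning A or G). The function names `parse_fasta` and `summarize` and the toy record are hypothetical, chosen only for illustration.

```python
from collections import Counter


def parse_fasta(text):
    """Return (header, sequence) for a string holding one FASTA record."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    header = lines[0][1:]                   # drop the leading '>'
    sequence = "".join(lines[1:]).upper()   # join the wrapped sequence lines
    return header, sequence


def summarize(sequence):
    """Report length, GC fraction, and counts of non-ACGT characters."""
    counts = Counter(sequence)
    ambiguous = {base: n for base, n in counts.items() if base not in "ACGT"}
    gc = (counts["G"] + counts["C"]) / len(sequence) if sequence else 0.0
    return {"length": len(sequence), "gc_fraction": gc, "ambiguous": ambiguous}


# Toy record (hypothetical, not the course sequence) to show the call pattern.
example = """>toy_record hypothetical example
ACGTRA
CGGT
"""

header, seq = parse_fasta(example)
print(header)
print(summarize(seq))
```

To apply the sketch to the sample sequence, paste the full record, including its `>` header line, into the `example` string. The summary only describes the record itself; answering the questions below still requires the NCBI resources covered in the module.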
 

 

Questions:

  1. This is the sequence data from what human gene?
  2. Did the data come from an archival (primary) or a curated (derivative) database? How can you tell?
  3. If it is archival, how can you find a curated mRNA record for this human gene, or vice versa?
  4. What is the official gene symbol? By what other gene symbols has it been known?
  5. What is the location of this gene on a cytogenetic map, and what is its bp location on the assembled human genome sequence?
  6. How many transcript variants is it known to have?
  7. On what mouse and rat chromosomes are the homologs found?
  8. What phenotypes are associated with this gene?
  9. How many allelic variants are documented in OMIM for this gene?
  10. Name a clinical laboratory in the United States that offers genetic testing for one of the phenotypes.
  11. Does NCBI offer a software tool for making a restriction map of the query sequence? How can you find out?
  12. If a user has the genomic DNA for the gene and wants to identify transcription factor binding sites, what are some databases and/or software tools that could potentially be useful? How/where did you find these?

Additional (optional) questions:

  1. What conserved domains exist in the protein product?
  2. From what distributor can you obtain a clone for the full length mRNA?
  3. How can you download the genomic sequence of the gene together with 3 kb of upstream sequence?
  4. From what source can you obtain the genomic DNA clone for the chromosome region that contains the gene?
 

Revised 08/03/2006