Workshop II

 Course Main Page

 Workshop II Exercises

 Amino Acid Explorer




   Finding a structural template for a gene product

Launch guide window Launch NCBI Home Page

Goals

  • Find protein sequences associated with a given gene
  • Find the most closely related structure to these proteins
  • Use the structure and CD to explore a SNP known to cause a disease phenotype
  • Import and align a transcript variant to the template alignment.

Steps

  1. Retrieve a gene record using Entrez
    • From the NCBI home page, search for "LHX3[sym] AND human[orgn]".
    • Click on the Gene hits to view them.
  2. Review the gene record
    • Scroll down to graphic under Genomic regions, transcripts, and products. How many proteins is this gene expected to produce?
    • Make a note of the NP accession numbers of these proteins.

  3. Load a related OMIM record describing a disease phenotype
    • Follow the link to OMIM in the Links menu.
    • Click on the accession of the record labeled "LIM Homeobox Gene 3"

  4. Find a SNP in LHX3 resulting in a disease phenotype
    • Click on "Allelic Variants" in the left bar of the page.
    • Concentrate on variant .0001, Y116C.
    • What are the clinical consequences of this SNP?

  5. Find the SNP site in an annotated protein sequence
    • Scroll up to the top of the OMIM page, and follow the link to Gene in the Links menu.
    • Click on the accession of NP_055379 (isoform b) to the right of the graphic under Genomic regions, transcripts, and proteins.
    • Open the Genpept format and confirm that the sequence has a tyrosine at position 116 (Y116).

  6. Find a structural template for the region of the protein including the SNP
    • Click 'Related Structure' in the Links menu in the upper right of the protein record.
    • Use the ruler above the graphic to find the best-matching structures that contain a sequence match at the position of the SNP (116).

  7. Locate the SNP position in the sequence-similar structure
    • Click on the red bar representing a best-matching structure containing position 116.
    • Find the SNP position in the text alignment. The green numbers indicate residue positions in each sequence. What position is the SNP in the structure record?

  8. Build a VAST alignment
    • Click on the MMDB link (to the right of Reference) in the top section of the page.
    • On the structure summary page, click anywhere in the blue bar (domain 2) that contains the SNP position.
    • In the row to the right of the List button, change Graphics to Table, and click List.

  9. View the VAST alignment
    • Check the boxes to the left of the top three structure neighbors.
    • Click View 3D Alignment to launch Cn3D.

  10. Import NP_055379
    • In the alignment window, choose Edit / Enable Editor.
    • Choose Imports / Show Imports.
    • In the Imports window, choose Edit / Import Sequences.
    • Choose Network via accession and click OK.
    • Type NP_055379 in the box and click OK.

  11. Align NP_055379
    • Choose Algorithms / BLAST/PSSM and click anywhere on the pair of sequences.
    • If there are red-shaded regions, try the Block Aligner. Otherwise, skip the next two steps.
    • Choose Algorithms / Block Align Single and click on the sequences.
    • In the dialog, uncheck Global Alignment, then click OK.
    • Choose Alignments / Merge All to merge your new alignment.
    • Close the Imports window.

  12. Find the SNP position in the alignment
    • Find the SNP position in NP_055379 (the bottom row) in the alignment window by moving your mouse across the sequence and monitoring the location in the lower left corner of the alignment window.
    • In the alignment window, choose Mouse Mode / Select Columns.
    • Click on the alignment column at the SNP position.

  13. Locate the SNP position in the structure
    • Locate the wild type residue in the structure. What metal binding site is it near?
    • Clear your highlighting by clicking in whitespace in the alignment window.
    • Discover what amino acids contact the metal ion by double-clicking the metal ion from any of the structures and searching by distance with a radius of 4 Angstroms. What residues are they?

  14. Investigate the possible changes caused by the mutation
    • Click here to launch the Amino Acid Explorer.
    • Using the Compare menus in the left bar of the page, compare the wild type (Tyr) and mutant (Cys) residues using Text.
    • What are the significant differences between these residues?
    • Compare them using Graphics to assess the relative sizes and shapes of the side chains.

  15. Find residues in contact with the SNP position
    • Return to Cn3D.
    • Again select the column at the SNP position as you did before.
    • In the structure window, choose Show/Hide / Select by Distance / Residues Only.
    • Set the radius to 3.0 Angstroms.

  16. Predict the consequences of the SNP
    • Are any of the residues that contact the ligand within 3.0 Angstroms of the SNP position?
    • Knowing what the mutant residue is, what additional interactions might occur in the mutant protein? What might be the biochemical and biological consequences of these interactions?

Revised January 7, 2008