3MZV: Crystal structure of a decaprenyl diphosphate synthase from Rhodobacter capsulatus

The number of available protein sequences has increased exponentially with the advent of high-throughput genomic sequencing, creating a significant challenge for functional annotation. Here, we describe a large-scale study on assigning function to unknown members of the trans-polyprenyl transferase (E-PTS) subgroup in the isoprenoid synthase superfamily, which provides substrates for the biosynthesis of the more than 55,000 isoprenoid metabolites. Although the mechanism for determining the product chain length for these enzymes is known, there is no simple relationship between function and primary sequence, so that assigning function is challenging. We addressed this challenge through large-scale bioinformatics analysis of >5,000 putative polyprenyl transferases; experimental characterization of the chain-length specificity of 79 diverse members of this group; determination of 27 structures of 19 of these enzymes, including seven cocrystallized with substrate analogs or products; and the development and successful application of a computational approach to predict function that leverages available structural data through homology modeling and docking of possible products into the active site. The crystallographic structures and computational structural models of the enzyme-ligand complexes elucidate the structural basis of specificity. As a result of this study, the percentage of E-PTS sequences similar to functionally annotated ones (BLAST e-value </= 1e(-70)) increased from 40.6 to 68.8%, and the percentage of sequences similar to available crystal structures increased from 28.9 to 47.4%. The high accuracy of our blind prediction of newly characterized enzymes indicates the potential to predict function to the complete polyprenyl transferase subgroup of the isoprenoid synthase superfamily computationally.
PDB ID: 3MZVDownload
MMDB ID: 82670
PDB Deposition Date: 2010/5/13
Updated in MMDB: 2017/11
Experimental Method:
x-ray diffraction
Resolution: 1.9  Å
Source Organism:
Similar Structures:
Biological Unit for 3MZV: dimeric; determined by author and by software (PISA)
Molecular Components in 3MZV
Label Count Molecule
Proteins (2 molecules)
Decaprenyl Diphosphate Synthase
Molecule annotation
* Click molecule labels to explore molecular sequence information.

Citing MMDB