Send to

Choose Destination
Protein Sci. 2004 Sep;13(9):2559-63. Epub 2004 Aug 4.

Delineation and analysis of the conceptual data model implied by the "IUPAC Recommendations for Biochemical Nomenclature".

Author information

Department of Engineering and Science, Rensselaer at Hartford, Hartford, Connecticut 06120, USA.


Computational analysis of the bonding, geometric, and topological relationships within proteins typically takes on the order of hours, mainly devoted to the writing of scripts and code to correctly parse the data. The Structured Query Language (SQL) built into modern database management systems eliminates the need for data parsing, effectively reducing the analysis time to seconds. To this end, we have formulated a conceptual data model (CDM) for proteins based on the IUPAC recommendations for biochemical nomenclature. This conceptual data model makes explicit the inherent bonding relationships between the atoms of a protein, as well as the geometric (bond angle and torsion angle) and topological (chirality) relationships between the bonds. The validity of the CDM has been tested with a reduced implementation using commercial database software. The ease in both populating the database with data from the Protein Data Bank and formulating/executing queries supports the correctness of the model. The ability to conduct truly interactive analyses of protein structure is essential to fully capitalize on the explosion in postgenomic protein structure data.

[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Wiley Icon for PubMed Central
Loading ...
Support Center