ProML--the protein markup language for specification of protein sequences, structures and families

In Silico Biol. 2002;2(3):313-24.

Abstract

We propose ProML, a specification language for protein sequences, structures, and families based on the open XML standard. The language allows a portable, system-independent, machine-parsable, and human-readable representation of essential features of proteins. It is of immediate use in several bioinformatics applications: we discuss clustering proteins into families and representing the specific features shared within each cluster. Moreover, we use ProML to specify data used in fold recognition benchmarks that exploit experimentally derived distance constraints.
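To illustrate what a machine-parsable, XML-based protein record can look like in practice, the following is a minimal sketch in Python. The abstract does not give the ProML schema, so every element and attribute name here (protein, sequence, family, constraint, i, j, maxDistance) and the helper parse_protein are hypothetical and chosen only for illustration; the sequence and distance values are made up.

```python
import xml.etree.ElementTree as ET

# Hypothetical record: the element and attribute names are illustrative
# assumptions, NOT taken from the actual ProML definition.
RECORD = """\
<protein id="example1">
  <sequence>MKTAYIAKQR</sequence>
  <family name="exampleFamily"/>
  <constraint i="2" j="9" maxDistance="8.5"/>
</protein>
"""


def parse_protein(xml_text):
    """Parse one hypothetical protein record into a plain dictionary."""
    root = ET.fromstring(xml_text)
    family = root.find("family")
    # Each constraint is read as (residue i, residue j, maximum distance).
    constraints = [
        (int(c.get("i")), int(c.get("j")), float(c.get("maxDistance")))
        for c in root.findall("constraint")
    ]
    return {
        "id": root.get("id"),
        "sequence": root.findtext("sequence", default="").strip(),
        "family": family.get("name") if family is not None else None,
        "constraints": constraints,
    }


protein = parse_protein(RECORD)
print(protein)
```

Because the record is ordinary XML, any standard XML parser can read it on any platform, which is the portability property the abstract emphasizes; a real ProML document would follow the authors' own definition rather than these placeholder names.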

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Programming Languages*
  • Protein Conformation
  • Proteins / chemistry*

Substances

  • Proteins