BMC Bioinformatics. 2009 Jun 9;10:174. doi: 10.1186/1471-2105-10-174.

Combining specificity determining and conserved residues improves functional site prediction.

Author information

EMBL Heidelberg, Heidelberg, Germany. kalinina@embl.de

Abstract

BACKGROUND:

Predicting the location of functionally important sites from protein sequence and/or structure is a long-standing problem in computational biology. Most current approaches make use of sequence conservation, assuming that amino acid residues conserved within a protein family are most likely to be functionally important. Most often these approaches do not consider many residues that act to define specific sub-functions within a family, or they make no distinction between residues important for function and those more relevant for maintaining structure (e.g. in the hydrophobic core). Many protein families bind and/or act on a variety of ligands, meaning that conserved residues often only bind a common ligand sub-structure or perform general catalytic activities.

RESULTS:

Here we present a novel method for functional site prediction based on identification of conserved positions, as well as those responsible for determining ligand specificity. We define Specificity-Determining Positions (SDPs) as those occupied by residues that are conserved within sub-groups of proteins in a family having a common specificity, but that differ between groups, and are thus likely to account for specific recognition events. We benchmark the approach on enzyme families of known 3D structure with bound substrates, and find that in nearly all families residues predicted by SDPsite are in contact with the bound substrate, and that the addition of SDPs significantly improves functional site prediction accuracy. We apply SDPsite to various families of proteins containing known three-dimensional structures, but lacking clear functional annotations, and discuss several illustrative examples.
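The distinction between conserved positions and SDPs can be illustrated with a minimal sketch. This is not the SDPsite algorithm, whose scoring is not described in this abstract; it is a naive rule applied to a toy alignment, and the function names, the `min_frac` threshold, and the example sequences are all assumptions made for illustration. A column is called conserved if every specificity group agrees on the same residue, and an SDP if each group is internally conserved but the groups do not all share the same residue.

```python
from collections import Counter


def column_consensus(seqs, pos, min_frac=1.0):
    """Return the dominant residue at `pos` if it covers at least
    `min_frac` of the sequences; otherwise return None.
    Gap-dominated columns are treated as not conserved."""
    counts = Counter(s[pos] for s in seqs)
    residue, n = counts.most_common(1)[0]
    if residue == "-" or n / len(seqs) < min_frac:
        return None
    return residue


def classify_positions(groups, min_frac=1.0):
    """`groups` maps a specificity-group name to a list of aligned,
    equal-length sequences (hypothetical input format).
    Returns (conserved_positions, sdp_positions) as 0-based indices."""
    length = len(next(iter(groups.values()))[0])
    conserved, sdps = [], []
    for pos in range(length):
        consensus = [column_consensus(seqs, pos, min_frac)
                     for seqs in groups.values()]
        if None in consensus:
            continue  # variable within at least one group: neither class
        if len(set(consensus)) == 1:
            conserved.append(pos)  # same residue across the whole family
        else:
            sdps.append(pos)  # conserved within groups, differs between them
    return conserved, sdps


# Toy alignment: two made-up specificity groups of three sequences each.
toy = {
    "group_A": ["GKDLA", "GKDLS", "GKDLT"],
    "group_B": ["GRDVA", "GRDVC", "GRDVA"],
}
conserved, sdps = classify_positions(toy)
print("conserved:", conserved, "SDPs:", sdps)
```

A real method would replace the strict identity rule with a statistical score and would need the grouping by specificity as input or infer it; the sketch only shows why the two classes of position carry different information.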

CONCLUSION:

The results suggest a better means to predict functional details for the thousands of protein structures determined prior to a clear understanding of molecular function.

PMID: 19508719
PMCID: PMC2709924
DOI: 10.1186/1471-2105-10-174
[Indexed for MEDLINE]