A perturbation-based method for calculating explicit likelihood of evolutionary co-variance in multiple sequence alignments

Bioinformatics. 2004 Jul 10;20(10):1565-72. doi: 10.1093/bioinformatics/bth128. Epub 2004 Feb 12.

Abstract

Motivation: The constituent amino acids of a protein work together to define its structure and to facilitate its function. Their interdependence should be apparent in the evolutionary record of each protein family: positions in the sequence of a protein family that are intimately associated in space or in function should co-vary in evolution. A recent approach by Ranganathan and colleagues proposes to look at subsets of a protein family, selected for their sequence at one position, to see how this affects variation at other positions.

Results: We present a quantitative algorithm for assessing covariation with this approach, based on explicit likelihood calculations. By applying our algorithm to 138 Pfam families with at least one member of known structure, we demonstrate that our method has improved power in finding physically close residues in crystal structures, compared to that of Ranganathan and colleagues.

Supplementary information: www.afodor.net/bioinfosup.html

Publication types

  • Comparative Study
  • Evaluation Study
  • Validation Study

MeSH terms

  • Algorithms*
  • Computer Simulation
  • Evolution, Molecular*
  • Genetic Variation
  • Likelihood Functions
  • Models, Genetic
  • Models, Statistical
  • Regression Analysis
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein / methods*