Format

Send to

Choose Destination
Bioinformatics. 2002;18 Suppl 2:S44-53.

A new distance measure for comparing sequence profiles based on path lengths along an entropy surface.

Author information

1
Department of Biomathematical Sciences, Mount Sinai School of Medicine, New York, USA.

Abstract

We describe a new distance measure for comparing DNA sequence profiles. For this measure, columns in a multiple alignment are treated as character frequency vectors (sum of the frequencies equal to one). The distance between two vectors is based on minimum path length along an entropy surface. Path length is estimated using a random graph generated on the entropy surface and Dijkstra's algorithm for all shortest paths to a source. We use the new distance measure to analyze similarities within familes of tandem repeats in the C. elegans genome and show that this new measure gives more accurate refinement of family relationships than a method based on comparing consensus sequences.

PMID:
12385982
[Indexed for MEDLINE]

Supplemental Content

Loading ...
Support Center