Format

Send to

Choose Destination
Bioinformatics. 2008 Dec 15;24(24):2926-7. doi: 10.1093/bioinformatics/btn547. Epub 2008 Oct 20.

Effective cluster-based seed design for cross-species sequence comparisons.

Author information

1
Department of Computer Science, George Washington University, Washington, DC 20052, USA. lmzhou@gwmail.gwu.edu

Abstract

SUMMARY:

To annotate newly sequenced organisms, cross-species sequence comparison algorithms can be applied to align gene sequences to the genome of a related species. To improve the accuracy of alignment, spaced seeds must be optimized for each comparison. As the number and diversity of genomes increase, an efficient alternative is to cluster pairwise comparisons into groups and identify seeds for groups instead of individual comparisons. Here we investigate a measure of comparison closeness and identify classes of comparisons that show similar seed behavior and therefore can employ the same seed.

AVAILABILITY:

Source code is freely available at http://dna.cs.gwu.edu and from Bioinformatics online.

PMID:
18940827
DOI:
10.1093/bioinformatics/btn547
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Silverchair Information Systems
Loading ...
Support Center