MotifCluster: an interactive online tool for clustering and visualizing sequences using shared motifs.
Department of Computer Science, University of Colorado, Boulder, CO 80309, USA.
MotifCluster finds related motifs in a set of sequences and clusters the sequences into families based on the motifs they contain. The tool, available at http://bmf.colorado.edu/motifcluster, lets users test whether proteins are related, cluster sequences by shared conserved motifs, and visualize motifs mapped onto trees, sequences and three-dimensional structures. We demonstrate MotifCluster's accuracy on gold-standard protein superfamilies: with the recommended settings, families were assigned to the correct superfamilies with 0.17% false-positive and no false-negative assignments.
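
To make the idea of clustering sequences by shared motifs concrete, the following is a minimal illustrative sketch and not MotifCluster's actual algorithm, which the abstract does not specify. It assumes each sequence has already been reduced to a set of motif identifiers, and it swaps in a simple stand-in technique: Jaccard similarity between motif sets followed by single-linkage grouping. The function names, the 0.5 threshold, and the example data are all hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two motif sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_by_shared_motifs(motifs_by_seq, threshold=0.5):
    """Group sequences whose motif sets overlap by at least `threshold`.

    Hypothetical helper: single-linkage clustering via union-find,
    used here only as a stand-in for the tool's real method.
    """
    names = list(motifs_by_seq)
    parent = {n: n for n in names}

    def find(x):
        # Follow parent links to the cluster representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge the clusters of every sufficiently similar pair.
    for a, b in combinations(names, 2):
        if jaccard(motifs_by_seq[a], motifs_by_seq[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return sorted(sorted(g) for g in groups.values())


# Made-up example data: sequence name -> set of motif identifiers.
example = {
    "seqA": {"m1", "m2", "m3"},
    "seqB": {"m1", "m2"},
    "seqC": {"m7", "m8"},
    "seqD": {"m7", "m8", "m9"},
    "seqE": {"m4"},
}
clusters = cluster_by_shared_motifs(example)
print(clusters)
```

In this toy input, seqA and seqB share two of three motifs, as do seqC and seqD, so each pair forms a family, while seqE shares no motifs and remains a singleton.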
PMID: 18706079 [PubMed - indexed for MEDLINE]
PMCID: PMC2575518