Display Settings:

Format

Send to:

Choose Destination
    Bioinformatics. 2011 Dec 15;27(24):3348-55. Epub 2011 Oct 8.

    KABOOM! A new suffix array based algorithm for clustering expression data.

    Source

    Wits Bioinformatics, School of Electrical and Information Engineering, University of the Witwatersrand, Johannesburg, Private Bag 3, 2050 Wits, South Africa. scott.hazelhurst@wits.ac.za

    Abstract

    MOTIVATION:

    Second-generation sequencing technology has reinvigorated research using expression data, and clustering such data remains a significant challenge, with much larger datasets and with different error profiles. Algorithms that rely on all-versus-all comparison of sequences are not practical for large datasets.

    RESULTS:

    We introduce a new filter for string similarity which has the potential to eliminate the need for all-versus-all comparison in clustering of expression data and other similar tasks. Our filter is based on multiple long exact matches between the two strings, with the additional constraint that these matches must be sufficiently far apart. We give details of its efficient implementation using modified suffix arrays. We demonstrate its efficiency by presenting our new expression clustering tool, wcd-express, which uses this heuristic. We compare it to other current tools and show that it is very competitive both with respect to quality and run time.

    AVAILABILITY:

    Source code and binaries available under GPL at http://code.google.com/p/wcdest. Runs on Linux and MacOS X.

    CONTACT:

    scott.hazelhurst@wits.ac.za; zsuzsa@cebitec.uni-bielefeld.de

    SUPPLEMENTARY INFORMATION:

    Supplementary data are available at Bioinformatics online.

    PMID:
    21984769
    [PubMed - indexed for MEDLINE]

      Supplemental Content

      Icon for HighWire Press

      Save items

      loading

      Recent activity

      Your browsing activity is empty.

      Activity recording is turned off.

      Turn recording back on

      See more...
      Write to the Help Desk