    Nucleic Acids Res. 2005 Sep 2;33(15):4899-913. Print 2005.

    Limitations and potentials of current motif discovery algorithms.

    Source

    Department of Biological Sciences, College of Science, Purdue University, West Lafayette, IN 47907, USA.

    Abstract

    Computational methods for de novo identification of gene regulation elements, such as transcription factor binding sites, have proved to be useful for deciphering genetic regulatory networks. However, despite the availability of a large number of algorithms, their strengths and weaknesses are not sufficiently understood. Here, we designed a comprehensive set of performance measures and benchmarked five modern sequence-based motif discovery algorithms using large datasets generated from Escherichia coli RegulonDB. Factors that affect the prediction accuracy, scalability and reliability are characterized. It is revealed that the nucleotide and the binding site level accuracy are very low, while the motif level accuracy is relatively high, which indicates that the algorithms can usually capture at least one correct motif in an input sequence. To exploit diverse predictions from multiple runs of one or more algorithms, a consensus ensemble algorithm has been developed, which achieved 6-45% improvement over the base algorithms by increasing both the sensitivity and specificity. Our study illustrates limitations and potentials of existing sequence-based motif discovery algorithms. Taking advantage of the revealed potentials, several promising directions for further improvements are discussed. Since the sequence-based algorithms are the baseline of most of the modern motif discovery algorithms, this paper suggests substantial improvements would be possible for them.
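The abstract contrasts accuracy at the nucleotide, binding site and motif levels without defining the measures. As a rough illustration only, the sketch below computes nucleotide-level sensitivity and positive predictive value for one sequence, using the common definitions TP/(TP+FN) and TP/(TP+FP). These definitions, the half-open `(start, end)` interval convention and the function name `nucleotide_level_scores` are assumptions made for this example, not the paper's stated measures.

```python
def nucleotide_level_scores(true_sites, predicted_sites):
    """Return (sensitivity, ppv) over nucleotide positions of one sequence.

    Sites are (start, end) half-open intervals. This is a hypothetical
    helper for illustration; the paper's exact measures may differ.
    """
    # Expand annotated and predicted sites into sets of covered positions.
    true_pos = set()
    for start, end in true_sites:
        true_pos.update(range(start, end))
    pred_pos = set()
    for start, end in predicted_sites:
        pred_pos.update(range(start, end))

    tp = len(true_pos & pred_pos)   # annotated and predicted
    fn = len(true_pos - pred_pos)   # annotated but missed
    fp = len(pred_pos - true_pos)   # predicted but not annotated

    # Guard against empty denominators (no annotated or no predicted positions).
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    ppv = tp / (tp + fp) if (tp + fp) else 0.0
    return sensitivity, ppv
```

Under these assumptions, a prediction that overlaps a true site by only half its length counts as finding the site in a lenient sense, yet scores only 0.5 on both nucleotide-level measures. That gap is one way nucleotide-level accuracy can be low while motif-level accuracy stays comparatively high.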

    PMID: 16284194 [PubMed - indexed for MEDLINE]
    PMCID: PMC1199555 (free PMC article)
