Genome Res. 2008 Feb;18(2):310-23. Epub 2007 Dec 20.

Generic eukaryotic core promoter prediction using structural features of DNA.

Author information

  • Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), 9052 Gent, Belgium

Abstract

Despite many recent efforts, in silico identification of promoter regions is still in its infancy. However, the accurate identification and delineation of promoter regions is important for several reasons, such as improving genome annotation and devising experiments to study and understand transcriptional regulation. Current methods to identify the core region of promoters require large amounts of high-quality training data and often behave like black box models that output predictions that are difficult to interpret. Here, we present a novel approach for predicting promoters in whole-genome sequences by using large-scale structural properties of DNA. Our technique requires no training, is applicable to many eukaryotic genomes, and performs extremely well in comparison with the best available promoter prediction programs. Moreover, it is fast, simple in design, and has no size constraints, and the results are easily interpretable. We compared our approach with 14 current state-of-the-art implementations using human gene and transcription start site data and analyzed the ENCODE region in more detail. We also validated our method on 12 additional eukaryotic genomes, including vertebrates, invertebrates, plants, fungi, and protists.

PMID: 18096745 [PubMed - indexed for MEDLINE]
PMCID: PMC2203629 (free PMC article)
