Display Settings:

Items per page
We are sorry, but NCBI web applications do not support your browser and may not function properly. More information

Results: 6

1.
Figure 6

Figure 6. Distribution of phyloP scores in lowest and highest peaks.. From: A ChIP-Seq Benchmark Shows That Sequence Conservation Mainly Improves Detection of Strong Transcription Factor Binding Sites.

Boxplot showing for each TF the averaged phyloP scores in promoter peak regions on lowest peaks ( percentile), and highest peaks ( percentile). The higher peaks generally show higher sequence conservation across genomes.

Tony Håndstad, et al. PLoS One. 2011;6(4):e18430.
2.
Figure 5

Figure 5. ROC scores for PWM and BBLS PWM on low and high peaks.. From: A ChIP-Seq Benchmark Shows That Sequence Conservation Mainly Improves Detection of Strong Transcription Factor Binding Sites.

ROC scores on each TF promoter dataset for PWM and BBLS PWM methods on the lowest peaks ( percentile), and highest peaks ( percentile). The difference between PWM and the conservation-based BBLS PWM method is generally greater, and in favor of BBLS PWM, on the higher peaks more than the lower peaks.

Tony Håndstad, et al. PLoS One. 2011;6(4):e18430.
3.
Figure 4

Figure 4. Max PWM score and phyloP values correlate with center of peak regions.. From: A ChIP-Seq Benchmark Shows That Sequence Conservation Mainly Improves Detection of Strong Transcription Factor Binding Sites.

The figures show a region of 500 bp surrounding each peak region. On the left is shown for each of the 500 positions the number of times that position has the maximum PWM score in the 500 bp region. On the right is the average phyloP score. The grey lines show the average peak width. Both max PWM score and higher phyloP values tend to be clustered in the center of the peak regions, but the clustering varies for each TF.

Tony Håndstad, et al. PLoS One. 2011;6(4):e18430.
4.
Figure 1

Figure 1. Defining positive and negative regions for the site benchmark.. From: A ChIP-Seq Benchmark Shows That Sequence Conservation Mainly Improves Detection of Strong Transcription Factor Binding Sites.

The maximum score in each region is used to calculate the ROC curve. In the site benchmark, the negative regions around a peak are further divided into smaller regions of length 200 bp (not shown). The promoter benchmark is based on the same principle as the site benchmark, but the test regions are then derived from regions surrounding gene transcription start sites and from first introns, and the negative regions are not further divided into smaller regions.

Tony Håndstad, et al. PLoS One. 2011;6(4):e18430.
5.
Figure 2

Figure 2. Cumulative ROC score on site and promoter benchmarks.. From: A ChIP-Seq Benchmark Shows That Sequence Conservation Mainly Improves Detection of Strong Transcription Factor Binding Sites.

The cumulative number of TF datasets for which a method has a ROC AUC of more than a given value on the A) site and B) promoter benchmark. Each line represents a method and shows for each point along the y-axis how many datasets that have at least the ROC score given on the x-axis. The ROC score, or area under the ROC curve (AUC), is a measure of accuracy that summarizes the true-positive and false-positive rate and the implied trade-offs at all score thresholds.

Tony Håndstad, et al. PLoS One. 2011;6(4):e18430.
6.
Figure 3

Figure 3. ROC score correlates with motif length and information content.. From: A ChIP-Seq Benchmark Shows That Sequence Conservation Mainly Improves Detection of Strong Transcription Factor Binding Sites.

A) ROC score for PWM scanning as a function of motif length. B) ROC score for PWM scanning as a function of motif information content. Longer, information-rich motif achieve better scores. Note that YY1 has the second longest motif (V$YY1_01), but this motif also has the second lowest information content, which likely explains its lower score compared to the most information rich motif (V$NRSF_Q4). C) ROC curves for all methods on the E2F4 dataset in the promoter benchmark. The V$E2F_Q2 motif is one of the least informative motifs and the performance of the prediction methods on the E2F4 dataset is relatively low. D) ROC curves for all methods on the NRSF dataset in the promoter benchmark. The V$NRSF_Q4 motif is the most informative motif and the NRSF dataset is among the highest scoring datasets.

Tony Håndstad, et al. PLoS One. 2011;6(4):e18430.

Display Settings:

Items per page

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...
Write to the Help Desk