Genome binding/occupancy profiling by high throughput sequencing
Summary
We comprehensively benchmarked CUT&Tag for H3K27ac and H3K27me3 against published ChIP-seq profiles from ENCODE in K562 cells. Across a total of 30 new and 6 published CUT&Tag datasets we found that no experiment recovers more than 50% of known ENCODE peaks, regardless of the histone mark. We tested peak callers MACS2 and SEACR, identifying optimal peak calling parameters. Balancing both precision and recall of known ENCODE peaks, SEACR without retention of duplicates showed the best performance. We found that reducing PCR cycles during library preparation lowered duplication rates at the expense of ENCODE peak recovery. Despite the moderate ENCODE peak recovery, peaks identified by CUT&Tag represent the strongest ENCODE peaks and show the same functional and biological enrichments as ChIP-seq peaks identified by ENCODE. Our workflow systematically evaluates the merits of methodological adjustments and will facilitate future efforts to apply CUT&Tag in human tissues and single cells.
Overall design
30 CUT&Tag experiments testing H3K27ac and H3K27me3 antibodies at different concentrations, with and without addition of HDAC inhibitor (TSA), performing SDS- or column-based DNA extraction, and using different numbers of PCR cycles for sequencing library amplification.