Format

Send to

Choose Destination
Stat Appl Genet Mol Biol. 2017 Nov 27;16(5-6):333-347. doi: 10.1515/sagmb-2016-0062.

A smoothed EM-algorithm for DNA methylation profiles from sequencing-based methods in cell lines or for a single cell type.

Abstract

We consider the assessment of DNA methylation profiles for sequencing-derived data from a single cell type or from cell lines. We derive a kernel smoothed EM-algorithm, capable of analyzing an entire chromosome at once, and to simultaneously correct for experimental errors arising from either the pre-treatment steps or from the sequencing stage and to take into account spatial correlations between DNA methylation profiles at neighbouring CpG sites. The outcomes of our algorithm are then used to (i) call the true methylation status at each CpG site, (ii) provide accurate smoothed estimates of DNA methylation levels, and (iii) detect differentially methylated regions. Simulations show that the proposed methodology outperforms existing analysis methods that either ignore the correlation between DNA methylation profiles at neighbouring CpG sites or do not correct for errors. The use of the proposed inference procedure is illustrated through the analysis of a publicly available data set from a cell line of induced pluripotent H9 human embryonic stem cells and also a data set where methylation measures were obtained for a small genomic region in three different immune cell types separated from whole blood.

KEYWORDS:

DNA methylation; EM-algorithm; Kernel smoothing; MethylC-Seq; RRBS

PMID:
29055941
DOI:
10.1515/sagmb-2016-0062
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Sheridan PubFactory
Loading ...
Support Center