NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM3402940 Query DataSets for GSM3402940
Status Public on Nov 26, 2018
Title control_3_blood (RRBS)
Sample type SRA
 
Source name peripheral blood leukocytes
Organism Homo sapiens
Characteristics disease state: healthy control
cell type: peripheral blood leukocytes
genotype: control
Treatment protocol N/A
Growth protocol N/A
Extracted molecule genomic DNA
Extraction protocol Genomic DNA isolated with DNeasy Blood & Tissue Kit (QIAGEN) or Nucleon BACC2 Genomic DNA Extraction Kit (illustra) and quantified by Qubit (Invitrogen). DNA from mouse cortex samples were concentrated using Agencourt AMPure XP technology.
200ng of purified DNA samples were processed using the Ovation RRBS Methyl-Seq system kit according to instructions with the following modifications (NuGen Technologies). Briefly, unmethylated phage λ DNA 0.5ng (NEB) was spiked into each sample to allow assessment of bisulfite conversion efficiency. The methylation-insensitive restriction enzyme MspI was then used to digest the gDNA, and digested fragments were ligated to adapters. Adapter ligated fragments were then repaired before bisulfite conversion with the Qiagen Epitect fast bisulfite conversion kit. Bisulfite-treated adapter-ligated fragments were amplified by 9 cycles of PCR and purified using Agencourt RNAClean XP beads. Libraries were quantified using the Qubit dsDNA HS assay and assessed for size and quality using the Agilent Bioanalyzer DNA HS kit. Sequencing was performed using the NextSeq 500/550 high-output version 2 kit (75bp paired end reads) on the Illumina NextSeq 550 platform. As instructed for the NuGen RRBS kit, 12bp index reads were generated to sequence the Unique Molecular Identifiers (UMI) in addition to the index present in the adaptors. Libraries were combined into equimolar pools and run on a single flow cell. 10% PhiX control library was spiked in to facilitate sequencing by generating additional sequence diversity. Library preparation and sequencing performed by WTCRF, Edinburgh.
 
Library strategy Bisulfite-Seq
Library source genomic
Library selection Reduced Representation
Instrument model NextSeq 550
 
Description C3
Data processing Raw Illumina sequencing output from the NextSeq were converted to paired FASTQ files without demultiplexing using ‘bcl2fastq’ and default settings (v2.17.1.14). These FASTQ files were then demultiplexed using custom python scripts considering indexes with perfect matches to the sample indexes. The different lanes for each sample were then combined.
Sequencing quality was assessed with FASTQC (v0.11.4). Low quality reads and remaining adaptors were removed using TrimGalore (v0.4.1, Settings: --adapter AGATCGGAAGAGC --adapter2 AAATCAAAAAAAC). NuGen adaptors contain extra diversity bases to facilitate sequencing. These were removed using the ‘trimRRBSdiversityAdaptCustomers.py’ Python script provided by NuGen (v1.11). The paired end reads were then aligned to the hg19 genome using Bismark (v 0.16.3 with Bowtie2 v2.2.6 with settings: -N 0 -L 20) before PCR duplicates were identified and removed using the 6bp UMIs present in the index reads and the ‘nudup.py’ Python script supplied by NuGen (v2.3). Aligned BAM files were processed to report coverage and number of methylated reads for each CpG observed. Forward and reverse strands were combined using Bismark’s methylation extractor and bismark2bedgraph modules with custom Python and AWK scripts.
Processed RRBS files were assessed for conversion efficiency based on the proportion of methylated reads mapping to the λ genome spike-in (>99.5% in all cases).
BigWigs were generated from RRBS data using CpGs with coverage ≥5
Genome_build: hg19
Supplementary_files_format_and_content: Processed coverage values for each CpG observed in sample. CpG position and total coverage (T) and methylated coverage (M) combining both strands. Columns = chr, start, end, T, M. Positions are 0-based, ie BED format.
Supplementary_files_format_and_content: bigWig files for each sample also supplied. Only contain CpGs with coverage >= 5 in that sample.
Raw sequencing reads are available through EGA (protected patient data): EGAS00001003232
 
Submission date Sep 27, 2018
Last update date Nov 26, 2018
Contact name Duncan Sproul
Organization name University of Edinburgh
Department MRC Human Genetics Unit
Street address Crewe Road South
City Edinburgh
State/province Mid Lothian
ZIP/Postal code EH4 2XU
Country United Kingdom
 
Platform ID GPL21697
Series (2)
GSE120554 Gain of function DNMT3A mutations cause microcephalic dwarfism and hypermethylation of Polycomb-regulated regions (human RRBS)
GSE120558 Gain of function DNMT3A mutations cause microcephalic dwarfism and hypermethylation of Polycomb-regulated regions
Relations
BioSample SAMN10136763

Supplementary file Size Download File type/resource
GSM3402940_C3_RRBS.bigWig 15.4 Mb (ftp)(http) BIGWIG
GSM3402940_C3_RRBS.cov.gz 11.0 Mb (ftp)(http) COV
Raw data not provided for this record
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap