Genetic factors underlying discordance in chromatin accessibility between monozygotic twins

Genome Biol. 2014 May 29;15(5):R72. doi: 10.1186/gb-2014-15-5-r72.

Abstract

Background: Open chromatin is implicated in regulatory processes; thus, variations in chromatin structure may contribute to variations in gene expression and other phenotypes. In this work, we perform targeted deep sequencing for open chromatin, and array-based genotyping across the genomes of 72 monozygotic twins to identify genetic factors regulating co-twin discordance in chromatin accessibility.

Results: We show that somatic mutations cause chromatin discordance mainly via the disruption of transcription factor binding sites. Structural changes in DNA due to C:G to A:T transversions are under purifying selection due to a strong impact on chromatin accessibility. We show that CpGs whose methylation is specifically regulated during cellular differentiation appear to be protected from high mutation rates of 5'-methylcytosines, suggesting that the spectrum of CpG variations may be shaped fully at the developmental level but not through natural selection. Based on the association mapping of within-pair chromatin differences, we search for cases in which twin siblings with a particular genotype had chromatin discordance at the relevant locus. We identify 1,325 chromatin sites that are differentially accessible, depending on the genotype of a nearby locus, suggesting that epigenetic differences can control regulatory variations via interactions with genetic factors. Poised promoters present high levels of chromatin discordance in association with either somatic mutations or genetic-epigenetic interactions.

Conclusion: Our observations illustrate how somatic mutations and genetic polymorphisms may contribute to regulatory, and ultimately phenotypic, discordance.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Base Sequence
  • Binding Sites
  • Chromatin / chemistry
  • Chromatin / genetics
  • Chromatin / metabolism*
  • Computational Biology / methods
  • DNA Methylation
  • Epigenesis, Genetic
  • Gene Expression Profiling
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Middle Aged
  • Molecular Sequence Data
  • Mutation
  • Oligonucleotide Array Sequence Analysis / methods*
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci
  • Transcription Factors / metabolism*
  • Twins, Monozygotic / genetics*

Substances

  • Chromatin
  • Transcription Factors

Associated data

  • GEO/GSE44742
  • GEO/GSE53822