Long-range genomic enrichment, sequencing, and assembly to determine unknown sequences flanking a known microRNA

PLoS One. 2013 Dec 20;8(12):e83721. doi: 10.1371/journal.pone.0083721. eCollection 2013.

Abstract

Conserved plant microRNAs (miRNAs) modulate important biological processes but little is known about conserved cis-regulatory elements (CREs) surrounding MIRNA genes. We developed a solution-based targeted genomic enrichment methodology to capture, enrich, and sequence flanking genomic regions surrounding conserved MIRNA genes with a locked-nucleic acid (LNA)-modified, biotinylated probe complementary to the mature miRNA sequence. Genomic DNA bound by the probe is captured by streptavidin-coated magnetic beads, amplified, sequenced and assembled de novo to obtain genomic DNA sequences flanking MIRNA locus of interest. We demonstrate the sensitivity and specificity of this enrichment methodology in Arabidopsis thaliana to enrich targeted regions spanning 10-20 kb surrounding known MIR166 and MIR165 loci. Assembly of the sequencing reads successfully recovered all targeted loci. While further optimization for larger, more complex genomes is needed, this method may enable determination of flanking genomic DNA sequence surrounding a known core (like a conserved mature miRNA) from multiple species that currently don't have a full genome assembly available.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Arabidopsis / genetics*
  • Base Sequence
  • Conserved Sequence
  • Genes, Plant / genetics
  • Genetic Engineering / methods*
  • Genetic Loci / genetics
  • Genome, Plant / genetics*
  • MicroRNAs / genetics*
  • Zea mays / genetics

Substances

  • MicroRNAs

Grants and funding

This study was funded by US National Science Foundation (NSF) award 0964859 to MJA. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.