Sorting the wheat from the chaff: identifying miRNAs in genomic survey sequences of Triticum aestivum chromosome 1AL

PLoS One. 2012;7(7):e40859. doi: 10.1371/journal.pone.0040859. Epub 2012 Jul 17.

Abstract

Individual chromosome-based studies of bread wheat are beginning to provide valuable structural and functional information about one of the world's most important crops. As new genome sequences become available, identifying miRNA coding sequences is arguably as important a task as annotating protein coding sequences, but one that is not as well developed. We compared conservation-based identification of conserved miRNAs in 1.5× coverage survey sequences of wheat chromosome 1AL with a predictive method based on pre-miRNA hairpin structure alone. In total, 42 sequences expected to encode conserved miRNAs were identified on chromosome 1AL, including members of several miRNA families that have not previously been reported to be expressed in T. aestivum. In addition, we demonstrate that a number of sequences previously annotated as novel wheat miRNAs are closely related to transposable elements, particularly Miniature Inverted Terminal repeat Elements (MITEs). Some of these TE-miRNAs may well have a functional role, but separating true miRNA coding sequences from TEs in genomic sequences is far from straightforward. We propose a strategy for annotation to minimize the risk of mis-identifying TE sequences as miRNAs.

MeSH terms

  • Base Sequence
  • Chromosomes, Artificial, Bacterial / genetics
  • Chromosomes, Plant / genetics*
  • Conserved Sequence / genetics
  • DNA Transposable Elements
  • Data Collection
  • Gene Expression Profiling
  • Gene Expression Regulation, Plant
  • Genetic Association Studies
  • Genome, Plant / genetics*
  • Inverted Repeat Sequences / genetics
  • MicroRNAs / chemistry
  • MicroRNAs / genetics*
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • RNA, Plant / genetics
  • Triticum / genetics*

Substances

  • DNA Transposable Elements
  • MicroRNAs
  • RNA, Plant