Discovery and characterization of medaka miRNA genes by next generation sequencing platform

BMC Genomics. 2010 Dec 2;11 Suppl 4(Suppl 4):S8. doi: 10.1186/1471-2164-11-S4-S8.

Abstract

Background: MicroRNAs (miRNAs) are endogenous non-protein-coding RNA genes which exist in a wide variety of organisms, including animals, plants, virus and even unicellular organisms. Medaka (Oryzias latipes) is a useful model organism among vertebrate animals. However, no medaka miRNAs have been investigated systematically. It is beneficial to conduct a genome-wide miRNA discovery study using the next generation sequencing (NGS) technology, which has emerged as a powerful sequencing tool for high-throughput analysis.

Results: In this study, we adopted ABI SOLiD platform to generate small RNA sequence reads from medaka tissues, followed by mapping these sequence reads back to medaka genome. The mapped genomic loci were considered as candidate miRNAs and further processed by a support vector machine (SVM) classifier. As result, we identified 599 novel medaka pre-miRNAs, many of which were found to encode more than one isomiRs. Besides, additional minor miRNAs (also called miRNA star) can be also detected with the improvement of sequencing depth. These quantifiable isomiRs and minor miRNAs enable us to further characterize medaka miRNA genes in many aspects. First of all, many medaka candidate pre-miRNAs position close to each other, forming many miRNA clusters, some of which are also conserved across other vertebrate animals. Secondly, during miRNA maturation, there is an arm selection preference of mature miRNAs within precursors. We observed the differences on arm selection preference between our candidate pre-miRNAs and their orthologous ones. We classified these differences into three categories based on the distribution of NGS reads. Finally, we also investigated the relationship between conservation status and expression level of miRNA genes. We concluded that the evolutionally conserved miRNAs were usually the most abundant ones.

Conclusions: Medaka is a widely used model animal and usually involved in many biomedical studies, including the ones on development biology. Identifying and characterizing medaka miRNA genes would benefit the studies using medaka as a model organism.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence / genetics
  • Evolution, Molecular
  • Female
  • Genome
  • Male
  • MicroRNAs / genetics*
  • MicroRNAs / metabolism
  • Models, Animal
  • Multigene Family
  • Oryzias / genetics*
  • Oryzias / metabolism
  • RNA / genetics
  • RNA / isolation & purification
  • RNA, Untranslated / genetics
  • Sequence Analysis, RNA / methods*
  • Software
  • Species Specificity

Substances

  • MicroRNAs
  • RNA, Untranslated
  • RNA