Trimitomics: An efficient pipeline for mitochondrial assembly from transcriptomic reads in nonmodel species

Mol Ecol Resour. 2019 Sep;19(5):1230-1239. doi: 10.1111/1755-0998.13033. Epub 2019 Jun 12.

Abstract

Mitochondrial resources are of known utility to many fields of phylogenetic, population and molecular biology. Their combination of faster and slower-evolving regions and high copy number enables them to be used in many situations where other loci are unsuitable, with degraded samples and after recent speciation events.The advent of next-generation sequencing technologies (and notably the Illumina platform) has led to an explosion in the number of samples that can be studied at transcriptomic level, at relatively low cost. Here we describe a robust pipeline for the recovery of mitochondrial genomes from these RNA-sequencing resources. This pipeline can be used on sequencing of a variety of depths, and reliably recovers the protein coding and ribosomal gene complements of mitochondria from almost any transcriptomic sequencing experiment. The complete sequence of the mitochondrial genome can also be recovered when sequencing is performed in sufficient depth. We show the efficacy of our pipeline using data from eight nonmodel invertebrates of six disparate phyla. Interestingly, among our poriferan data, where microbiological symbionts are known empirically to make mitochondrial assembly difficult, this pipeline proved especially useful. Our pipeline will allow the recovery of mitochondrial data from a variety of previously sequenced samples, and add an additional angle of enquiry to future RNA-sequencing efforts, simplifying the process of mitochondrial genome assembly for even the most recalcitrant clades and adding these data to the scientific record for a range of future uses.

Keywords: assembly; invertebrates; mitochondrial genome; transcriptomics.

MeSH terms

  • Animals
  • Gene Expression Profiling / methods*
  • Genome, Mitochondrial*
  • Genomics / methods*
  • Invertebrates / classification
  • Invertebrates / genetics
  • Sequence Analysis, RNA / methods

Associated data

  • GENBANK/SRP157324
  • GENBANK/SRP150632
  • GENBANK/MH768970
  • GENBANK/MH768972