Transcriptome assembly, gene annotation and tissue gene expression atlas of the rainbow trout

PLoS One. 2015 Mar 20;10(3):e0121778. doi: 10.1371/journal.pone.0121778. eCollection 2015.

Abstract

Efforts to obtain a comprehensive genome sequence for rainbow trout are ongoing and will be complemented by transcriptome information that will enhance genome assembly and annotation. Previously, transcriptome reference sequences were reported using data from different sources. Although the previous work added a great wealth of sequences, a complete and well-annotated transcriptome is still needed. In addition, gene expression in different tissues was not completely addressed in the previous studies. In this study, non-normalized cDNA libraries were sequenced from 13 different tissues of a single doubled haploid rainbow trout from the same source used for the rainbow trout genome sequence. A total of ~1.167 billion paired-end reads were de novo assembled using the Trinity RNA-Seq assembler yielding 474,524 contigs > 500 base-pairs. Of them, 287,593 had homologies to the NCBI non-redundant protein database. The longest contig of each cluster was selected as a reference, yielding 44,990 representative contigs. A total of 4,146 contigs (9.2%), including 710 full-length sequences, did not match any mRNA sequences in the current rainbow trout genome reference. Mapping reads to the reference genome identified an additional 11,843 transcripts not annotated in the genome. A digital gene expression atlas revealed 7,678 housekeeping and 4,021 tissue-specific genes. Expression of about 16,000-32,000 genes (35-71% of the identified genes) accounted for basic and specialized functions of each tissue. White muscle and stomach had the least complex transcriptomes, with high percentages of their total mRNA contributed by a small number of genes. Brain, testis and intestine, in contrast, had complex transcriptomes, with a large numbers of genes involved in their expression patterns. This study provides comprehensive de novo transcriptome information that is suitable for functional and comparative genomics studies in rainbow trout, including annotation of the genome.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Alternative Splicing / genetics
  • Animals
  • Base Sequence
  • Cichlids / genetics
  • Contig Mapping
  • DNA, Complementary / genetics
  • Gene Expression Profiling*
  • Gene Library
  • Gene Ontology
  • Genes, Essential
  • Genome
  • High-Throughput Nucleotide Sequencing
  • Molecular Sequence Annotation*
  • Oncorhynchus mykiss / classification
  • Oncorhynchus mykiss / genetics*
  • Organ Specificity / genetics*
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Transcriptome / genetics*

Substances

  • DNA, Complementary
  • RNA, Messenger

Grants and funding

This study was supported by a cooperative agreement grant No. 58-1930-0-059 from the United States Department of Agriculture, Agriculture and Food Research (JY); and a competitive grant No. 2014-67015-21602 from the United States Department of Agriculture, National Institute of Food and Agriculture (MS). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.