Display Settings:

Format

Send to:

Choose Destination
See comment in PubMed Commons below
BMC Genomics. 2012 Aug 14;13:392. doi: 10.1186/1471-2164-13-392.

Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data.

Author information

  • 1Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Zhongguancun, Beijing, People's Republic of China.

Abstract

BACKGROUND:

Rapid advances in next-generation sequencing methods have provided new opportunities for transcriptome sequencing (RNA-Seq). The unprecedented sequencing depth provided by RNA-Seq makes it a powerful and cost-efficient method for transcriptome study, and it has been widely used in model organisms and non-model organisms to identify and quantify RNA. For non-model organisms lacking well-defined genomes, de novo assembly is typically required for downstream RNA-Seq analyses, including SNP discovery and identification of genes differentially expressed by phenotypes. Although RNA-Seq has been successfully used to sequence many non-model organisms, the results of de novo assembly from short reads can still be improved by using recent bioinformatic developments.

RESULTS:

In this study, we used 212.6 million pair-end reads, which accounted for 16.2 Gb, to assemble the hexaploid wheat transcriptome. Two state-of-the-art assemblers, Trinity and Trans-ABySS, which use the single and multiple k-mer methods, respectively, were used, and the whole de novo assembly process was divided into the following four steps: pre-assembly, merging different samples, removal of redundancy and scaffolding. We documented every detail of these steps and how these steps influenced assembly performance to gain insight into transcriptome assembly from short reads. After optimization, the assembled transcripts were comparable to Sanger-derived ESTs in terms of both continuity and accuracy. We also provided considerable new wheat transcript data to the community.

CONCLUSIONS:

It is feasible to assemble the hexaploid wheat transcriptome from short reads. Special attention should be paid to dealing with multiple samples to balance the spectrum of expression levels and redundancy. To obtain an accurate overview of RNA profiling, removal of redundancy may be crucial in de novo assembly.

PMID:
22891638
[PubMed - indexed for MEDLINE]
PMCID:
PMC3485621
Free PMC Article

Images from this publication.See all images (4)Free text

Figure 1
Figure 2
Figure 3
Figure 4

Publication Types, MeSH Terms, Substances, Secondary Source ID

Publication Types

MeSH Terms

Substances

Secondary Source ID

PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Icon for BioMed Central Icon for PubMed Central
    Loading ...
    Write to the Help Desk