BMC Genomics. 2016 May 26;17:403. doi: 10.1186/s12864-016-2745-8.

An optimized protocol for generation and analysis of Ion Proton sequencing reads for RNA-Seq.

Yuan Y1, Xu H2, Leung RK3,4,5.

Author information

1 BGI-tech, BGI-Shenzhen, Shenzhen, 518083, Guangdong, China.
2 BGI-tech, BGI-Wuhan, Wuhan, 430075, Hubei, China.
3 BGI-tech, BGI-Shenzhen, Shenzhen, 518083, Guangdong, China. yssun@hku.hk.
4 School of Public Health, The University of Hong Kong, Hong Kong, China. yssun@hku.hk.
5 Stanley Ho Centre for Emerging Infectious Diseases, The Chinese University of Hong Kong, Hong Kong, China. yssun@hku.hk.

Abstract

BACKGROUND:

Previous studies have compared the running cost, time and other performance measures of popular sequencing platforms. However, a comprehensive assessment of library construction and analysis protocols for the Proton sequencing platform is still lacking. Unlike reads from Illumina sequencing platforms, Proton reads are heterogeneous in length and quality. When sequencing data from different platforms are combined, the resulting reads can vary in length. Whether commonly used software handles this kind of data satisfactorily is unknown.

RESULTS:

Using universal human reference RNA as the starting material, the RNaseIII and chemical fragmentation methods of library construction gave similar results in the number of genes and junctions discovered and in the accuracy of expression level estimates. In contrast, sequencing quality, read length and the choice of software affected the mapping rate to a much larger extent. The unspliced aligner TMAP attained the highest mapping rate (97.27 % to the genome, 86.46 % to the transcriptome), though 47.83 % of mapped reads were clipped. Long reads could paradoxically reduce mapping at junctions. With reference annotation as a guide, the mapping rate of TopHat2 increased significantly from 75.79 to 92.09 %, especially for long (>150 bp) reads. Sailfish, a k-mer based gene expression quantifier, attained results highly consistent with those of the TaqMan array, as well as the highest sensitivity.

CONCLUSION:

We provide, for the first time, reference statistics on library preparation methods, gene detection and quantification, and junction discovery for RNA-Seq on the Ion Proton platform. Chemical fragmentation performed as well as enzyme-based fragmentation. The optimal Ion Proton sequencing options and analysis software have been evaluated.

KEYWORDS:

Ion Proton; RNA-Seq; Sequencing length; Sequencing quality; Transcriptome

PMID: 27229683
PMCID: PMC4880854
DOI: 10.1186/s12864-016-2745-8
[Indexed for MEDLINE]
Free PMC Article
