Bioinformatics. 2012 Apr 1;28(7):929-37. doi: 10.1093/bioinformatics/bts065. Epub 2012 Feb 13.

Positional correlation analysis improves reconstruction of full-length transcripts and alternative isoforms from noisy array signals or short reads.

Author information

Bioinformatics and Systems Engineering division, RIKEN Yokohama Institute, Tsurumi, Yokohama, Kanagawa 230-0045, Japan.

Abstract

MOTIVATION:

Reconstructing full-length transcripts from data produced by next-generation sequencers or tiling arrays is essential for characterizing the transcriptome as a whole. Several reconstruction techniques have been developed, but high levels of noise and bias still hamper reconstruction. A method is needed that is robust against noise and bias and reconstructs transcripts correctly regardless of the equipment used.

RESULTS:

We propose a new statistical method that reconstructs full-length transcripts and can be applied to data from both next-generation sequencers and tiling arrays. The method, called ARTADE2, analyzes 'positional correlation', that is, the correlation of expression values between every pair of genomic positions across multiple transcriptome datasets. ARTADE2 then reconstructs full-length transcripts using a logistic model based on the positional correlation and a Markov model. ARTADE2 elucidated 17 591 full-length transcripts from 55 transcriptome datasets and showed notable performance compared with other recent prediction methods. Moreover, 1489 novel transcripts were discovered. We experimentally tested 16 novel transcripts, of which 14 were confirmed by reverse transcription-polymerase chain reaction and sequence mapping. The method also showed notable performance in reconstructing mRNA observed by a next-generation sequencer. In addition, the positional correlation and factor analysis embedded in ARTADE2 successfully detected regions at which alternative isoforms may exist, and are thus expected to be applicable to the discovery of transcript biomarkers in a wide range of disciplines, including preemptive medicine.
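As a rough illustration of the 'positional correlation' idea only, the sketch below computes a Pearson correlation matrix between genomic positions across several datasets. It is not the ARTADE2 implementation: the function name `positional_correlation`, the simulated data, the noise level, and the choice of Pearson correlation are all assumptions made for this example, and the logistic model, Markov model, and factor analysis are not shown.

```python
import numpy as np


def positional_correlation(expr):
    """Pearson correlation between every pair of genomic positions.

    expr: array of shape (n_datasets, n_positions); each row is one
    transcriptome dataset, each column one genomic position.
    Returns an (n_positions, n_positions) correlation matrix.
    Hypothetical helper for illustration, not part of ARTADE2.
    """
    expr = np.asarray(expr, dtype=float)
    # rowvar=False treats columns (positions) as the variables.
    return np.corrcoef(expr, rowvar=False)


# Simulated toy data (an assumption, not real measurements):
# 55 datasets, 6 positions. Positions 0-2 follow one transcript's
# expression level, positions 3-5 follow another's, plus noise.
rng = np.random.default_rng(0)
level_a = rng.normal(size=(55, 1))
level_b = rng.normal(size=(55, 1))
expr = np.hstack([
    level_a + 0.1 * rng.normal(size=(55, 3)),
    level_b + 0.1 * rng.normal(size=(55, 3)),
])

corr = positional_correlation(expr)
print(np.round(corr, 2))
```

In this toy setting, positions that belong to the same simulated transcript correlate strongly across datasets, while positions from different transcripts do not, which is the kind of signal a downstream model could use to decide where a transcript begins and ends.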

AVAILABILITY:

http://matome.base.riken.jp

CONTACT:

toyoda@base.riken.jp

SUPPLEMENTARY INFORMATION:

Supplementary data are available at Bioinformatics online.

PMID: 22332235
PMCID: PMC3315713
DOI: 10.1093/bioinformatics/bts065
[Indexed for MEDLINE]
Free PMC Article
