Format

Send to:

Choose Destination
See comment in PubMed Commons below
Stat Methods Med Res. 2013 Oct;22(5):519-36. doi: 10.1177/0962280211428386. Epub 2011 Nov 28.

Finding consistent patterns: a nonparametric approach for identifying differential expression in RNA-Seq data.

Author information

  • 11Department of Statistics, Stanford University, Stanford, CA 94305, USA.

Abstract

We discuss the identification of features that are associated with an outcome in RNA-Sequencing (RNA-Seq) and other sequencing-based comparative genomic experiments. RNA-Seq data takes the form of counts, so models based on the normal distribution are generally unsuitable. The problem is especially challenging because different sequencing experiments may generate quite different total numbers of reads, or 'sequencing depths'. Existing methods for this problem are based on Poisson or negative binomial models: they are useful but can be heavily influenced by 'outliers' in the data. We introduce a simple, non-parametric method with resampling to account for the different sequencing depths. The new method is more robust than parametric methods. It can be applied to data with quantitative, survival, two-class or multiple-class outcomes. We compare our proposed method to Poisson and negative binomial-based methods in simulated and real data sets, and find that our method discovers more consistent patterns than competing methods.

KEYWORDS:

FDR; RNA-Seq; differential expression; nonparametric; resampling

PMID:
22127579
[PubMed - indexed for MEDLINE]
PMCID:
PMC4605138
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for HighWire Icon for PubMed Central
    Loading ...
    Write to the Help Desk