Bioinformatics. 2013 Dec 1;29(23):3036-44. doi: 10.1093/bioinformatics/btt529. Epub 2013 Sep 12.

Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies.

Author information

Department of Computer Science, Princeton University, Princeton, NJ 08544, USA and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA.

Abstract

MOTIVATION:

Leveraging gene expression data through large-scale integrative analyses for multicellular organisms is challenging because most samples are not fully annotated to their tissue/cell-type of origin. A computational method to classify samples using their entire gene expression profiles is needed. Such a method must be applicable across thousands of independent studies, hundreds of gene expression technologies and hundreds of diverse human tissues and cell-types.

RESULTS:

We present Unveiling RNA Sample Annotation (URSA), a method that leverages the complex relationships among tissues and cell-types to simultaneously estimate, for any given gene expression profile, the probabilities associated with hundreds of tissues/cell-types. URSA provides accurate and intuitive probability values for expression profiles across independent studies and outperforms other methods, irrespective of data preprocessing techniques. Moreover, without re-training, URSA can be used to classify samples from diverse microarray platforms and even from next-generation sequencing technology. Finally, we provide a molecular interpretation for the tissue and cell-type models as the biological basis for URSA's classifications.
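As a rough illustration of what "ontology-aware" scoring can mean, the following is a minimal sketch and not URSA's actual algorithm, which the abstract does not specify. It assumes a hypothetical toy ontology (`PARENTS`) and made-up per-term scores (`raw`), and applies one simple consistency rule: a sample assigned to a specific cell-type should be at least as probable for every broader term that contains it.

```python
from typing import Dict, List

# Hypothetical toy ontology (an assumption for illustration):
# each term maps to the list of its direct parent terms.
PARENTS: Dict[str, List[str]] = {
    "blood": [],
    "leukocyte": ["blood"],
    "T cell": ["leukocyte"],
    "B cell": ["leukocyte"],
}


def propagate_up(scores: Dict[str, float],
                 parents: Dict[str, List[str]]) -> Dict[str, float]:
    """Raise each term's score to at least the maximum score of its descendants."""
    # Invert the child -> parents map into parent -> children.
    children: Dict[str, List[str]] = {term: [] for term in parents}
    for child, direct_parents in parents.items():
        for parent in direct_parents:
            children[parent].append(child)

    memo: Dict[str, float] = {}

    def resolve(term: str) -> float:
        # A term's consistent score is the max of its own score and
        # the consistent scores of all of its children (recursively).
        if term not in memo:
            memo[term] = max([scores[term]] + [resolve(c) for c in children[term]])
        return memo[term]

    return {term: resolve(term) for term in parents}


# Made-up, independently estimated scores for a single expression profile.
raw = {"blood": 0.40, "leukocyte": 0.30, "T cell": 0.85, "B cell": 0.10}
consistent = propagate_up(raw, PARENTS)
```

Here the strong "T cell" score lifts "leukocyte" and "blood" to the same level, while the unrelated sibling "B cell" is left unchanged. A method that jointly models the ontology would generally do more than this one-directional rule, for example by also letting weak evidence for a broad term lower the scores of its descendants.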

PMID: 24037214
PMCID: PMC3834796
DOI: 10.1093/bioinformatics/btt529
[Indexed for MEDLINE]
Free PMC Article
