Format

Send to

Choose Destination
Genome Inform. 2010 Jan;22:30-40.

Active pathway identification and classification with probabilistic ensembles.

Author information

1
Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan. timhancock@kuicr.kyoto-u.ac.jp

Abstract

A popular means of modeling metabolic networks is through identifying frequently observed pathways. However the definition of what constitutes an observation of a pathway and how to evaluate the importance of identified pathways remains unclear. In this paper we investigate different methods for defining an observed pathway and evaluate their performance with pathway classification models. We use three methods for defining an observed pathway; a path in gene over-expression, a path in probable gene over-expression and a path of most accurate classification. The performance of each definition is evaluated with three classification models; a probabilistic pathway classifier - HME3M, logistic regression and SVM. The results show that defining pathways using the probability of gene over-expression creates stable and accurate classifiers. Conversely we also show defining pathways of most accurate classification finds a severely biased pathways that are unrepresentative of underlying microarray data structure.

PMID:
20238417
[Indexed for MEDLINE]

Supplemental Content

Loading ...
Support Center