Format

Send to

Choose Destination
Genome Biol. 2007;8(5):R68.

ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes.

Author information

1
Department of Computer Science, State University of New York at Albany, Washington Ave, Albany, New York 12222, USA. bking@cs.albany.edu

Abstract

We present a method called ngLOC, an n-gram-based Bayesian classifier that predicts the localization of a protein sequence over ten distinct subcellular organelles. A tenfold cross-validation result shows an accuracy of 89% for sequences localized to a single organelle, and 82% for those localized to multiple organelles. An enhanced version of ngLOC was developed to estimate the subcellular proteomes of eight eukaryotic organisms: yeast, nematode, fruitfly, mosquito, zebrafish, chicken, mouse, and human.

PMID:
17472741
PMCID:
PMC1929137
DOI:
10.1186/gb-2007-8-5-r68
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center