Curr Genomics. 2009 Nov;10(7):446-62. doi: 10.2174/138920209789208228.

Classification and error estimation for discrete data.

Author information

  • Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77845, USA.

Abstract

Discrete classification is common in Genomic Signal Processing applications, in particular in the classification of discretized gene expression data, in discrete gene expression prediction, and in the inference of Boolean genomic regulatory networks. Once a discrete classifier is obtained from sample data, its performance must be evaluated through its classification error. In practice, error estimation methods must then be employed to obtain reliable estimates of the classification error based on the available data. In Genomics, both classifier design and error estimation are complicated by the prevalence of small-sample data sets. This paper presents a broad review of the methodology of classification and error estimation for discrete data, in the context of Genomics, focusing on the study of performance in small-sample scenarios, as well as asymptotic behavior.

KEYWORDS:

Genomics; classification; coefficient of determination; discrete histogram rule; ensemble methods; error estimation; leave-one-out; resubstitution; sampling distribution

PMID: 20436873 [PubMed]
PMCID: PMC2808673
Free PMC Article