Send to

Choose Destination
Int J Bioinform Res Appl. 2009;5(4):417-31.

Comparison of feature selection and classification combinations for cancer classification using microarray data.

Author information

Department of Bioinformatics, Dr. D.Y. Patil Biotechnology and Bioinformatics Institute, Akurdi, Pune 411044, India.


High throughput gene expression data can be used to identify biomarker profiles for classification. The accuracy of microarray based sample classification depends on the algorithm employed for selecting the features (genes) used for classification, and the classification algorithm. We have evaluated the performance of over 2000 combinations of feature selection and classification algorithms in classifying cancer datasets. One of these combinations (SVM for ranking genes + SMO) shows excellent classification accuracy using a small number of genes across three cancer datasets tested. Notably, classification using 15 selected genes yields 96% accuracy for a dataset obtained on an independent microarray platform.

[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Atypon
Loading ...
Support Center