Format

Send to:

Choose Destination
See comment in PubMed Commons below
Australas Med J. 2013 May 30;6(5):272-9. doi: 10.4066/AMJ.2013.1641. Print 2013.

Incorporating feature ranking and evolutionary methods for the classification of high-dimensional DNA microarray gene expression data.

Author information

  • 1Department of Computing and Information Systems, The University of Melbourne, Victoria 3010, Australia ; IBM Research Australia, Carlton, Victoria 3053, Australia.

Abstract

BACKGROUND:

DNA microarray gene expression classification poses a challenging task to the machine learning domain. Typically, the dimensionality of gene expression data sets could go from several thousands to over 10,000 genes. A potential solution to this issue is using feature selection to reduce the dimensionality.

AIMS:

The aim of this paper is to investigate how we can use feature quality information to improve the precision of microarray gene expression classification tasks.

METHOD:

We propose two evolutionary machine learning models based on the eXtended Classifier System (XCS) and a typical feature selection methodology. The first one, which we call FS-XCS, uses feature selection for feature reduction purposes. The second model is GRD-XCS, which uses feature ranking to bias the rule discovery process of XCS.

RESULTS:

The results indicate that the use of feature selection/ranking methods is essential for tackling highdimensional classification tasks, such as microarray gene expression classification. However, the results also suggest that using feature ranking to bias the rule discovery process performs significantly better than using the feature reduction method. In other words, using feature quality information to develop a smarter learning procedure is more efficient than reducing the feature set.

CONCLUSION:

Our findings have shown that extracting feature quality information can assist the learning process and improve classification accuracy. On the other hand, relying exclusively on the feature quality information might potentially decrease the classification performance (e.g., using feature reduction). Therefore, we recommend a hybrid approach that uses feature quality information to direct the learning process by highlighting the more informative features, but at the same time not restricting the learning process to explore other features.

KEYWORDS:

Classification; GRD-XCS; XCS; eXtended Classifier System; evolutionary algorithms; feature ranking; guided rule discovery XCS; high-dimensional data; microarray gene expression profiling

PMID:
23745148
[PubMed]
PMCID:
PMC3674418
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for PubMed Central
    Loading ...
    Write to the Help Desk