Send to

Choose Destination
Stat Anal Data Min. 2011 Jun 1;4(3):301-312.

Sequential Support Vector Regression with Embedded Entropy for SNP Selection and Disease Classification.

Author information

Department of Family and Community Health, University of Maryland, Baltimore 655 W. Lombard Street, Baltimore, MD 21201-1579.


Comprehensive evaluation of common genetic variations through association of SNP structure with common diseases on the genome-wide scale is currently a hot area in human genome research. For less costly and faster diagnostics, advanced computational approaches are needed to select the minimum SNPs with the highest prediction accuracy for common complex diseases. In this paper, we present a sequential support vector regression model with embedded entropy algorithm to deal with the redundancy for the selection of the SNPs that have best prediction performance of diseases. We implemented our proposed method for both SNP selection and disease classification, and applied it to simulation data sets and two real disease data sets. Results show that on the average, our proposed method outperforms the well known methods of Support Vector Machine Recursive Feature Elimination, logistic regression, CART, and logic regression based SNP selections for disease classification.

Supplemental Content

Full text links

Icon for PubMed Central
Loading ...
Support Center