Format

Send to:

Choose Destination
See comment in PubMed Commons below
Biometrics. 2008 Jun;64(2):440-8. Epub 2007 Oct 26.

Variable selection for model-based high-dimensional clustering and its application to microarray data.

Author information

  • 1Department of Biostatistics, University of Michigan, Ann Arbor, Michigan 48109, USA.

Abstract

Variable selection in high-dimensional clustering analysis is an important yet challenging problem. In this article, we propose two methods that simultaneously separate data points into similar clusters and select informative variables that contribute to the clustering. Our methods are in the framework of penalized model-based clustering. Unlike the classical L(1)-norm penalization, the penalty terms that we propose make use of the fact that parameters belonging to one variable should be treated as a natural "group." Numerical results indicate that the two new methods tend to remove noninformative variables more effectively and provide better clustering results than the L(1)-norm approach.

PMID:
17970821
[PubMed - indexed for MEDLINE]
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for Wiley
    Loading ...
    Write to the Help Desk