Format

Send to

Choose Destination
See comment in PubMed Commons below
Biostatistics. 2008 Jan;9(1):30-50. Epub 2007 Apr 11.

Penalized logistic regression for detecting gene interactions.

Author information

1
Google Inc., 1600 Amphitheatre Parkway, Mountain View, CA 94043, USA. meeyoung@google.com

Abstract

We propose using a variant of logistic regression (LR) with (L)_(2)-regularization to fit gene-gene and gene-environment interaction models. Studies have shown that many common diseases are influenced by interaction of certain genes. LR models with quadratic penalization not only correctly characterizes the influential genes along with their interaction structures but also yields additional benefits in handling high-dimensional, discrete factors with a binary response. We illustrate the advantages of using an (L)_(2)-regularization scheme and compare its performance with that of "multifactor dimensionality reduction" and "FlexTree," 2 recent tools for identifying gene-gene interactions. Through simulated and real data sets, we demonstrate that our method outperforms other methods in the identification of the interaction structures as well as prediction accuracy. In addition, we validate the significance of the factors selected through bootstrap analyses.

PMID:
17429103
DOI:
10.1093/biostatistics/kxm010
[Indexed for MEDLINE]
PubMed Commons home

PubMed Commons

0 comments

    Supplemental Content

    Full text links

    Icon for Silverchair Information Systems
    Loading ...
    Support Center