Genet Epidemiol. 2001;21 Suppl 1:S626-31.

Sequence analysis using logic regression.

Author information

Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue N, MP-1002, Seattle, WA 98109-1024, USA.


Logic regression is a new adaptive regression methodology that attempts to construct predictors as Boolean combinations of binary covariates. In this paper we apply the algorithm to single-nucleotide polymorphism (SNP) sequence data. The predictors it finds are interpretable as risk factors for the disease. The significance of these risk factors is assessed using techniques such as cross-validation, permutation tests, and independent test sets. These model selection techniques remain valid when the data are dependent, as is the case for the family data used here. In our analysis of the Genetic Analysis Workshop 12 data we identify the exact locations of mutations on gene 1 and gene 6, and a number of mutations on gene 2, that are associated with affected status, without selecting any false positives.
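
To make the idea of a Boolean predictor concrete, the following is a minimal illustrative sketch in Python, not the authors' implementation. It assumes a logic expression is stored as a nested tuple, scores it by misclassification rate, and runs a simple label-permutation test on one fixed expression. The names `evaluate`, `misclassification`, `permutation_p_value`, and `RULE` are hypothetical, the data are simulated toy covariates rather than the Genetic Analysis Workshop 12 data, and the permutation step treats individuals as exchangeable. The search over candidate expressions, which a full logic regression fit would repeat for every permutation, is left out.

```python
import random

# Hypothetical representation of a logic expression as a nested tuple:
#   ("var", i)                         value of binary covariate i
#   ("not", e)                         negation of sub-expression e
#   ("and", e1, e2) / ("or", e1, e2)   conjunction / disjunction
def evaluate(expr, x):
    """Evaluate a Boolean expression on one vector x of 0/1 covariates."""
    op = expr[0]
    if op == "var":
        return bool(x[expr[1]])
    if op == "not":
        return not evaluate(expr[1], x)
    if op == "and":
        return evaluate(expr[1], x) and evaluate(expr[2], x)
    if op == "or":
        return evaluate(expr[1], x) or evaluate(expr[2], x)
    raise ValueError(f"unknown operator: {op!r}")

def misclassification(expr, X, y):
    """Fraction of observations where the predictor disagrees with y."""
    wrong = sum(evaluate(expr, x) != bool(label) for x, label in zip(X, y))
    return wrong / len(y)

def permutation_p_value(expr, X, y, n_perm=200, seed=0):
    """Permutation p-value for a FIXED expression (no model search).

    Assumption: labels are exchangeable across individuals, which
    ignores any family structure in real data.
    """
    rng = random.Random(seed)
    observed = misclassification(expr, X, y)
    y_perm = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(y_perm)
        # Count permutations that score at least as well as the real labels.
        if misclassification(expr, X, y_perm) <= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Toy data: 200 individuals, 4 binary covariates (e.g. SNP indicators).
_rng = random.Random(1)
X = [[_rng.randint(0, 1) for _ in range(4)] for _ in range(200)]

# Hypothetical risk rule: (SNP0 and not SNP1) or SNP3.
RULE = ("or", ("and", ("var", 0), ("not", ("var", 1))), ("var", 3))
y = [int(evaluate(RULE, x)) for x in X]

error = misclassification(RULE, X, y)
p_value = permutation_p_value(RULE, X, y)
```

Because the toy labels are generated from `RULE` itself, the rule fits them perfectly and the permutation p-value is small. On real data the expression would be found by a search, and the permutation test would need to repeat that search on each permuted data set.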

[Indexed for MEDLINE]
