Format

Send to

Choose Destination
Bioinformatics. 2010 Sep 1;26(17):2183-9. doi: 10.1093/bioinformatics/btq354. Epub 2010 Jul 13.

Logic Forest: an ensemble classifier for discovering logical combinations of binary markers.

Author information

1
Division of Biostatistics and Epidemiology, Medical University of South Carolina, 135 Cannon St., Charleston, SC, USA. wolfb@musc.edu

Abstract

MOTIVATION:

Highly sensitive and specific screening tools may reduce disease -related mortality by enabling physicians to diagnose diseases in asymptomatic patients or at-risk individuals. Diagnostic tests based on multiple biomarkers may achieve the needed sensitivity and specificity to realize this clinical gain.

RESULTS:

Logic regression, a multivariable regression method predicting an outcome using logical combinations of binary predictors, yields interpretable models of the complex interactions in biologic systems. However, its performance degrades in noisy data. We extend logic regression for classification to an ensemble of logic trees (Logic Forest, LF). We conduct simulation studies comparing the ability of logic regression and LF to identify variable interactions predictive of disease status. Our findings indicate LF is superior to logic regression for identifying important predictors. We apply our method to single nucleotide polymorphism data to determine associations of genetic and health factors with periodontal disease.

AVAILABILITY:

LF code is publicly available on CRAN, http://cran.r-project.org/.

PMID:
20628070
PMCID:
PMC3025651
DOI:
10.1093/bioinformatics/btq354
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Silverchair Information Systems Icon for PubMed Central
Loading ...
Support Center