Format

Send to

Choose Destination
AMIA Annu Symp Proc. 2011;2011:625-34. Epub 2011 Oct 22.

Improving predictions in imbalanced data using Pairwise Expanded Logistic Regression.

Author information

1
Division of Biomedical Informatics, University of California, San Diego, La Jolla, CA, USA. x1jiang@ucsd.edu

Abstract

Building classifiers for medical problems often involves dealing with rare, but important events. Imbalanced datasets pose challenges to ordinary classification algorithms such as Logistic Regression (LR) and Support Vector Machines (SVM). The lack of effective strategies for dealing with imbalanced training data often results in models that exhibit poor discrimination. We propose a novel approach to estimate class memberships based on the evaluation of pairwise relationships in the training data. The method we propose, Pairwise Expanded Logistic Regression, improved discrimination and had higher accuracy when compared to existing methods in two imbalanced datasets, thus showing promise as a potential remedy for this problem.

PMID:
22195118
PMCID:
PMC3243279
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for PubMed Central
Loading ...
Support Center