Format

Send to

Choose Destination
See comment in PubMed Commons below
AMIA Annu Symp Proc. 2011;2011:625-34. Epub 2011 Oct 22.

Improving predictions in imbalanced data using Pairwise Expanded Logistic Regression.

Author information

1
Division of Biomedical Informatics, University of California, San Diego, La Jolla, CA, USA. x1jiang@ucsd.edu

Abstract

Building classifiers for medical problems often involves dealing with rare, but important events. Imbalanced datasets pose challenges to ordinary classification algorithms such as Logistic Regression (LR) and Support Vector Machines (SVM). The lack of effective strategies for dealing with imbalanced training data often results in models that exhibit poor discrimination. We propose a novel approach to estimate class memberships based on the evaluation of pairwise relationships in the training data. The method we propose, Pairwise Expanded Logistic Regression, improved discrimination and had higher accuracy when compared to existing methods in two imbalanced datasets, thus showing promise as a potential remedy for this problem.

PMID:
22195118
PMCID:
PMC3243279
[Indexed for MEDLINE]
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for PubMed Central
    Loading ...
    Support Center