Format

Send to

Choose Destination
Artif Intell Med. 2009 Jun;46(2):139-54. doi: 10.1016/j.artmed.2008.12.003. Epub 2009 Jan 20.

An application of methods for the probabilistic three-class classification of pregnancies of unknown location.

Author information

1
Department of Electrical Engineering (ESAT-SISTA), Katholieke Universiteit Leuven, Kasteelpark Arenberg 10, B-3001 Leuven, Belgium. bvancals@esat.kuleuven.be

Abstract

OBJECTIVE:

We developed and compared classifiers to predict the outcome of pregnancies of unknown location (PUL). This is a three-class problem, as the possible outcomes are failing PUL, intra-uterine pregnancy (IUP), or ectopic pregnancy (EP). We focused on probabilistic classification because of the importance of uncertainty information in medical decision making.

METHODS AND MATERIALS:

Nine methods were implemented, based on logistic regression (LR), multi-layer perceptrons, least squares support vector machines (LS-SVMs), and kernel logistic regression (KLR). The LR models involved manual checks for the necessity of input transformations or interaction effects. The classifiers were developed on the training set (n=508) and evaluated on the test set (n=348). We used two performance measures that only evaluate discriminatory potential, and two that investigate the exact probabilities and/or discriminatory potential. Classifier comparison was carried out using a ranking method.

RESULTS:

The classifier based on a combination of binary LR models using pairwise coupling (LR-PC) ranked first or second for each performance measure. LR-PC obtained test set areas under the receiver operating characteristic curve of 0.989, 0.988, and 0.924 for the prediction of failing PULs, IUPs, and EPs, respectively. Multi-class LR, multi-class KLR, and a combination of binary Bayesian LS-SVMs with radial basis function kernel were always ranked in the top five. Models with low emphasis on nonlinearity were ranked at the bottom. Importantly, LR-PC also performed better than previously developed classifiers based on multi-class LR.

CONCLUSIONS:

The prediction of PULs was good, most notably for failing PULs and IUPs. Logistic regression models performed remarkably well. Multi-class KLR also performed well, whilst nonlinearity was automatically incorporated and probabilistic outputs were directly given without Bayesian analysis or a combination of binary results. The selected inputs are reasonably objective and easy to obtain in clinical practice. Taken together, this study provided useful decision support tools for implementation in clinical practice.

PMID:
19157812
DOI:
10.1016/j.artmed.2008.12.003
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Elsevier Science
Loading ...
Support Center