Display Settings:

Format

Send to:

Choose Destination
    BMC Med Res Methodol. 2010 Dec 31;10:112.

    Comparison of imputation methods for handling missing covariate data when fitting a Cox proportional hazards model: a resampling study.

    Source

    Centre for Statistics in Medicine, University of Oxford, Oxford, UK. andrea.marshall@warwick.ac.uk

    Abstract

    BACKGROUND:

    The appropriate handling of missing covariate data in prognostic modelling studies is yet to be conclusively determined. A resampling study was performed to investigate the effects of different missing data methods on the performance of a prognostic model.

    METHODS:

    Observed data for 1000 cases were sampled with replacement from a large complete dataset of 7507 patients to obtain 500 replications. Five levels of missingness (ranging from 5% to 75%) were imposed on three covariates using a missing at random (MAR) mechanism. Five missing data methods were applied; a) complete case analysis (CC) b) single imputation using regression switching with predictive mean matching (SI), c) multiple imputation using regression switching imputation, d) multiple imputation using regression switching with predictive mean matching (MICE-PMM) and e) multiple imputation using flexible additive imputation models. A Cox proportional hazards model was fitted to each dataset and estimates for the regression coefficients and model performance measures obtained.

    RESULTS:

    CC produced biased regression coefficient estimates and inflated standard errors (SEs) with 25% or more missingness. The underestimated SE after SI resulted in poor coverage with 25% or more missingness. Of the MI approaches investigated, MI using MICE-PMM produced the least biased estimates and better model performance measures. However, this MI approach still produced biased regression coefficient estimates with 75% missingness.

    CONCLUSIONS:

    Very few differences were seen between the results from all missing data approaches with 5% missingness. However, performing MI using MICE-PMM may be the preferred missing data approach for handling between 10% and 50% MAR missingness.

    PMID:
    21194416
    [PubMed - indexed for MEDLINE]
    PMCID:
    PMC3019210
    Free PMC Article

    Images from this publication.See all images (6) Free text

    Figure 2
    Figure 4
    Figure 6
    Figure 1
    Figure 3
    Figure 5

      Supplemental Content

      Icon for BioMed Central Icon for PubMed Central

      Save items

      loading

      Recent activity

      Your browsing activity is empty.

      Activity recording is turned off.

      Turn recording back on

      See more...
      Write to the Help Desk