Accounting for missing data in statistical analyses: multiple imputation is not always the answer

Rachael A. Hughes*, Jon Heron, Jonathan A.C. Sterne, Kate Tilling

*Corresponding author for this work

Research output: Contribution to journalArticle (Academic Journal)

29 Citations (Scopus)
382 Downloads (Pure)

Abstract

Background: Missing data are unavoidable in epidemiological research, potentially leading to bias and loss of precision. Multiple imputation (MI) is widely advocated as an improvement over complete case analysis (CCA). However, contrary to widespread belief, CCA is preferable to MI in some situations.

Methods: We provide guidance on choice of analysis when data are incomplete. Using causal diagrams to depict missingness mechanisms, we describe when CCA will not be biased by missing data and compare MI and CCA, with respect to bias and efficiency, in a range of missing data situations. We illustrate selection of an appropriate method in practice.

Results: For most regression models, CCA gives unbiased results when the chance of being a complete case does not depend on the outcome after taking the covariates into consideration, which includes situations where data are missing not at random. Consequently, there are situations in which CCA analyses are unbiased whilst MI analyses, assuming missing at random (MAR), are biased. By contrast MI, unlike CCA, is valid for all MAR situations and has the potential to use information contained in the incomplete cases and auxiliary variables to reduce bias and/or improve precision. For this reason, MI was preferred over CCA in our real data example.

Conclusions: Choice of method for dealing with missing data is crucial for validity of conclusions, and should be based on careful consideration of the reasons for the missing data, missing data patterns, and the availability of auxiliary information.
Original languageEnglish
Article numberdyz032
Pages (from-to)1294-1304
Number of pages11
JournalInternational Journal of Epidemiology
Volume48
Issue number4
Early online date16 Mar 2019
DOIs
Publication statusPublished - 1 Aug 2019

Keywords

  • complete case analysis
  • Inverse probability weighting
  • missing data
  • missing data mechanisms
  • missing data patterns
  • multiple imputation

Fingerprint Dive into the research topics of 'Accounting for missing data in statistical analyses: multiple imputation is not always the answer'. Together they form a unique fingerprint.

  • Projects

    Rework of IEU 2 Tilling Programme

    Tilling, K. M.

    1/04/1831/03/23

    Project: Research

    Cite this