SAGE Journals Online
Advertisement
Sign In to gain access to subscriptions and/or personal tools.

 

Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Advertisement

Sign In to gain access to subscriptions and/or personal tools.
Statistical Methods in Medical Research
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (13)
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Ambler, G.
Right arrow Articles by Royston, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ambler, G.
Right arrow Articles by Royston, P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

A comparison of imputation techniques for handling missing predictor values in a risk model with a binary outcome

Gareth Ambler

Department of Statistical Science, University College London/Joint UCLH/UCL Biomedical Research Unit, London, UK, g.ambler{at}ucl.ac.uk

Rumana Z Omar

Department of Statistical Science, University College London/Joint UCLH/UCL Biomedical Research Unit, London, UK

Patrick Royston

MRC Clinical Trials Unit, London, UK

Risk models that aim to predict the future course and outcome of disease processes are increasingly used in health research, and it is important that they are accurate and reliable. Most of these risk models are fitted using routinely collected data in hospitals or general practices. Clinical outcomes such as short-term mortality will be near-complete, but many of the predictors may have missing values. A common approach to dealing with this is to perform a complete-case analysis. However, this may lead to overfitted models and biased estimates if entire patient subgroups are excluded. The aim of this paper is to investigate a number of methods for imputing missing data to evaluate their effect on risk model estimation and the reliability of the predictions. Multiple imputation methods, including hotdecking and multiple imputation by chained equations (MICE), were investigated along with several single imputation methods. A large national cardiac surgery database was used to create simulated yet realistic datasets. The results suggest that complete case analysis may produce unreliable risk predictions and should be avoided. Conditional mean imputation performed well in our scenario, but may not be appropriate if using variable selection methods. MICE was amongst the best performing multiple imputation methods with regards to the quality of the predictions. Additionally, it produced the least biased estimates, with good coverage, and hence is recommended for use in practice.

Statistical Methods in Medical Research, Vol. 16, No. 3, 277-298 (2007)
DOI: 10.1177/0962280206074466


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
The GerontologistHome page
S.-I. Hong
Understanding Patterns of Service Utilization Among Informal Caregivers of Community Older Adults
Gerontologist, July 2, 2009; (2009) gnp105v1.
[Abstract] [Full Text] [PDF]


Home page
Arterioscler. Thromb. Vasc. Bio.Home page
J. Butler, A. Kalogeropoulos, V. Georgiopoulou, N. de Rekeneire, N. Rodondi, A. L. Smith, U. Hoffmann, A. Kanaya, A. B. Newman, S. B. Kritchevsky, et al.
Serum Resistin Concentrations and Risk of New Onset Heart Failure in Older Persons: The Health, Aging, and Body Composition (Health ABC) Study
Arterioscler. Thromb. Vasc. Biol., July 1, 2009; 29(7): 1144 - 1149.
[Abstract] [Full Text] [PDF]


Home page
Ann. Thorac. Surg.Home page
F. Farjah, D. R. Flum, T. K. Varghese Jr, R. G. Symons, and D. E. Wood
Surgeon Specialty and Long-Term Survival After Pulmonary Resection for Lung Cancer
Ann. Thorac. Surg., April 1, 2009; 87(4): 995 - 1006.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Clin. Nutr.Home page
D. A Wagstaff, S. Kranz, and O. Harel
A preliminary study of active compared with passive imputation of missing body mass index values among non-Hispanic white youths
Am. J. Clinical Nutrition, April 1, 2009; 89(4): 1025 - 1030.
[Abstract] [Full Text] [PDF]


Home page
Eur. J. Cardiothorac. Surg.Home page
M. K. Ferguson, J. Siddique, and T. Karrison
Modeling major lung resection outcomes using classification trees and multiple imputation techniques
Eur. J. Cardiothorac. Surg., November 1, 2008; 34(5): 1085 - 1089.
[Abstract] [Full Text] [PDF]


Home page
NEJMHome page
M. F. Dawwas, C. J. Watson, A. E. Gimson, R. D. Anbar, S. C. Sweet, C. Benden, O. Elidemir, T. G. Liou, F. R. Adler, and D. R. Cox
Lung Transplantation and Survival in Children with Cystic Fibrosis
N. Engl. J. Med., April 17, 2008; 358(16): 1753 - 1755.
[Full Text] [PDF]


Home page
Occup. Environ. Med.Home page
M Carder, R McNamee, I Beverland, R Elton, M Van Tongeren, G R Cohen, J Boyd, W MacNee, and R M Agius
Interacting effects of particulate pollution and cold temperature on cardiorespiratory mortality in Scotland
Occup. Environ. Med., March 1, 2008; 65(3): 197 - 204.
[Abstract] [Full Text] [PDF]



Advertisement