Statistical Methods in Medical Research

 

Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here to register today!

Click here to register today!

Sign In to gain access to subscriptions and/or personal tools.
This Article
Right arrow Full Text (OnlineFirst[PDF])
Right arrow Order Full text via Infotrieve
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrow Add to My Marked Citations
Google Scholar
Right arrow Articles by Hansson, L.
Right arrow Articles by Khamis, H. J
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hansson, L.
Right arrow Articles by Khamis, H. J
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
First published on March 28, 2008
Statistical Methods in Medical Research 2008, doi:10.1177/0962280207082348
© 2008 SAGE Publications

Article

Matched samples logistic regression in case-control studies with missing values: when tobreak the matches

Lisbeth Hansson1 and Harry J Khamis2*

1 Department of Information Sciences, Uppsala University, Uppsala, Sweden
2 Statistical Consulting Center, Wright State University, Dayton, OH, USA

* To whom correspondence should be addressed.


   Abstract

Simulated data sets are used to evaluate conditional and unconditional maximum likelihood estimation in an individual case-control design with continuous covariates when there aredifferent rates of excluded cases and different levels of other design parameters. The effectiveness of the estimation procedures is measured by method bias, variance of the estimators, root mean square error (RMSE) for logistic regression and the percentage of explained variation. Conditional estimation leads to higher RMSE than unconditional estimation in the presence of missing observations, especially for 1:1 matching. The RMSE ishigher for the smaller stratum size, especially for the 1:1 matching. The percentage of explained variation appears to be insensitive to missing data, but is generally higher for the conditional estimation than for the unconditional estimation. It is particularly good for the 1:2 matching design. For minimizing RMSE, a high matching ratio is recommended; in this case, conditional and unconditional logistic regression models yield comparable levels of effectiveness. For maximizing the percentage of explained variation, the 1:2 matching design with the conditional logistic regression model is recommended.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?