Statistical Methods in Medical Research

 

Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here to register today!

Sign In to gain access to subscriptions and/or personal tools.
This Article
Right arrow Full Text (OnlineFirst[PDF])
Right arrow Order Full text via Infotrieve
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrow Add to My Marked Citations
Google Scholar
Right arrow Articles by Lee, S.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lee, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
First published on March 28, 2008
Statistical Methods in Medical Research 2008, doi:10.1177/0962280207084839
© 2008 SAGE Publications

Article

Mistakes in validating the accuracy of a prediction classifier in high-dimensional butsmall-sample microarray data

Sunho Lee*

Department of Applied Mathematics, Sejong University, Seoul, SouthKorea

* To whom correspondence should be addressed.


   Abstract

A major interest in gene expression microarray studies is to develop an accurate classifierwhich can be adopted in clinical practice. The usage of large numbers of genes with small data samples may lead to overfitting in classification, and generate promising, but often nonreproducible results. Therefore, assessing the reproducibility of a classifier is necessary. Appropriate methods for validating a developed classifier and estimating its predictingaccuracy are discussed. In addition, some mistakes that can arise in the cross validation process are reviewed using published articles in prominent medical journals, to prevent the indefinite results of a classifier development from leading to inappropriate treatment.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?