Statistical Methods in Medical Research

 

Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here for more information

Sign In to gain access to subscriptions and/or personal tools.
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
0962280206071842v1
16/6/539    most recent
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Seo Young Kim,
Right arrow Articles by Won Lee, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Seo Young Kim,
Right arrow Articles by Won Lee, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
This version was published on December 1, 2007
Statistical Methods in Medical Research, Vol. 16, No. 6, 539-564 (2007)
DOI: 10.1177/0962280206071842
© 2007 SAGE Publications

Ensemble clustering method based on the resampling similarity measure for gene expression data

Seo Young Kim

Research Institute for Basic Science, Chonnam National University

Jae Won Lee

Department of Statistics, Korea University, Seoul, Korea, jael{at}korea.ac.kr

The rapid development of microarray technologies enabled the monitoring of expression levels of thousands of genes simultaneously. Microarray technology has great potential for creating an enormous amount of data in a short time, and now becomes a new tool for studying such broad problems as classification of tumors in biology and medical science. Many statistical methods are available for analysing and systematizing these complex data into meaningful information, and one of the main goals in analysing gene expression data is the detection of samples or genes with similar expression patterns. In this paper, we developed a new clustering method of class discovery in a dataset. The performances of the new and existing methods were compared using both simulated data and real gene expression data. The proposed method was generally found to give more accurate cluster numbers and cluster assignments for individual objects than the three well-known general clustering methods such as agglomerative and divisive hierarchical clustering (HC) and self-organizing map (SOM). It also gave better results than the three consensus clustering methods based on agglomerative and divisive HC and SOM.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?