SAGE Journals Online
Advertisement
Sign In to gain access to subscriptions and/or personal tools.

 

Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Advertisement

Sign In to gain access to subscriptions and/or personal tools.
Statistical Methods in Medical Research
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Poon, W.-Y.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Poon, W.-Y.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Identifying influential observations in discriminant analysis

Wai-Yin Poon

Department of Statistics, The Chinese University of Hong Kong, Shatin, Hong Kong, wypoon{at}cuhk.edu.hk

Linear discriminant analysis has been widely applied in medical studies where atypical observations in a data set are usually encountered. While it is well known that the estimation in linear discriminant analysis can be conducted by using regression with dummy variates, typical regression diagnostic statistics cannot be applied to identify influential observations in discriminant analysis because these statistics are not invariant with regard to the codings of the dummy variates. We propose that regression model diagnostic measures developed from the local influence perspective can be used for identifying observations in a data set that exert undue influence on the result of the linear discriminant analysis. The measures are functions of the usual regression diagnostic statistics, such as leverage and residual, but are independent of the choice of the values of the dummy variate. They are local versions of Cook’s distance-type diagnostic statistic and the advantage of the measures lies in its ability in detecting a group rather than a single influential observation. The performance of the proposed measures are illustrated by analyses of three medical data sets and is compared with other diagnostic measures available in the literature. The results indicate that the proposed measures are simple and yet efficient discriminant diagnostic quantities. It is also observed from empirical evidence that a data point which is a multivariate outlier may not be influential in linear discriminant analysis.

Statistical Methods in Medical Research, Vol. 13, No. 4, 291-308 (2004)
DOI: 10.1191/0962280204sm367ra


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?




Advertisement