Statistical Methods in Medical Research

 

Statistical Methods in Medical Research, Vol. 15, No. 6, 525-545 (2006)
DOI: 10.1177/0962280206070650
© 2006 SAGE Publications

Correlation coefficients in medical research: from product moment correlation to the odds ratio

Helena Chmura Kraemer

Department of Psychiatry and Behavioral Sciences, Stanford University, 401 Quarry Road, MC5717 Stanford, CA 94305, USA, hck{at}stanford.edu

Objective: Presentation of effect sizes that can be interpreted in terms of clinical or practical significance is currently urged whenever statistical significance (a ‘p-value’) is reported in research journals. However, which effect size and how to interpret it are not yet clearly delineated. The present focus is on effect sizes indicating strength of correlation, that is, effect sizes that describe the strength of monotonic association between two random variables X and Y in a population.

Methods: A logical structure of measures of association is traced, showing the interrelationships among the many measures of association. Advantages and disadvantages of each are discussed.

Conclusions: Suggestions are made for the future use of measures of association in research to facilitate considerations of clinical significance, emphasizing distribution-free effect sizes such as the Spearman correlation coefficient and Kendall’s coefficient of concordance for ordinal versus ordinal associations, weighted and intraclass kappa for binary versus binary associations, and risk difference (RD) for binary versus ordinal associations.
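As a rough illustration of three of the distribution-free effect sizes named above, the following Python sketch computes a Spearman rank correlation, a kappa for a 2 x 2 table, and a risk difference. It is a minimal sketch and not the article's own procedure: the function names, the cell labels a, b, c, d and the example counts are all hypothetical, and the kappa shown is the plain unweighted Cohen form rather than the weighted or intraclass variants the abstract recommends.

```python
from typing import Sequence


def average_ranks(values: Sequence[float]) -> list[float]:
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Product moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation: the product moment correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def cohen_kappa_2x2(a: int, b: int, c: int, d: int) -> float:
    """Unweighted kappa for a 2 x 2 table (assumed cell layout).

    a: X+ and Y+, b: X+ and Y-, c: X- and Y+, d: X- and Y-.
    """
    n = a + b + c + d
    observed = (a + d) / n
    expected = ((a + b) * (a + c) + (c + d) * (b + d)) / n ** 2
    return (observed - expected) / (1 - expected)


def risk_difference(events_1: int, n_1: int, events_0: int, n_0: int) -> float:
    """Difference in the proportion with the outcome between two groups."""
    return events_1 / n_1 - events_0 / n_0


if __name__ == "__main__":
    # Monotonic but non-linear data: the rank-based measure is unaffected by the outlier.
    print(spearman([1, 2, 3, 4, 5], [2, 4, 6, 8, 100]))
    print(cohen_kappa_2x2(40, 10, 10, 40))
    print(risk_difference(30, 100, 10, 100))
```

Because Spearman's coefficient depends only on ranks, any monotonic transformation of X or Y leaves it unchanged, which is the sense in which such measures are distribution-free.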

