|
Sign In to gain access to subscriptions and/or personal tools.
|
Statistical Methods in Medical Research, Vol. 15, No. 6,
525-545 (2006)
DOI: 10.1177/0962280206070650
© 2006 SAGE Publications
Correlation coefficients in medical research: from product moment correlation to the odds ratio
Helena Chmura Kraemer
Department of Psychiatry and Behavioral Sciences, Stanford University, 401 Quarry Road, MC5717 Stanford, CA 94305, USA, hck{at}stanford.edu
Objective: Presentation of effect sizes that can be interpreted in terms of clinical or practical significance is currently urged whenever statistical significance (a p-value) is reported in research journals. However, which effect size and how to interpret it are not yet clearly delineated. The present focus is on effect sizes indicating strength of correlation, that is, effect sizes that describe the strength of monotonic association between two random variables X and Y in a population.
Methods: A logical structure of measures of association is traced, showing the interrelationships among the many measures of association. Advantages and disadvantages of each are discussed.
Conclusions: Suggestions are made for the future use of measures of association in research to facilitate considerations of clinical significance, emphasizing distribution-free effect sizes such as the Spearman correlation coefficient and Kendalls coefficient of concordance for ordinal versus ordinal associations, weighted and intraclass kappa for binary versus binary associations and risk difference (RD) for binary versus ordinal association.
References
- Cohen J. The earth is round (p < 0.05) . American Psychologist 1995; 49: 997-1003 .[CrossRef]
- Dar R, Serlin RC, Omer H. Misuse of statistical tests in three decades of psychotherapy research . Journal of Consulting and Clinical Research 1994; 62: 75-82 .
- Harris RJ. Significance tests have their place . Psychological Science 1997; 8(1): 8-11 .[CrossRef]
- Hunter JE. Needed: a ban on the significance test . Psychological Science 1997; 8(1): 3-7 .[CrossRef][ISI]
- Schmidt FL. Statistical significance testing and cumulative knowledge in psychology: implications for training of researchers . Psychological Methods 1996; 1(2): 115-129 .[CrossRef][ISI]
- Shrout PE. Should significance tests be banned? Introduction to a special section exploring the pros and cons . Psychological Science 1997; 8(1): 1-2 .[Medline]
[Order article via Infotrieve]
- Wilkinson L. The task force on statistical inference. Statistical methods in psychology journals: guidelines and explanations . American Psychologist 1999; 54: 594-604 .[CrossRef]
- Rennie D. How to report randomized controlled trials: the CONSORT statement . Journal of the American Medical Association 1996; 276(8): 649-649 .[CrossRef][ISI][Medline]
[Order article via Infotrieve]
- Altman DG, Schulz KF, Hoher D, Egger M, Davidoff F, Elbourne D, Gotzsche PC, Lang T, Consort Group. The revised CONSORT statement for reporting randomized trials: explanation and elaboration . Annals of Internal Medicine 2001; 134(8): 663-694 .[Abstract/Free Full Text]
- Begg C, Cho M, Eastwood S, Horton R, Homer D, Olkin I, Pitkin R, Rennie D, Schultz KF, Simel D, Stroup DF. Improving the quality of reporting of randomized controlled trials: the CONSORT statement . Journal of the American Medical Association 1999; 276: 637-639 .
- Grissom RJ, Kim JJ. Effect sizes for research. Lawrence Erlbaum Associates , 2005.
- Kraemer HC, Morgan GA, Leech NL, Gilner JA, Vaske JJ, Harmon RJ. Measures of clinical significance . Journal of the American Academy of Child and Adolescent Psychiatry 2003; 42(12): 1524-1529 .[CrossRef][ISI][Medline]
[Order article via Infotrieve]
- Kraemer HC, Kupfer DJ. Size of treatment effects and their importance to clinical research and practice . Biological Psychiatry 2006; 59(11): 990-996 .[CrossRef][ISI][Medline]
[Order article via Infotrieve]
- Fisher RA. Frequency distribution of the values of the correlation coefficient in samples from an indefinitely large population . Biometrika 1915; 10: 507-521 .[Free Full Text]
- Fisher RA. On the probable error of a coefficient of correlation deduced from a small sample . Metron 1921; 1: 1-32 .
- Pearson K. On a new method of determining the correlation between a measured character A and a character B, of which only the percentage of cases wherin B exceeds (or falls short of) a given intensity is recorded for each grade of A . Biometrika 1909; 7: 96-105 .[Free Full Text]
- Kraemer HC. Reconsidering the odds ratio as a measure of 2 x 2 association in a population . Statistics in Medicine 2003; 23(2): 257-270 .
- Sackett DL. Down with odds ratios! Evidence-Based Medicine 1996; 1: 164-166 .
- Rothman KJ, Greenland S. Modern epidemiology. Williams & Wilkins , 1998.
- Kendall MG, Buckland WR. A dictionary of statistical terms, fourth edition. Longman , 1982.
- Last JM. A dictionary of epidemiology. Oxford University Press , 1995.
- Krantz DH. The null hypothesis testing controversy in psychology . Journal of the American Statistical Association 1999; 44(448): 1372-1381 .[CrossRef]
- Meehl PE. Theory testing in psychology and physics: a methodological paradox . Philosophy of Science 1967; 34: 103-115 .[CrossRef][ISI]
- Jones LV, Tukey JW. A sensible formulation of the significance test . Psychological Methods 2000; 5(4): 411-414 .[CrossRef][ISI][Medline]
[Order article via Infotrieve]
- Efron B. Bootstrap methods: another look at the jackknife . The Annals of Statistics 1979; 7: 1-26 .
- Efron B, Gong G. A leisurely look at the bootstrap, the jackknife, and cross-validation . The American Statistician 1983; 37: 36-48 .[CrossRef]
- Efron B. Bootstrap confidence intervals: good or bad? Psychological Bulletin 1988; 104(2): 293-296 .[CrossRef]
- David FN. Tables of the ordinates and probability integral of the distribution of the correlation coefficient in small samples. Cambridge University Press , 1938.
- Fisher RA. Statistical methods for research workers, second edition. Oliver & Boyd , 1928.
- Edgell SE, Noon SM. Effect of violation of normality on the t-test of the correlation . Psychological Bulletin 1984; 95: 576-583 .[CrossRef]
- Kraemer HC. Robustness of the distribution theory of the product-moment correlation coefficient . Journal of Educational Statistics 1980; 5(2): 115-128 .
- Kowalski CJ. On the effects on non-normality on the distribution of the sample correlation coefficient . The Journal of the Royal Statistical Society 1972; 21: 1-12 .
- Bartko JJ. The intraclass correlation coefficient as a measure of reliability . Psychological Reports 1966; 19: 3-11 .[Medline]
[Order article via Infotrieve]
- Bartko JJ. On various intraclass correlation reliability coefficients . Psychological Bulletin 1976; 83: 762-765 .[CrossRef][ISI]
- Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability . Psychological Bulletin 1979; 86: 420-428 .[CrossRef][ISI]
- Algina J. Comment on Bartkos On various intraclass correlation reliability coefficients . Psychological Bulletin 1978; 85(1): 135-138 .[CrossRef][ISI]
- Tate RF. Correlation between a discrete and a continuous variable. Point-biserial correlation . Annals of Mathematical Statistics 1954; 25: 603-607 .[ISI]
- Tate RF. Applications of correlation models for biserial data . American Statistical Association Journal 1955; 50: 1078-1095 .[CrossRef]
- Olkin I, Tate RF. Multivariate correlation models with mixed discrete and continuous . Annals of Mathematical Statistics 1961; 32: 448-465 .
- Mayer LS. Estimating a correlation coefficient when one variable is not directly observed . Journal of the American Statistical Association 1973; 68: 420-421 .[CrossRef]
- Olsson U, Drasgow F, Dorans NJ. The polyserial correlation coefficient . Psychometrika 1982; 47: 337-347 .[CrossRef][ISI]
- Lee S, Poon W. Maximum likelihood estimation of polyserial correlations . Psychometrika 1986; 51: 113-121 .[CrossRef]
- Bedrick EJ, Breslin FC. Estimating the polyserial correlation coefficient . Psychometrika 1996; 61(3): 427-443 .[CrossRef]
- Pearson K. On the correlation of characters not quantitatively measurable . Philosophical Transactions of the Royal Society of London 1901; 195A: 1-47 .
- Pearson K. On the probable error of a coefficient of correlation as found from a fourfold table . Biometrika 1913; 9: 22-27 .[Free Full Text]
- Kraemer HC. What is the right statistical measure of twin concordance (or diagnostic reliability and validity)? Archives of General Psychiatry 1997; 54: 1121-1124 .[ISI][Medline]
[Order article via Infotrieve]
- Lyons MJ, True WR, Eisen SA, Goldberg J, Meyer JM, Faraone SV, Eaves LF, Tsuang MT. Differential heritability of adult and juvenile antisocial traits . Archives of General Psychiatry 1995; 52: 906-915 .[Abstract]
- Lyons MF, Faraone SV, Tsuang MT, Goldberg J, Ramakrishnan V, Eaves LJ, Meyer JM, True WR, Eisen SA. Another view on the right statistical measure of twin concordance . Archives of General Psychiatry 1997; 54: 1126-1128 .[ISI][Medline]
[Order article via Infotrieve]
- Cohen J. Statistical power analysis for the behavioral sciences. Lawrence Erlbaum Associates , 1988.
- Kraemer HC. A Simple effect size indicator for two-group comparisons?: a comment on requivalent . Psychological Methods 2006; 10(4): 413-419 .[CrossRef]
- Rosenthal R, Rubin DB. requivalent: a simple effect size indicator . Psychological Methods 2003; 8(4): 492-496 .[CrossRef][ISI][Medline]
[Order article via Infotrieve]
- Spearman C. The proof and measurement of association between two things . American Journal of Psychology 1904; 15: 72-101 .[CrossRef]
- Spearman C. A footrule for measuring correlation . British Journal of Psychology 1906; 2: 89-108 .
- Fieller EC, Hartley HO, Pearson ES. Tests for rank correlation coefficients. I . Biometrika 1957; 44: 470-481 .[Free Full Text]
- Fieller EC, Pearson ES. Tests for rank correlation coefficients. II . Biometrika 1961; 48: 29-40 .[Free Full Text]
- Kraemer HC. On estimation and hypothesis testing problems for correlation coefficients . Psychometrika 1975; 40(4): 473-485 .[CrossRef][ISI]
- Zar JH. Significance testing of the Spearman rank correlation . Journal of the American Statistical Association 1972; 67: 578-585 .[CrossRef]
- Cureton EE. Rank-biserial correlation . Psychometrika 1956; 21: 287-290 .[CrossRef][ISI]
- Kendall MG. Rank correlation methods. Hafner Publishing Company , 1962.
- Kendall M, Gibbons JD. Rank correlation methods, fifth edition. Oxford University Press , 1990.
- Gibbons JD. Nonparametric statistics: an introduction. Sage Publications , 1993.
- Kraemer HC. The small sample non-null properties of Kendalls coefficient of concordance for normal populations . Journal of the American Statistical Association 1976; 71: 608-613 .[CrossRef]
- Kraemer HC. A measure of 2 x 2 association with stable variance and approximately normal small sample distribution: planning cost-effective studies . Biometrics 1985; 42: 359-370 .
- Fleiss JL. Statistical methods for rates and proportions. John Wiley & Sons , 1981.
- Kraemer HC, Periyakoil VS, Noda A. Tutorial in biostatistics: kappa coefficients in medical research . Statistics in Medicine 2002; 21: 2109-2129 .[CrossRef][ISI][Medline]
[Order article via Infotrieve]
- Kraemer HC. Measurement of reliability for categorical data in medical research . Statistical Methods in Medical Research 1992; 1: 183-199 .[Medline]
[Order article via Infotrieve]
- Kraemer HC. Ramifications of a population model for k as a coefficient of reliability . Psychometrika 1979; 44(4): 461-472 .[CrossRef][ISI]
- Kendall MG. A new measure of rank correlation . Biometrika 1938; 30: 91-93 .
- Arndt S, Turvey C, Andreason NC. Correlating and predicting psychiatric symptom ratings: Spearmans r versus Kendalls tau correlation . Journal of Psychiatric Research 1999; 33(2): 97-104 .[CrossRef][ISI][Medline]
[Order article via Infotrieve]
- Brogden HE. A new coefficient: Application to biserial correlation and to estimation of selective efficiency . Psychometrika 1949; 14: 169-182 .[CrossRef][ISI]
- Lord FM. Biserial estimates of correlation . Psychometrika 1963; 28: 81-85 .[CrossRef][ISI]
- Kraemer HC. Modified biserial correlation coefficients . Psychometrika 1981; 46: 275-282 .[CrossRef]
- Bedrick EJ. On the large sample distributions of modified sample biserial . Psychometrika 1990; 55: 217-228 .[CrossRef]
- Cohen J. Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit . Psychological Bulletin 1968; 70: 213-229 .[CrossRef][ISI]
- Bloch DA, Kraemer HC. 2 x 2 kappa coefficients: measures of agreement or association . Biometrics 1989; 45: 269-287 .[CrossRef][ISI][Medline]
[Order article via Infotrieve]
- Kraemer HC, Kazdin AE, Offord DR, Kessler RC, Jensen PS, Kupfer DJ. Measuring the potency of a risk factor for clinical or policy significance . Psychological Methods 1999; 4(3): 257-271 .[CrossRef]
- Kraemer HC. Evaluating medical tests: objective and quantitative guidelines. Sage Publications , 1992.
- McNeil BJ, Keeler E, Adelstein SJ. Primer on certain elements of medical decision making . The New England Journal of Medicine 1975; 293: 211-215 .[Abstract]
- Swets JA, Pickett RM. Evaluation of diagnostic systems: methods from signal detection theory. Academic Press , 1982.
- Brownie C. Estimating Pr(X < Y)in categorized data using ROC analysis . Biometrics 1988; 44: 615-621 .[CrossRef]
- Grissom RJ. Probability of the superior outcome of one treatment over another . Journal of Applied Psychology 1994; 79: 314-316 .[CrossRef]
- Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve . Radiology 1982; 143: 29-36 .[Abstract/Free Full Text]
- McGraw KO, Wong SP. A common language effect size statistic . Psychological Bulletin 1992; 111: 361-365 .[CrossRef][ISI]
- Acion L, Peterson JJ, Temple S, Arndt S. Probabilistic index: an intuitive non-parametric approach to measuring the size of treatment effects . Statistics in Medicine 2006; 25(4): 591-602 .[CrossRef][ISI][Medline]
[Order article via Infotrieve]
- Hsu LM. Biases of success rate differences shown in binomial effect size displays . Psychological Bulletin 2004; 9(2): 183-197 .
- Altman DG, Andersen K. Calculating the number needed to treat for trials where the outcome is time to an event . British Medical Journal 1999; 319: 1492-1495 .[Free Full Text]
- Cook RJ, Sackett DL. The number needed to treat: a clinically useful measure of treatment effect . British Medical Journal 1995; 310: 452-454 .[Free Full Text]
- Altman DG. Confidence intervals for the number needed to treat . British Medical Journal 1998; 317(7168): 1309-1312 .[Free Full Text]
- Newcombe RG. Confidence intervals for the number needed to treat-absolute risk reduction is less likely to be misunderstood . British Medical Journal 1999; 318: 1765-1765 .[Free Full Text]
- Duncan BW, Olkin I. Bias of estimates of the number needed to treat . Statistics in Medicine 2005; 24: 1837-1848 .[CrossRef][ISI][Medline]
[Order article via Infotrieve]
- Efron B. Computer-intensive methods in statistical regression. Technical Report #174, Division of Biostatistics, Stanford University , April 1995.
- Brown MB, Benedetti JK. Sampling behavior of tests for correlation in two-way contigency . Journal of the American Statistical Association 1977; 72: 309-315 .[CrossRef]
- Rosenthal I. Distribution of the sample version of the measure of association, Gamma . Journal of the American Statistical Association 1966; 61: 440-453 .[CrossRef]
- Ruben H. Non-central chi-square and gamma revisited . Communications in Statistics 1974; 3: 607-633 .
- Newcombe RG. A deficiency of the odds ratio as a measure of effect size . Statistics in Medicine, inpress.
- Kirk DB. On the numerical approximation of the bivariate normal (tetrachoric) correlation coefficient . Psychometrika 1973; 38(2): 259-268 .
- Cornfield J. A statistical problem arising from retrospective studies . In Neyman J, ed. Proceedings of the Third Berkeley Symposium. University of California Press, 1956: 135-135 .
- Cornfield J. A method of estimating comparative rates from clinical data. Applications to cancer of the lung, breast and cervix . Journal of the National Cancer Institute 1951; 11: 1269-1275 .[ISI][Medline]
[Order article via Infotrieve]
- Mantel N, Haenszel W. Statistical aspects of the analysis of data from retrospective studies of disease . Journal of the National Cancer Institute 1959; 22: 719-748 .[ISI][Medline]
[Order article via Infotrieve]

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
|