COMPARISON OF 2 SAMPLES WITH WILCOXON TESTS

Comparison of 2 samples with a Wilcoxon rank-sum test

“Blah blah blah Flo! You talk nice and pretty, but our data are crappy: we have less than 30 records per sample, and on top of that it is nowhere near a Normal distribution! What should we do then, Big Cheese?!”

Well, first, stop calling me “Big Cheese”, and give me a minute to regroup after this vicious hit to my perfect data world…

* One minute later…

Ok, let’s take care of this. In the case of small sample sizes with strange distributions, we need to use a non-parametric test (i.e. that does not make inferences about the parameters of the distribution, such as the mean). One such test is the Wilcoxon rank-sum test (a.k.a. Mann–Whitney U test or Wilcoxon–Mann–Whitney test). This test, rather than comparing the values of the records in two samples, will compare the ranks of those values once sorted all together. Basically, it helps you look at if, on average, values of one sample are higher than the values of the other sample. Instead of comparing the means, we are comparing the medians. Let’s take a look at our size data again:

wilcox.test(men, women)
        Wilcoxon rank sum test with continuity correction
data:  men and women
W = 74.5, p-value = 0.00243
alternative hypothesis: true location shift is not equal to 0
Warning message:
In wilcox.test.default(men, women) : cannot compute exact p-value with ties

(Noticed how the formulation to call the test is similar? This is actually a pattern that is quite common among statistical tests in R)

Once again, we detect the difference between the 2 groups. Concerning the error message, do not worry, it simply indicates that the value of P could not be exactly calculated because some values are the same in our two samples, leading to ties in the rank assigned to each record.

Comparison of 2 samples with a Wilcoxon signed-rank test for paired data

As previously mentioned with the t-test, you can… no, you should use a paired test when your data are paired! In this case, the test is called a Wilcoxon signed-rank test.

No detection of an effect if pairing is not considered:

wilcox.test(IQbefore,IQafter)
        Wilcoxon rank sum test with continuity correction
data:  IQbefore and IQafter
W = 37.5, p-value = 0.5992
alternative hypothesis: true location shift is not equal to 0
Warning message:
In wilcox.test.default(IQbefore, IQafter) :
  cannot compute exact p-value with ties

But magic happens as soon as we take the pairing into account:

wilcox.test(IQbefore,IQafter, paired=T)
        Wilcoxon signed rank test with continuity correction
data:  IQbefore and IQafter
V = 36, p-value = 0.01368
alternative hypothesis: true location shift is not equal to 0
Warning message:
In wilcox.test.default(IQbefore, IQafter, paired = T) :
  cannot compute exact p-value with ties

COMPARISON OF 2 SAMPLES WITH WILCOXON TESTS

Comparison of 2 samples with a Wilcoxon rank-sum test

Comparison of 2 samples with a Wilcoxon signed-rank test for paired data

INTRODUCTION

KRUSKAL-WALLIS RANK SUM TEST

FISHER’S EXACT TEST

STUDENT’S T-TESTS

BINOMIAL TEST

CORRELATION AND REGRESSION

ANOVA

CHI SQUARE TEST

CONCLUSION

Found this site useful? Please spread the word :)