51Judging Rank Sum Differences

Let's look at a formulaic method that corresponds to what we just did (and is named after the originators, Mann, Whitney, and Wilcoxon). We'll be testing for a rank sum difference between 30 randomly selected males and 30 randomly selected females. The female sample contains an extreme outlier. We'll look at the statistical analysis results produced by statistical analysis software, and walk through what it did.

But first, look at Table 51.1 to see how the standard t-test is bamboozled by the unruly data: Notice the impact of the outlier on the female sample mean, variance, number of standard errors, and -value. This is what happens when you have unruly scaled data and use methods from Parts II, III, and IV.

Table 51.1 An inappropriate -test.

-Test for mean difference—Whoops!
Males	Females
Mean	15.5	119.3333
Variance	77.5	296083.3
Observations	30	30
Hypothesized mean difference	0
df	59
(#SEs)	−1.04504
-value, two-tail	0.304637

Get Illuminating Statistical Analysis Using Scenarios and Simulations now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Illuminating Statistical Analysis Using Scenarios and Simulations by Jeffrey E. Kottemann

51Judging Rank Sum Differences

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly