xiv Introduction to Hypothesis Testing
Formal Hypothesis Testing
Every hypothesis test begins with a statement of two hypotheses, or asser-
tions, about some parameter or set of parameters that are the logical negations
of each other. The rst, by convention, is often called the “null” hypothesis
and is labeled H
0
. The alternative, which is the logical negation of the null
hypothesis, is symbolized in this work by H
1
. The null hypothesis is gener-
ally a statement of something we hope is not true. The idea is that, using
data, we hope to reject the null hypothesis. We reject the null hypothesis if
it is not likely enough to have obtained the observations we made given that
the null hypothesis is true. “Likely enough” is customarily dened as a prob-
ability greater than 5percent. Typically, the null hypothesis is stated in such
a way as to make it possible to assess the limits to a feasible range for sample
statistics given that the null hypothesis is actually true. Note that in elemen-
tary statistics texts the null hypothesis is almost always stated as an equality,
either the value of a parameter between two or more groups, or the equality
of a parameter for a single population to a particular value. In the case of
equivalence and noninferiority, the null is usually stated as an inequality,
the nature of which is always adverse to the experimenter’s general desire.
As an example, consider the hypotheses
H
0
: μ = 10
and its logical negation:
H
1
: μ ≠ 10
Here, the Greek letter μ symbolizes the mean value of some measurement
made on individuals, averaged over all individuals in the population. Note
that it is not possible to calculate the arithmetic average of any measure-
ment for a population with an innite number of members, and it may not
be economically feasible for a population with a nite number of members.
Without elaborating, let us just assume that some population mean exists,
but its value is unknown. Suppose further that we sample n = 15individuals,
and compute the sample mean and the sample standard deviation:
S = 1.7
So, do we believe that the population mean is in fact 10 or not? After all, 9.3
doesn’t seem too far off from 10. Perhaps if we sampled again, we might get
a different sample mean that was greater than 10. Perhaps we cannot afford
to sample more than these 15individuals. Statistical theory tells us how to