A sizable volume of scientific efforts involve questions on a given
population parameter. While we estimate/calculate an interval to contain
an unknown population parameter with a probability of 100%
under the heading of Confidence interval estimation, here in Hypothesis
testing, we question the viability of a given value as the value of our
unknown population parameter.
So, what we do is to check for the validity of a claim about an unknown, in formal terms.
A statistical hypothesis is a statement about the numerical value of a parameter. The null hypothesis, denoted H0, represents the hypothesis that is assumed to be true unless the data provide convincing counter evidence. This usually represents the status quo or some claim about the parameter that the researcher states.
The alternative hypothesis, denoted H1, represents the hypothesis that will be maintained only if the data provide convincing evidence for its truth.
The test statistic is a sample statistic, computed from information provided in the sample, that the researcher uses to decide between the null and alternative hypotheses.
A Type I error occurs if the researcher rejects the null hypothesis in favor of the alternative hypothesis when, in fact, the null hypothesis is true. The probability of Type I error is denoted as α.
The rejection region of a statistical test is the set of possible values of the test statistic for which the researcher will reject the null hypothesis in favor of the alternative.
A Type II error occurs if the researcher fails to reject the null hypothesis when, in fact, the null hypothesis is false. The probability of Type II error is denoted as β.
7.1 EXERCISES ___________________________________________________________
For each of the situations below write the null and alternative hypotheses, corresponding to the test, in plain English; then write the null and alternative hypotheses using only mathematical symbols; then state what the symbols you used above represents:
ii. We would like to test if the average income of males is greater than the average income of females.
iii. It is claimed that, among the people who drinks at least 2 liters of water every day, the percentage of those with a kidney problem is less than 5%. We suspect the truth of this statement and would like to test it.
v. We would like to test if a coin is not a fair coin.
Solution: This exercise is left as self-study.
Consider:
H0 : μ ≤ μ0
H1 : μ > μ0
and i=1n, a random sample of n observations from a normal population
N
with known σ2.
At the statistical significance level α:
H0 is rejected if
|
7.2 EXERCISES ___________________________________________________________
A researcher wonders if the mean wage rate of workers in Ankara is greater than 6500. The population variance is known to be 1000000. The researcher measures the mean wage rate of a sample of 64 workers as 7000. Conduct and conclude the relevant hypothesis testate the significance level of 5%.
Solution:
H0 : μ ≤ 6500
H1 : μ > 6500
The relevant distribution is z.
Since:
|
we reject H0 at α = 0.05.
For:
H0 : μ ≥ μ0
H1 : μ < μ0
At the statistical significance level α:
H0 is rejected if
|
7.3 EXERCISES ___________________________________________________________
A researcher wonders if the mean wage rate of workers in Ankara is less than 7500. The population variance is known to be 1000000. The researcher measures the mean wage rate of a sample of 64 workers as 7000. Conduct and conclude the relevant hypothesis testate the significance level of 5%.
Solution:
H0 : μ ≥ 7500
H1 : μ < 7500
The relevant distribution is z.
Since:
|
we reject H0 at α = 0.05.
For:
H0 : μ = μ0
H1 : μ≠μ0
At the statistical significance level α:
H0 is rejected if
|
7.4 EXERCISES ___________________________________________________________
A researcher wonders if the mean wage rate of workers in Ankara is different than 7500. The population variance is known to be 1000000. The researcher measures the mean wage rate of a sample of 64 workers as 7000. Conduct and conclude the relevant hypothesis testate the significance level of 5%.
Solution:
H0 : μ = 7500
H1 : μ≠7500
The relevant distribution is z.
Since:
|
we reject H0 at α = 0.05.
Consider:
H0 : μ ≤ μ0
H1 : μ > μ0
and i=1n, a random sample of n observations from a normal population
N
where σ2 is unkown.
At the statistical significance level α:
H0 is rejected if
|
7.5 EXERCISES ___________________________________________________________
A researcher wonders if the mean wage rate of workers in Ankara is greater than 6500 , for which the population variance is unknown. The researcher measures the mean wage rate of a sample of 64 workers as 7000 and the ’sample variance’ as 640000. Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : μ ≤ 6500
H1 : μ > 6500
The relevant distribution is t and the degrees of freedom is 63.
Since:
|
we reject H0 at α = 0.05.
For:
H0 : μ ≥ μ0
H1 : μ < μ0
At the statistical significance level α:
H0 is rejected if
|
7.6 EXERCISES ___________________________________________________________
A researcher wonders if the mean wage rate of workers in Ankara is less than 7500 , for which the population variance is unknown. The researcher measures the mean wage rate of a sample of 64 workers as 7000 and the ’sample variance’ as 640000. Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : μ ≥ 7500
H1 : μ < 7500
The relevant distribution is t and the degrees of freedom is 63.
Since:
|
we reject H0 at α = 0.05.
For:
H0 : μ = μ0
H1 : μ≠μ0
At the statistical significance level α:
H0 is rejected if
|
7.7 EXERCISES ___________________________________________________________
A researcher wonders if the mean wage rate of workers in Ankara is different than 7500 , for which the population variance is unknown. The researcher measures the mean wage rate of a sample of 64 workers as 7000 and the ’sample variance’ as 640000. Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : μ = 7500
H1 : μ≠7500
The relevant distribution is t and the degrees of freedom is 63.
Since:
|
we reject H0 at α = 0.05.
Consider:
H0 : P ≤ P0
H1 : P > P0
and i=1n, a random sample of n observations from a Bernoulli
population.
At the statistical significance level α:
H0 is rejected if
|
7.8 EXERCISES ___________________________________________________________
A political candidate wonders if her nationwide support rate exceeds 50%. Among a sample of 64 people, we know 35 support the political candidate. Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : P ≤ 0.50
H1 : P > 0.50
The relevant distribution is z.
Since:
|
we fail to reject H0 at α = 0.05.
For:
H0 : P ≥ P0
H1 : P < P0
At the statistical significance level α:
H0 is rejected if
|
7.9 EXERCISES ___________________________________________________________
A political candidate wonders if her nationwide support rate falls short of 50%. Among a sample of 64 people, we know 30 support the political candidate. Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : P ≥ 0.50
H1 : P < 0.50
The relevant distribution is z.
Since:
|
we fail to reject H0 at α = 0.05.
For:
H0 : P = P0
H1 : P≠P0
At the statistical significance level α:
H0 is rejected if
|
7.10 EXERCISES ___________________________________________________________
A political candidate wonders if her nationwide support rate is different than 50%. Among a sample of 64 people, we know 35 support the political candidate. Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : P = 0.50
H1 : P≠0.50
The relevant distribution is z.
Since:
|
we fail to reject H0 at α = 0.05.
Consider:
H0 : σ2 ≤ σ02
H1 : σ2 > σ02
and i=1n, a random sample of n observations from a normal population
N
.
At the statistical significance level α:
H0 is rejected if
|
7.11 EXERCISES ___________________________________________________________
A process engineer is concerned with the variation of - temperature in an industrial furnace and wonders if it exceeds 1500. She collects a random sample of temperatures as:
Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : σ2 ≤ 1500
H1 : σ2 > 1500
The relevant distribution is χ2 and the degrees of freedom is
8.
Since:
|
we reject H0 at α = 0.05.
For:
H0 : σ2 ≥ σ02
H1 : σ2 < σ02
At the statistical significance level α:
H0 is rejected if
|
7.12 EXERCISES ___________________________________________________________
A process engineer is concerned with the variation of - temperature in an industrial furnace and wonders if it is less than 2500. She collects a random sample of temperatures as:
Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : σ2 ≥ 2500
H1 : σ2 < 2500
The relevant distribution is χ2 and the degrees of freedom is
8.
Since:
|
we fail to reject H0 at α = 0.05.
For:
H0 : σ2 = σ02
H1 : σ2≠σ02
At the statistical significance level α:
H0 is rejected if
|
7.13 EXERCISES ___________________________________________________________
A process engineer is concerned with the variation of - temperature in an industrial furnace and wonders if it is different than 2000. She collects a random sample of temperatures as:
Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : σ2 = 2000
H1 : σ2≠2000
The relevant distribution is χ2 and the degrees of freedom is
8.
Since:
|
we fail to reject H0 at α = 0.05.
7.14 EXERCISES ___________________________________________________________
A manufacturer of automobile batteries claim that at least 80% of the batteries that it produces will last 36 months. A consumers’ advocate group wants to evaluate this longevity claim and selects a random sample of 28 batteries to test. The following data indicate the length of time (in months) that each of these batteries lasted (i. e., performed properly before failure):
42.3, 39.6, 25.0, 56.2, 37.2, 47.4, 57.5, 39.3, 39.2, 47.0, 47.4, 39.7, 57.3, 51.8, 31.6, 45.1, 40.8, 42.4, 38.9, 42.9, 34.1, 49.0, 41.5, 60.1, 34.6, 50.4, 30.7, 44.1
Now, we would like to test, at a significance level of 0.05, if there is a significant evidence that less than 80% of the batteries will last at least 36 months? Conduct and conclude the test.
Solution: The critical element of solution is that what we are testing
here is not the mean product life, rather it is the proportion of items
that last at least 36 months. So, begin by counting the product
lifetimes (among the given 28 measurements), calculate and
proceed straightforwardly with the rest. 8402 This exercise is left as
self-study.
Right after the poll stations are closed at 17:00, a political candidate receives the information that out of the 50 people interviewed her approval "count" is 24. As a statistics lover, she immediately tests the null hypothesis that her population approval rate is less than or equal 0.50 against its respective alternative, at the 5% level of statistical significance. What is the conclusion of this test? Suppose in every consecutive 15 minutes, number of people interviewed increases by 5 and approval count increases by 4. Find the earliest time, HH:MM, that she can declare her victory based on her tests of hypotheses. Note that a formal statistical/algebraic solution is expected with proper terminology and notation.
Solution: This exercise is left as self-study.
In this part, you are more than welcome to transfer your earlier, indeed recently acquired, knowledge to understand things better. Except for one case or two, the material remains fairly intact compared to the ones in confidence intervals for two populations.
Consider:
H0 : μx −μy ≤ 0
H1 : μx −μy > 0
Let i=1n and
i=1n, be two matched samples.
At the statistical significance level α:
H0 is rejected if
|
For:
H0 : μx −μy ≥ 0
H1 : μx −μy < 0
At the statistical significance level α:
H0 is rejected if
|
For:
H0 : μx −μy = 0
H1 : μx −μy≠0
At the statistical significance level α:
H0 is rejected if
|
7.15 EXERCISES ___________________________________________________________
A company is about to release a new drug to assist weight loss, and we are in charge of assessing how effective the drug is. We pick a random sample of 8 people with the following pre-drug body weights:
After using the drug for the designated test duration, the post-drug body weights are measured as:
Conduct and conclude a hypothesis test at the significance level of 5% to assess if ’pre-drug minus post-drug difference of mean body weights is positive’.
Solution:
H0 : μx −μy ≤ 0
H1 : μx −μy > 0
The difference series (pre-drug minus post-drug) is:
The relevant distribution is t and the degrees of freedom is 7.
Since:
|
we fail to reject H0 at α = 0.05.
Consider:
H0 : μx −μy ≤ 0
H1 : μx −μy > 0
Let,
i=1nx ⊂
i=1Nx ∼ N
i=1ny ⊂
i=1Ny ∼ N
where σx2 and σy2 are known.
At the statistical significance level α:
H0 is rejected if
|
For:
H0 : μx −μy ≥ 0
H1 : μx −μy < 0
At the statistical significance level α:
H0 is rejected if
|
For:
H0 : μx −μy = 0
H1 : μx −μy≠0
At the statistical significance level α:
H0 is rejected if
|
7.16 EXERCISES ___________________________________________________________
A researcher wonders if the mean wage of workers in Ankara falls short of that in Istanbul. She has the following data and information: Mean wage rate of 49 workers from Ankara is 6000. Mean wage rate of 81 workers from Istanbul is 7000. Population variance of wages in Ankara and Istanbul are known to be 640000 and 810000, respectively. Conduct and conclude a hypothesis test at the significance level of 5% to assess if mean wage rate in Ankara is less than the mean wage rate in Istanbul.
Solution:
H0 : μx −μy ≥ 0
H1 : μx −μy < 0
The relevant distribution is z.
Since:
|
we reject H0 at α = 0.05.
Consider:
H0 : μx −μy ≤ 0
H1 : μx −μy > 0
Let,
i=1nx ⊂
i=1Nx ∼ N
i=1ny ⊂
i=1Ny ∼ N
where σx2 and σy2 are unkown but assumed to be equal.
At the statistical significance level α:
H0 is rejected if
|
In our formulation:
|
and,
|
|
For:
H0 : μx −μy ≥ 0
H1 : μx −μy < 0
At the statistical significance level α:
H0 is rejected if
|
For:
H0 : μx −μy = 0
H1 : μx −μy≠0
At the statistical significance level α:
H0 is rejected if
|
7.17 EXERCISES ___________________________________________________________
A researcher wonders if the mean wage of workers in Ankara falls short of that in Istanbul. She has the following data and information: Mean wage rate of 49 workers from Ankara is 6000. Mean wage rate of 81 workers from Istanbul is 7000. Population variances of wages in Ankara and Istanbul are unknown but they are assumed to be equal. Sample variance of wages in Ankara and Istanbul are calculated as 490000 and 640000, respectively. Conduct and conclude a hypothesis test at the significance level of 5% to assess if mean wage rate in Ankara is less than the mean wage rate in Istanbul.
Solution:
H0 : μx −μy ≤ 0
H1 : μx −μy > 0
|
The relevant distribution is t and the degrees of freedom is 128.
Since:
|
we reject H0 at α = 0.05.
Consider:
H0 : μx −μy ≤ 0
H1 : μx −μy > 0
Let,
i=1nx ⊂
i=1Nx ∼ N
i=1ny ⊂
i=1Ny ∼ N
where σx2 and σy2 are unkown and assumed not to be equal.
At the statistical significance level α:
H0 is rejected if
|
In our formulation:
|
Notice that, if nx = ny = n
|
For:
H0 : μx −μy ≥ 0
H1 : μx −μy < 0
At the statistical significance level α:
H0 is rejected if
|
For:
H0 : μx −μy = 0
H1 : μx −μy≠0
At the statistical significance level α:
H0 is rejected if
|
7.18 EXERCISES ___________________________________________________________
A researcher wonders if the mean wage of workers in Ankara falls short of that in Istanbul. She has the following data and information: Mean wage rate of 49 workers from Ankara is 6000. Mean wage rate of 81 workers from Istanbul is 7000. Population variances of wages in Ankara and Istanbul are unknown and they are assumed to be unequal. Sample variance of wages in Ankara and Istanbul are calculated as 490000 and 640000, respectively. Conduct and conclude a hypothesis test at the significance level of 5% to assess if mean wage rate in Ankara is less than the mean wage rate in Istanbul.
Solution:
H0 : μx −μy ≤ 0
H1 : μx −μy > 0
The relevant distribution is t and the degrees of freedom is ν:
|
Since:
|
we reject H0 at α = 0.05.
Consider:
H0 : Px −Py ≤ 0
H1 : Px −Py > 0
Let
i=1nx ⊂
i=1nX ∼ Bernoulli
i=1ny ⊂
i=1nY ∼ Bernoulli
At the statistical significance level α:
H0 is rejected if
|
In our formulation:
|
For:
H0 : Px −Py ≥ 0
H1 : Px −Py < 0
At the statistical significance level α:
H0 is rejected if
|
For:
H0 : Px −Py = 0
H1 : Px −Py≠0
At the statistical significance level α:
H0 is rejected if
|
7.19 EXERCISES ___________________________________________________________
A political candidate wonders if her support rate in Ankara exceeds that in Istanbul. We know that among 64 people from Ankara 35 supports the candidate and among 81 people from Istanbul 45 supports the candidate. Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : Px −Py < 0
H1 : Px −Py ≥ 0
|
The relevant distribution is z.
Since:
|
we fail to reject H0 at α = 0.05.
Consider:
H0 : σx2 ≤ σy2
H1 : σx2 > σy2
Let
i=1nx ⊂
i=1Nx ∼ N
i=1ny ⊂
i=1Ny ∼ N
At the statistical significance level α:
H0 is rejected if
|
Go over the description of F-distribution in Chapter 10.
In our formulation:
|
|
For:
H0 : σx2 ≥ σy2
H1 : σx2 < σy2
At the statistical significance level α:
H0 is rejected if
|
For:
H0 : σx2 = σy2
H1 : σx2≠σy2
At the statistical significance level α:
H0 is rejected if
|
7.20 EXERCISES ___________________________________________________________
A process engineer wonders if the temperature variation in Furnace X exceeds that in Furnace Y. Sample variance of temperatures in Furnace X is 1600 on the basis of 10 temperature readings and sample variance of temperatures in Furnace Y is 1100 on the basis of 8 temperature readings. Conduct and conclude the relevant hypothesis test at the significance level of 5%.
Solution:
H0 : σx2 < σy2
H1 : σx2 ≥ σy2
The relevant distribution is F with a numerator degrees of freedom of 9
and a denominator degrees of freedom of 7.
Since:
|
we fail to reject H0 at α = 0.05.
0Go to Teaching page & experiment with F(ν1,ν2) using the file named ‘Statistical distributions.xlsx’.
7.21 EXERCISES ___________________________________________________________
Consider the hypotheses regarding two normal populations X and Y:
H0 : σx2 ≤ σy2 H1 : σx2 > σy2
Sample values for X and Y are given as follows:
X: | 2 | 8 | 5 | 4 | 3 | 7 | 9 | 6 |
Y: | 26 | 24 | 23 | 25 | 22 | 27 | ||
Conduct and conclude the test at α = 0.05. Clearly state the test statistic, the distribution of test statistic and critical value(s). Find the necessary critical values from the end of your textbook or from Internet sources.
Consider two populations X and Y for which a researcher has estimated the following confidence intervals given that = 150 and ȳ = 250.
|
|
In her research report, the researcher noted that she used an N
distribution in her calculations. Based on these, calculate a 90%
confidence interval for μx −μy
Solution: This exercise requires some little portion of creative thinking. As the researcher has wed the standard normal distribution in her calculations, this means σx2 and σy2 are both known (or given). As the given confidence intervals ’for μx and μy are one-sided, the critical values are −1.29 and 1.65, respectively. So,
Once these are known, estimation of a 90% C.I. for μx −μy is straightforward.
p-value is defined as the tail probability of a test statistic. While conducting hypothesis tests manually, i.e. with a pencil on paper, use of a p-value is not essential, since calculation of p-value already requires a calculated test statistic. In some cases, we may need to do so, though. A p-value is especially practical when we do our analysis on a computer using a dedicated software. The rule is simple:
H0 is rejected if p−value < α
A course-related/pedagogical warning about the p-value is that, my expectation (from students) is to see the proper use of ’test score vs critical value’ comparisons in concluding hypothesis tests rather than p−value vs α’ comparisons, unless otherwise stated. In your future/professional practice you will have full freedom to enjoy p−values.
Despite Power is not a difficult concept to grasp intuitively, its mathematics is often confusing to students. Patiently go over the following:
As you may recall, a Type I error occurs if the researcher rejects the null hypothesis in favor of the alternative hypothesis when, in fact, the null hypothesis is true. The probability of Type I error is denoted as α. A Type II error, on the other hand, occurs if the researcher fails to reject the null hypothesis when, in fact, the null hypothesis is false. The probability of Type II error is denoted as β. So,
So, in plain language, Power is the ability of a test to avoid a false null hypothesis.
(As a caution, note that there is no requirement of any sort like α + β = 1)
As you may pick infinitely many alternative values for your parameter of interest, there is a multiplicity of values for Power. So, Power, is indeed a function. We often write it as a function of the difference between the alternative and hypothesized parameter values.
As a closing remark, drawing graphs (rather than calculating) may be very useful to understand the Type II error as well as Power.
7.22 EXERCISES ___________________________________________________________
In a two-sided (two-tailed) hypothesis test, the test statistic was calculated as 0.18. We know that the distribution of the test statistic (call this A distribution) has the triangular shaped union of the line segments [AB] and [BC], given A(0.00, 0.00), B(2.00, 0.50) and C(4.00, 0.00). Conclude the test at α = 0.005 by calculating and using p-values only. In your answer, clearly define what p-value is.
Referring to H0 : μ = 0 against H1 : μ > 0 and using proper drawings of the relevant distributions, demonstrate that
ii. Power of a hypothesis test gets higher as population variance gets smaller
Make sure your drawings are clear and well-explained.
Consider a large box which contains many white (W) and black (B)
balls. We have forgotten the percentage of white balls in the box, but
remember that it is either or
. Even though we do not know the
percentage of white balls in the box we strongly believe that it is
(but still believe that it might be
). Hence we decide to test if
the percentage of white balls in the box is
. For this purpose
we draw 20 balls at random with replacement and note their
color.
In the investigation of the average performance of produced kettles, a quality control engineer examines 49 kettles and measures the mean time to heat 1 liter of water from 25 ∘C to 100 ∘C as 75seconds. Knowing that this had a historical variance of 100seconds2, he wants to test at α = 0.05 whether the population mean time is equal to 60seconds or not, as the producer’s advertisements say "1 liter in 1 minute". Help him to correct the mistakes in his statistical test report.
Solution: The hypotheses involved should be written as:
As the historical variance of temperatures is known to be 100, the researcher should use:
where the upper-critical z-value is 1.65 in this one-sided test. Since 1.5 < 1.65, we fail to reject H0. The mean time to boil water is not longer than 60 seconds, as promised in the advertisements.
A researcher investigates whether two different teaching methods yield
similar impacts on learning of students. After Method 1 is used in
Section 1 and Method 2 is used in Section 2 of the same course, the
same final exam is given to both sections. Then the researcher forms a
95% confidence interval as for the difference of exam
grades (Section 1 grade minus Section 2 grade). Can you analyze
whether there is a difference of 15points between the grades of two
sections?
The choice of confidence level for statistical practices
depend on the scientific/technical discipline. Referring to an
economist/financial analyst (performing portfolio analysis), a computer
scientist (designing and coding national payment systems), an
international relations specialist (trying to avoid nuclear conflicts) and
a physicist working for the CERN (searching for a very rare
subatomic particle), explain how the confidence level must be
chosen.
We have the following information:
Help him to find the critical value needed.
Solution: This exercise is left as self-study.