Topic:Two Tailed Hypothesis Testing
From SharedExperienceProject
Contents |
Topic Highlights
(What you will learn)
- What it means to test a hypothesis
- Why you would do it
- How to do it for the two tailed case
Introduction and Motivation
(Why learn it)
In this topic we're still talking about inferential statistics. In fact, for the rest of the course we will be talking about cases where the parameters
and
are unknown and have to be estimated by sample statistics
and s.
In the topic More on Confidence Intervals, we saw that a confidence interval can be used to represent the precision with which
estimates
.
In this topic we will take this concept further by looking at ways to use the sample mean to test claims made about the population mean.
More generally, hypothesis testing has many business applications. One that jumps to mind is the testing required on information collected in a marketing research survey. More on this later.
Learning Activities
(How the levels of understanding will be gained)
| Type | Name | Direction |
| Lecture and discussion |
| Instructor-directed |
| In-class worksheet |
| |
| Reading |
| Self-directed |
| Personal activities |
| Self-directed |
Learning Objectives
(Levels of understanding to be gained)
| Level of Understanding | Objective(s) |
| Very best |
|
| Highly satisfactory |
|
| Satisfactory |
|
| Maybe just enough to pass |
|
Topic Notes: What is Hypothesis Testing Anyway?
As mentioned above in the introduction, hypothesis testing plays an important role in many business applications, most notably in marketing research.
The most important first step you can take is understanding the basic concept. And it's not all that difficult. Let's take a look using an example, which we will use throughout this section of the notes.
Example 1
An advertisement put out recently by a manufacturer of Compact Fluorescent Light (CFL) bulbs indicates that the average life time of their new product is 10000 hours.
Imagine that you run the marketing division of competing company and your boss comes in demanding you find out whether the claim of the first company is correct. After all, your bulbs don't last as long. How would you approach evaluating the claim that
= 10000 hours?
| Solution |
|---|
|
Example 2
Okay, now suppose:
What then? How does this help you test their claim that the true mean
= 10000 hours?
This is where the concept of hypothesis testing comes in. The idea is that we're going to define a hypothesis and test it against some condition. Let's start by looking at what that condition is.
Significance level
The condition for a hypothesis test is often defined by something called a significance level. For now you should know the following:
On the standard normal curve,
looks like the following, which explains why it is called two-tailed hypothesis testing:
The green areas are known as the rejection region.
The manager is usually responsible for specifying a significance level based on the issue at hand. We will look more at this in a future topic.
Step 1: Define the hypothesis
The primary, or null, hypothesis represented as H0. It is a statement concerning the population parameter that the researcher wishes to discredit.
In the case of population means, the null hypothesis takes the form:
H0:=
0
where
0 is the value of the mean to be tested.
In the case described in Example 1, the null hypothesis is:
H0:= 10000
We also usually define an alternative hypothesis, Ha, which is the complementary statement to H0. In Example 1, this is:
Ha:![]()
10000
When solving problems of this type, you need to write both the null and alternative hypothesis in your solution.
Be clear however, that the task of hypothesis testing is to either reject H0 or fail to reject H0.
Step 2: Sketch the situation
The next step is to sketch the situation, including the rejection region and the z-scores. Let's do this assuming we've been told to use a significance level
= 0.100.
First, draw and label the rejection regions:
Then add the z-scores, which you should know because you can figure out that the area between 0 and z is 0.45 (and because of all the z-score values we summarized for your equation sheet):
Computationally, the goal of the test is to find the z-score corresponding to
, for which we use the symbol z*, and looking to see whether it falls in one of the rejection regions:
Let's do this now to find out.
Step 3: Compute the test statistic (z-score)
Finding the test statistic z* just means finding the z-score for
. You should be familiar enough now to recognize that the following applies for this:
Because we don't know the population standard devation
, we'll approximate the above equation with the following:
Plugging in the numbers from Example 2 we get:
z* = (9930 - 10000) / (400/sqrt(100)) = -1.75
Step 4: Determining whether to reject
So, you can add the following to your sketch, showing that because -1.75 is less than -1.645, z* does fall in the lower rejection region as follows:
This means that the null hypothesis should be rejected. You could write something like:
- I reject H0 because -1.75 is less than -1.645
Step 5: Give a conclusion
Now just give a succint conclusion, e.g.:
- Based on a sample of 100 and a significance level of 0.10, the average life time of the CFL bulbs is not 10000.
Notes:
- If you reject H0, you can say that there is a difference (as we did above)
- But if you don't reject H0, you should NEVER say that you proved there is no difference - you didn't prove that! - you can only say that we cannot reject their claim
That's it!
In the case presented in Example 1, you could now tell your boss that the claim made by the competing company is not supported by your data at the 0.10 significance level.
Topic Notes: The Hypothesis Testing Steps
Now that you've been through it once, let's summarize the basic steps:
1. Define the hypotheses, i.e.:H0:= some number
Ha:![]()
that number
2. Sketch the situation, e g.:
3. Compute the test statistic, i.e. using:4. Determine whether to reject
5. Give a conclusion
These steps should be on your equation sheet.
You should do each of these every time you are asked a two-tailed test of the mean.
Topic Notes: Do I really Get It?
A good test of whether you really understand is to compare the hypothesis testing situation to the confidence interval situation.
Let's do that with another example.
Example 3
a) Compute the confidence interval for the level of confidence corresponding to the significance level
=0.10 in Example 2. Use the same values of n = 100,
= 9930, s = 400.
b) Does the confidence interval support your hypothesis testing conclusion obtained above?
c) Sketch both situations together
| Solution to a) |
|---|
|
From the topic More on Confidence Intervals, you should recognize that: CI = [ You can then find df = n-1 = 99 When you look at Table A.5 you will see that there is no entry for df = 99, so you will have to make use of the fact that the normal curve is very close to the t curve for n > 30. This allows you to use: CI = [ A significance level of 0.10 corresponds to a confidence level of 0.90 or 90%. Convince yourself of this using the following sketch: z is therefore = -1.645. Filling in the values, you get: CI = [ 9930 - 1.645*400/sqrt(100), 9930 + 1.645*400/sqrt(100) ] = [ 9864.21, 9995.79 ] |
| Solution to c) |
|---|
|
The following sketch shows both situations. Notice that the complementary nature of the two: |
Practice Problems
Practice Problem 1
Do exercise 8.7 on page 317 of Kvanli et al. The solution is in the back of the book.
For those who don't have this textbook, this problem is very similar to the examples seen above.
Practice Problem 2 (ES)
You work for a manufacturer of Compact Fluorescent Light (CFL) bulbs that has been selling standard CFL bulbs for many years. The average life time of its standard bulbs is 10000 hours.
The company has recently come out with a new type of bulb that it believes last longer on average. You take a sample of 65 of the new bulbs and find that the average life time is 10100 hours and the standard deviation is 450 hours.
a) Using a 5% level of significance, determine whether there is a significant difference in the average life time of the new bulbs
b) Find the confidence interval for the average lifetime for a 95% confidence level
c) Do the hypothesis test and confidence interval lead you to the same conclusion?
| Solution to b) |
|---|
|
Do this to find: CI = [ 9990.6034, 10209.3966 ] |
Practice Problem 3
A donut factory has just completed contract negotiations with the union representing its workers. The company claims that the workers' average salary is now $400 per week. In a survey of 28 workers, the average wage was found to be $378.86 and the standard deviation was $49.20. The wages are known to be normally distributed.
Is the company's claim incorrect? Use a level of significance of 2%.
| Solution |
|---|
|
Given:
s = 49.20 n = 28[1] Step 1: Hypotheses: H0: Ha:Step 2: Sketch the situation Here, you should show that the area on the outsides is 0.01 and that z=2.33 Step 3: Compute the test statistic: z* = -2.274 Step 4: Determine whether to reject: Do not reject H0 since -2.33 < -2.274 Step 5: Draw a conclusion: Based on the sample of 28 and a significance level of 2%, the average salary is not significantly different from $400. Evidence was not found that the company's claim is incorrect. |
Footnote
- ↑ In a later topic, we will learn that n=28 corresponds to the small sample case, which is solved slightly differently than this (using the t-distribution). Don't worry about this for now, and solve this using the z-distribution.












