Topic:Two Tailed Hypothesis Testing

From SharedExperienceProject

Jump to: navigation, search

Contents

Topic Highlights

(What you will learn)

  • What it means to test a hypothesis
  • Why you would do it
  • How to do it for the two tailed case

Introduction and Motivation

(Why learn it)

In this topic we're still talking about inferential statistics. In fact, for the rest of the course we will be talking about cases where the parameters Image:Mu_for_poisson.png and Image:Alpha.png are unknown and have to be estimated by sample statistics Image:Xbar.png and s.

In the topic More on Confidence Intervals, we saw that a confidence interval can be used to represent the precision with which Image:Xbar.png estimates Image:Mu_for_poisson.png.

In this topic we will take this concept further by looking at ways to use the sample mean to test claims made about the population mean.

More generally, hypothesis testing has many business applications. One that jumps to mind is the testing required on information collected in a marketing research survey. More on this later.

Learning Activities

(How the levels of understanding will be gained)

Learning activities for this topic
Type Name Direction
Lecture and discussion
Instructor-directed
In-class worksheet
Reading Self-directed
Personal activities Self-directed

Learning Objectives

(Levels of understanding to be gained)

Learning objectives for this topic
Level of Understanding Objective(s)
Very best
Highly satisfactory
Satisfactory
  • Can I follow the extensive example we did after Example 2?
Maybe just enough to pass

Topic Notes: What is Hypothesis Testing Anyway?

As mentioned above in the introduction, hypothesis testing plays an important role in many business applications, most notably in marketing research.

The most important first step you can take is understanding the basic concept. And it's not all that difficult. Let's take a look using an example, which we will use throughout this section of the notes.

Example 1

An advertisement put out recently by a manufacturer of Compact Fluorescent Light (CFL) bulbs indicates that the average life time of their new product is 10000 hours.

Imagine that you run the marketing division of competing company and your boss comes in demanding you find out whether the claim of the first company is correct. After all, your bulbs don't last as long. How would you approach evaluating the claim that Image:Mu_for_poisson.png = 10000 hours?


Example 2

Okay, now suppose:

  • You collect a sample, say n=100 bulbs
  • You test the life times of each (wait approximately 10000 hours!)
  • You compute the sample mean to find that Image:Xbar.png = 9930
  • You also compute the sample standard deviation to find that s = 400 hours

What then? How does this help you test their claim that the true mean Image:Mu_for_poisson.png = 10000 hours?

This is where the concept of hypothesis testing comes in. The idea is that we're going to define a hypothesis and test it against some condition. Let's start by looking at what that condition is.

Significance level

The condition for a hypothesis test is often defined by something called a significance level. For now you should know the following:

  • It is the probability of rejecting our hypothesis when the hypothesis is in fact true, e.g. probability that a device indicates you have a disease when in fact you do not
  • It is represented by the symbol Image:Alpha.png
  • It is typically in the range: Image:Alpha.png = 0.01 to 0.1

On the standard normal curve, Image:Alpha.png looks like the following, which explains why it is called two-tailed hypothesis testing:

Image:Two_tailed_hypothesis_testing_2.png

The green areas are known as the rejection region.

The manager is usually responsible for specifying a significance level based on the issue at hand. We will look more at this in a future topic.

Step 1: Define the hypothesis

The primary, or null, hypothesis represented as H0. It is a statement concerning the population parameter that the researcher wishes to discredit.

In the case of population means, the null hypothesis takes the form:

H0: Image:Mu_for_poisson.png = Image:Mu_for_poisson.png0

where Image:Mu_for_poisson.png0 is the value of the mean to be tested.

In the case described in Example 1, the null hypothesis is:

H0: Image:Mu_for_poisson.png = 10000

We also usually define an alternative hypothesis, Ha, which is the complementary statement to H0. In Example 1, this is:

Ha: Image:Mu_for_poisson.png Image:Not_equal_sign.png 10000

When solving problems of this type, you need to write both the null and alternative hypothesis in your solution.

Be clear however, that the task of hypothesis testing is to either reject H0 or fail to reject H0.

Step 2: Sketch the situation

The next step is to sketch the situation, including the rejection region and the z-scores. Let's do this assuming we've been told to use a significance level Image:Alpha.png = 0.100.

First, draw and label the rejection regions:

Image:Two_tailed_hypothesis_testing_3.png

Then add the z-scores, which you should know because you can figure out that the area between 0 and z is 0.45 (and because of all the z-score values we summarized for your equation sheet):

Image:Two_tailed_hypothesis_testing_4.png

Computationally, the goal of the test is to find the z-score corresponding to Image:Xbar.png, for which we use the symbol z*, and looking to see whether it falls in one of the rejection regions:

Image:Two_tailed_hypothesis_testing_5.png

Let's do this now to find out.

Step 3: Compute the test statistic (z-score)

Finding the test statistic z* just means finding the z-score for Image:Xbar.png. You should be familiar enough now to recognize that the following applies for this:

Image:Z_for_two_tailed.png

Because we don't know the population standard devation Image:Sigma.png, we'll approximate the above equation with the following:

Image:Z_for_two_tailed_2.png

Plugging in the numbers from Example 2 we get:

z* = (9930 - 10000) / (400/sqrt(100)) = -1.75

Step 4: Determining whether to reject

So, you can add the following to your sketch, showing that because -1.75 is less than -1.645, z* does fall in the lower rejection region as follows:

Image:Two_tailed_hypothesis_testing_6.png

This means that the null hypothesis should be rejected. You could write something like:

  • I reject H0 because -1.75 is less than -1.645

Step 5: Give a conclusion

Now just give a succint conclusion, e.g.:

  • Based on a sample of 100 and a significance level of 0.10, the average life time of the CFL bulbs is not 10000.

Notes:

  • If you reject H0, you can say that there is a difference (as we did above)
  • But if you don't reject H0, you should NEVER say that you proved there is no difference - you didn't prove that! - you can only say that we cannot reject their claim

That's it!

In the case presented in Example 1, you could now tell your boss that the claim made by the competing company is not supported by your data at the 0.10 significance level.

Topic Notes: The Hypothesis Testing Steps

Now that you've been through it once, let's summarize the basic steps:

1. Define the hypotheses, i.e.:
H0: Image:Mu_for_poisson.png = some number
Ha: Image:Mu_for_poisson.png Image:Not_equal_sign.png that number

2. Sketch the situation, e g.:

Image:Two_tailed_hypothesis_testing_4.png

3. Compute the test statistic, i.e. using:

Image:Z_for_two_tailed_2.png

4. Determine whether to reject

5. Give a conclusion

These steps should be on your equation sheet.

You should do each of these every time you are asked a two-tailed test of the mean.

Topic Notes: Do I really Get It?

A good test of whether you really understand is to compare the hypothesis testing situation to the confidence interval situation.

Let's do that with another example.

Example 3

a) Compute the confidence interval for the level of confidence corresponding to the significance level Image:Alpha.png=0.10 in Example 2. Use the same values of n = 100, Image:Xbar.png = 9930, s = 400.

b) Does the confidence interval support your hypothesis testing conclusion obtained above?

c) Sketch both situations together


Practice Problems

Practice Problem 1

Do exercise 8.7 on page 317 of Kvanli et al. The solution is in the back of the book.

For those who don't have this textbook, this problem is very similar to the examples seen above.

Practice Problem 2 (ES)

You work for a manufacturer of Compact Fluorescent Light (CFL) bulbs that has been selling standard CFL bulbs for many years. The average life time of its standard bulbs is 10000 hours.

The company has recently come out with a new type of bulb that it believes last longer on average. You take a sample of 65 of the new bulbs and find that the average life time is 10100 hours and the standard deviation is 450 hours.

a) Using a 5% level of significance, determine whether there is a significant difference in the average life time of the new bulbs

b) Find the confidence interval for the average lifetime for a 95% confidence level

c) Do the hypothesis test and confidence interval lead you to the same conclusion?


Practice Problem 3

A donut factory has just completed contract negotiations with the union representing its workers. The company claims that the workers' average salary is now $400 per week. In a survey of 28 workers, the average wage was found to be $378.86 and the standard deviation was $49.20. The wages are known to be normally distributed.

Is the company's claim incorrect? Use a level of significance of 2%.


Footnote

  1. In a later topic, we will learn that n=28 corresponds to the small sample case, which is solved slightly differently than this (using the t-distribution). Don't worry about this for now, and solve this using the z-distribution.
Personal tools