Topic:More on Probability

From SharedExperienceProject

Jump to: navigation, search

Contents

Topic Highlights

(What you will learn)

  • Venn diagrams
  • The relationship between Venn diagrams and contingency tables
  • More on basic probability equations
  • Using Venn diagrams, contingency tables and the equations to solve probability problems

Introduction and Motivation

(Why learn it)

This topic extends the concepts learned in the topic Introduction to Probability. Same motivation here, just more of the advanced concepts, and not only the case of independent events.

Be sure to understand these too.

Learning Activities

(How the levels of understanding will be gained)

Learning activities for this topic
Type Name Direction
Reading Self-directed
In-class worksheet Self-directed
In-class discussion
Instructor-directed
Practice problems
Self-directed
Personal activities Self-directed


Learning Objectives

(Levels of understanding to be gained)

Learning objectives for this topic
Level of Understanding Objective(s)
Very best
Highly satisfactory
Satisfactory
Maybe just enough to pass


Topic Notes: Venn Diagrams and the Probability Equations

These notes are intended to facilitate a discussion introducing the concept of a Venn diagram and to use such diagrams to explain each of the probability equations.

What is a Venn Diagram?

A Venn diagram is a visual way to represent events in a probability situation. Let's have a look at some examples.

Example 1

A manager counts his staff to find a total of 180 people, of which 80 are males.

a) Create the frequency distribution table

b) Draw the Venn diagram



In the above example, be sure you see how the outcomes can only be male or female; the sum of those two areas make up the set of possible outcomes. This is shown explicitly below using shading, just to make sure you've understood this important concept:

Image:Venn_diagram_2.png

From this, it shouldn't be a stretch for you to see that where the complement equation comes from:

1 = P( F ) + P( M )

which is written in the general form as follows:

1 = P( A ) + P( B )

You should also recognize that F and M are complements. The general form of the equation for probabilities of complements is:

Image:Complement_equation.png

Example 2

Remember our marketing manager from last time? She had collected the data below:

Image:Contingency_table_example_-_raw_data.PNG

Do you recall the contingency table you created for her (just checking):

Image:Contingency_table_example_-_joint_probs.PNG

How would you use the Venn diagram concept to represent the same data for these two classes?



Notice that we don't usually put any numbers on Venn diagrams - we only use them to represent the classes and the relationships between them. In fact, the usual Venn diagram is even more simple than this one. Because I and N are complements, we don't usually need to label the I. Same goes for U. This gives the standard form of the Venn diagram for two classes of events:

Image:Venn_diagram_7.png

We will see that this conceptual representation can be useful for helping us think about the problems we are posed, and for knowing which equations to use.

Example 3

Can you identify the following areas in the final Venn diagram from Example 2?

a) N and O

b) N or O



Example 4

What about the following areas?

a) I and O

b) I and U

c) N and U



How do conditional probabilities fit in?

Now let's use Venn diagrams to understand conditional probabilities and the corresponding equation.

Example 5

Consider the same Venn diagram. How can one use it to think about a conditional probability? For example, let's say we were asked about the probability of respondents not liking the new shoes (N), given that they are older than 18 (O). This is one way to think about it:

Image:Venn_for_conditional_prob.png

From this, it follows that:

P( N | O ) = P( N and O ) / P( O )

The above concept gives rise to the general conditional probability equation given in your textbook:

P( A | B ) = P( A and B ) / P( B )

What about joint probabilities?

The joint probability equation, or multiplicative rule, is hard to show using Venn diagrams. Rather, let's just find it by rearranging the above:

P( A and B ) = P( A | B ) x P( B )

Independence

It is important that you notice that the above equation for joint probability is not the same equation that we saw in the topic Introduction to Probability. The one we studied there was for independent events, whereas the one given above is a general form that always applies.

Two events, A and B are independent if knowing whether A occurs does not change the probability that B occurs.

You should use the form above, unless you are told that the events are independent, or unless you can show that they are using one of the following equations. Tests for independence:

P( A | B ) = P( A )
P( B | A ) = P( B )
P( A and B ) = P( A ) x P( B )

Mutual Exclusiveness

Venn diagrams are also useful for understanding the concept of mutual exclusiveness.

Example 6

Consider the case given below. Notice that it is not possible for an outcome to be both an event of type A and an event of type B. That's because the events are mutually exclusive.

Image:Venn_diagram_8.png

You can test for mutual exclusiveness by checking that:

P( A and B ) = 0

This should make sense to you from the above figure because there is no space where both A and B exist together.

The Additive Rule

The additive rule is a term used to represent the concept of finding the probability that one or another of two events will occur, P( A or B ), but not both.

The Venn diagram in Example 6 is useful to illustrate the additive rule. For the case of mutually exclusive events, you should be able to see that:

P( A or B ) = P( A ) + P ( B )

In other words, the chance of event A or event B is the sum of each chance.

The general case is only slightly more complex. Consider the following Venn diagram where the events are not mutually exclusive:

Image:Venn_diagram_9.png

Here, we want the same area, A + B, but we need to subtract the small shaded area, which will have been counted twice if we don't.

From our earlier discussions, you should recognize this area to be: A and B. So, the general additive rule is:

P( A or B ) = P( A ) + P ( B ) - P ( A and B )

Lecture Notes: Comprehensive Example

The following comprehensive example is intended to facilitate an introduction to problems using the main approaches we've seen so far: Venn diagrams, probability equations and contingency tables.

Problem statement

Imagine that you run the sports centre at Mount Royal College. You carry out a survey of several classes at the college to find out that 20% of the students work out before classes, 50% work out after classes, and 10% (the hard core students) work out both before and after class.

Example 7

a) For the situation provided, define the two relevant classes and give them one-letter labels



b) What is the probability corresponding to each class?



c) Draw a suitable Venn diagram



d) What is the probability corresponding to the students who work out before or after class?



e) If a student works out before class, what is the probability that he or she will work out after class?



f) If nobody is working out during classes, what is the probability that a student does not work out at all?



Example 8

Now let's tackle the same challenge a different way.

a) Draw a contingency diagram for the situation described above, and place the data you have



b) fill in the remaining values, if you haven't already



c) Find the answers to questions d), e) and f) of Example 7:




Practice Problems

As promised last time, this set of problems is more advanced. Try not to look at the solutions until you've worked the problem through.

Practice Problem 1

You are the data analyst for an environmental lobby group and you are concerned about how the upcoming election will impact the chances of a new environmental law going through. Your team has collected the following data from members of the three main political parties about their support for the law:

Image:Contingency_for_PP_3.PNG

Find these values:

a) The probability of being for the law and a member of the Conservatives

b) The marginal probability of being against the law

c) The marginal probability of being in the Liberal party

d) The likelihood of a member being against the law given that he or she is an NDP

e) The likelihood of any member not being undecided

f) The probability of being for the law and an NDP

g) P( Liberal ) x P( against | NDP )


Practice Problem 2

A recruitment manager surveys 50 potential students of the Bissett School of Business. They are classified on the basis of age and their interest in three of the programs (Entrepreneurship, Supply Chain and Accounting). Of the respondents: 28 were under 20 years old, 11 were between 20 and 25 years old, 20 were interested in Entrepreneurship, 20 were interested in Accounting, 15 of those interested in Entrepreneurship were under 20 years old, 9 of those interested in Supply Chain were under 20, 1 of those interested in Supply Chain is between 20 and 25 years old, and 6 of those interested in Accounting are between 20 and 25 years old.

Part 1: What is the probability of an incoming student being:

a) under 20 years old?
b) interested in Entrepreneurship?
c) interested in Accounting and over 25?
d) over 25 years old, given that he or she is interested in Supply Chain?
e) interested in Supply Chain or under 20?

Part 2: Are being interested in Supply Chain and being over 25 independent events?

HINT: Don't get scared by the written text. Just write down the relevent classes and everything you know. Then go from there.


Personal tools