Probability Part 1: Rules and Patterns: Crash Course Statistics #13

CrashCourse
25 Apr 201812:00

Summary

TLDRIn this Crash Course episode, Adriene Hill explores the concept of probability, distinguishing between empirical and theoretical probabilities. Empirical probability is derived from observed data, such as the ratio of girls in families, while theoretical probability represents an ideal truth. Hill introduces the addition rule for calculating the probability of multiple, non-mutually exclusive events and the multiplication rule for independent events. She also discusses conditional probabilities, using examples like medical screenings and the likelihood of simultaneous events, such as Cole Sprouse's presence and 'Free Ice Cream Night' at IHOP, to illustrate these principles.

Takeaways

  • 🧠 The human brain's tendency to see patterns, such as faces in inanimate objects, is known as pareidolia.
  • 📊 Probability in statistics is defined in two ways: empirical probability, which is observed in data, and theoretical probability, which is an ideal or universal truth.
  • 🔍 Empirical probability is derived from actual data and provides an estimate of the theoretical probability but is subject to uncertainty and randomness.
  • 🎰 Theoretical probability is an idealized concept that can be estimated through empirical observations, such as the probability of winning a jackpot on a slot machine.
  • 🍭 The addition rule of probability is used to calculate the probability of one or more events occurring, considering whether events are mutually exclusive or not.
  • 📝 Probability notation, like P(Red) for the probability of drawing a red Skittle, simplifies the expression of complex probability scenarios.
  • 🎲 When events are not mutually exclusive, the full addition rule accounts for the overlap by adding the probabilities of individual events and subtracting the probability of their joint occurrence.
  • 🍨 The multiplication rule is used to find the probability of multiple independent events occurring together, such as Cole Sprouse being at IHOP and it being 'Free Ice Cream Night'.
  • 🏥 Conditional probabilities, like P(Cancer | Positive Test), are crucial in medical screenings to understand the likelihood of a condition given a test result.
  • 🎟️ Probability can inform decision-making in everyday life, such as choosing between movie tickets or applying to multiple colleges, by helping to set realistic expectations.

Q & A

  • What is pareidolia and how does it relate to the human brain's pattern recognition?

    -Pareidolia is a psychological phenomenon where the brain perceives a familiar pattern, such as a face, in an unrelated object. It occurs because the human brain is exceptionally adept at recognizing patterns, sometimes to the extent of seeing them even when they aren't present.

  • What are the two types of probability discussed in the script?

    -The two types of probability discussed are empirical probability and theoretical probability. Empirical probability is derived from observed data, while theoretical probability represents an ideal or universal truth that we cannot directly observe.

  • How is empirical probability different from theoretical probability?

    -Empirical probability is based on observed data and carries some uncertainty due to being a sample of all available data. In contrast, theoretical probability is an idealized, exact value that represents the true likelihood of an event occurring in the universe.

  • What is the addition rule of probability, and when is it used?

    -The addition rule of probability is used to calculate the probability of either one event or another occurring. It is used when events are mutually exclusive, meaning they cannot happen at the same time. The rule states that the probability of A or B occurring is P(A) + P(B).

  • What is meant by mutually exclusive events in the context of probability?

    -Mutually exclusive events are those that cannot occur simultaneously. In probability, the probability of two mutually exclusive events happening at the same time is zero.

  • How do you calculate the probability of getting a red or purple Skittle from a bag if each color has a 20% chance?

    -The probability of drawing a red or purple Skittle is calculated by adding the individual probabilities of each color since they are mutually exclusive. So, P(Red or Purple) = P(Red) + P(Purple) = 0.2 + 0.2 = 0.4 or 40%.

  • What is the full addition rule for probability, and when is it used?

    -The full addition rule for probability is used when events are not mutually exclusive. It is calculated as P(A or B) = P(A) + P(B) - P(A and B). This rule accounts for the probability of both events occurring together to avoid double counting.

  • What is the multiplication rule in probability, and how is it used?

    -The multiplication rule is used to calculate the probability of two or more independent events occurring together. It is expressed as P(A and B) = P(A) * P(B), provided that the events are independent, meaning the occurrence of one does not affect the probability of the other.

  • What is the difference between independent and dependent events in probability?

    -Independent events are those where the occurrence of one does not affect the probability of the other. Dependent events, on the other hand, have probabilities that are influenced by whether the other event has occurred.

  • How are conditional probabilities used in medical screenings, as mentioned in the script?

    -Conditional probabilities are used in medical screenings to determine the likelihood of having a condition, given a positive test result. For example, P(Cancer | Positive Test) represents the probability of actually having cancer given that the test result is positive.

  • Why are probabilities important in everyday decision-making?

    -Probabilities are important in everyday decision-making because they help set realistic expectations and guide choices by estimating the likelihood of different outcomes. They can inform decisions such as planning events, making backups, and assessing risks.

Outlines

00:00

🧠 Understanding Probability and Pattern Recognition

This paragraph introduces the concept of probability through the lens of pattern recognition, specifically pareidolia, where the brain perceives familiar patterns like faces in random objects. It sets the stage for discussing empirical and theoretical probabilities. Empirical probability is derived from observed data and is subject to uncertainty, whereas theoretical probability represents an ideal or universal truth. The example of a slot machine illustrates how empirical probability can estimate theoretical probability. The paragraph also introduces the addition rule of probability for mutually exclusive events, using the selection of Skittles as an example, and explains the use of probability notation, such as P(Red) for the probability of drawing a red Skittle.

05:04

🎰 Probabilities of Independent and Dependent Events

The second paragraph delves into the addition rule for non-mutually exclusive events, using the example of rolling a die and flipping a coin. It explains the need to adjust the addition rule by including the probability of both events occurring together to avoid double counting. The concept of conditional probability is introduced, showing how it can be used to calculate the likelihood of one event given the occurrence of another. The paragraph uses a hypothetical scenario involving actor Cole Sprouse and a 'Free Ice Cream Night' at IHOP to illustrate the multiplication rule and the calculation of probabilities for independent events. It also touches on the importance of conditional probabilities in medical screening, using cervical cancer screening as an example to demonstrate how false positives and negatives affect the interpretation of test results.

10:05

🎬 Applying Probability to Everyday Life

The final paragraph emphasizes the practical application of probability in everyday decision-making. It discusses how understanding probabilities can help in setting realistic expectations and making informed choices, such as choosing movie tickets or applying to colleges. The paragraph also highlights the unpredictability of certain events, like catching red lights, and suggests that while probabilities can't always provide exact outcomes, they are a valuable tool for navigating the uncertainties of life. The host encourages viewers to consider probability as a part of their decision-making process, even when faced with the complexities of real-world scenarios.

Mindmap

Keywords

💡Pareidolia

Pareidolia is a psychological phenomenon where the mind perceives familiar patterns or images, such as faces, where they do not exist. In the context of the video, pareidolia is used to introduce the concept of pattern recognition, which is a fundamental aspect of how our brains process information and is directly related to the study of statistics and probability. The video uses the example of seeing faces in inanimate objects like bell peppers to illustrate how our brains are wired to find patterns, even when they are not present.

💡Probability

Probability is a measure of the likelihood that a particular event will occur. It is a central concept in statistics and is used to make inferences and predictions based on data. The video defines probability and distinguishes between empirical and theoretical probability. Empirical probability is based on observed data, while theoretical probability represents an ideal or true value that may not be directly observable. The video uses the example of playing a slot machine to explain how empirical probability can be estimated from the frequency of an event occurring.

💡Empirical Probability

Empirical probability is the probability of an event based on observed data. It is an estimate derived from a sample of the population and is subject to uncertainty due to the randomness and limited size of the sample. In the video, the ratio of girls in individual families is given as an example of empirical probability, which provides an insight into the true theoretical probability but may not be exactly equal to it.

💡Theoretical Probability

Theoretical probability refers to the true probability of an event occurring, which is considered an ideal or absolute value that exists independently of any particular sample or observation. The video explains that theoretical probability is not directly observable but can be estimated through empirical observations, such as the long-term frequency of winning a jackpot in a slot machine game.

💡Addition Rule of Probability

The addition rule of probability is a principle used to calculate the probability of either one event or another (or both) occurring. It is particularly useful when dealing with non-mutually exclusive events. The video explains the rule using the example of selecting a Skittle, where the probability of picking a red or purple Skittle is the sum of their individual probabilities. The rule is adjusted for non-mutually exclusive events by adding the probability of both events occurring together.

💡Mutually Exclusive Events

Mutually exclusive events are events that cannot occur at the same time. In probability, if two events are mutually exclusive, the probability of both occurring together is zero. The video uses the colors of Skittles as an example, where a Skittle cannot be red and purple at the same time, making these color choices mutually exclusive.

💡Multiplication Rule

The multiplication rule is used to calculate the probability of two or more independent events occurring together. It is based on the idea that the probability of multiple events occurring is the product of their individual probabilities. The video illustrates this with a scenario involving the chance of actor Cole Sprouse being at a local IHOP and it being 'Free Ice Cream Night' on the same day.

💡Conditional Probability

Conditional probability is the probability of an event occurring, given that another event has already occurred. It is denoted as P(Event 1 | Event 2) and is calculated by dividing the probability of both events occurring by the probability of the second event. The video discusses the importance of conditional probabilities in medical screenings, such as determining the likelihood of having cervical cancer given a positive test result.

💡Independence

In probability, two events are considered independent if the occurrence of one event does not affect the probability of the other event occurring. The video explains this concept using the example of Cole Sprouse's presence at IHOP and 'Free Ice Cream Night' being independent events, meaning that the probability of Cole being there is not influenced by whether it is 'Free Ice Cream Night' or not.

💡False Positives and False Negatives

False positives and false negatives are terms used in the context of diagnostic testing. A false positive occurs when a test indicates that a condition is present when it is not, while a false negative indicates that a condition is absent when it is actually present. The video discusses these concepts in the context of cervical cancer screenings, explaining how understanding these probabilities can influence medical recommendations and patient expectations.

Highlights

Pareidolia is the phenomenon where our brains perceive patterns, like faces, in non-face objects.

Probability is introduced as a measure of the likelihood of events occurring.

There are two types of probability: empirical, based on observed data, and theoretical, an idealized concept.

Empirical probability is derived from the ratio of specific outcomes in observed data, such as the number of girls in individual families.

Theoretical probability represents an ideal or universal truth about the likelihood of an event.

The empirical probability can estimate the theoretical probability but is not always equal due to sample randomness.

An example of estimating probability is given by playing a slot machine and calculating the win rate.

The addition rule of probability is explained for calculating the probability of one event or another occurring.

Mutually exclusive events are defined as events that cannot occur at the same time, such as selecting a red or purple Skittle.

Probability notation, such as P(Red), is introduced to simplify the expression of probability calculations.

The probability of getting a red or purple Skittle is calculated using the addition rule, resulting in a 40% chance.

The full addition rule is presented, which accounts for the probability of two non-mutually exclusive events occurring together.

The multiplication rule for probability is introduced to calculate the likelihood of multiple independent events happening simultaneously.

An example using the multiplication rule involves calculating the probability of Cole Sprouse being at IHOP on 'Free Ice Cream Night'.

The concept of independent events is explained, where the occurrence of one event does not affect the probability of another.

Conditional probabilities are discussed, showing the probability of one event given the occurrence of another.

The practical application of conditional probabilities in medical screenings, such as for cervical cancer, is examined.

The importance of understanding probabilities in everyday life decisions, such as planning a night out with friends, is highlighted.

The video concludes by emphasizing the usefulness of probability in setting realistic expectations and making informed decisions.

Transcripts

play00:02

Hi, I’m Adriene Hill, and Welcome back to Crash Course, Statistics.

play00:05

If you’ve ever seen a face in an onion, or a grilled cheese, or any other inanimate

play00:09

object, you’ve experienced pareidolia, which is a product of our brains that causes us

play00:14

to see the pattern of a face in non-face objects.

play00:18

This happens because our brains are so good at seeing patterns, that they sometimes see

play00:22

them when they’re not really there, like a face in this bell pepper.

play00:26

And faces aren’t the only patterns we see.

play00:28

Our brains recognize patterns in everything, especially in sequences of events, like the

play00:33

kind we’ll see today as we start talk about Probability.

play00:37

INTRO

play00:46

Alright, first let’s just establish a more specific definition of what probability is,

play00:52

because the way we use the word in everyday life can be different from how we use it in Statistics.

play00:57

Statisticians talk about two types of probability: empirical, and theoretical.

play01:02

Empirical probability is something we observe in actual data, like the ratio of girls in

play01:07

each individual family.

play01:09

It has some uncertainty, because like the samples in experiments, it’s just a small

play01:14

amount of the data that is available.

play01:15

Empirical probabilities, like sample statistics, give us a glimpse at the true theoretical

play01:20

probability, but they won’t always be equal to it because of the uncertainty and randomness

play01:26

of any sample.

play01:27

The theoretical probability on the other hand, is more of an ideal or a truth out there in

play01:31

the universe that we can’t directly see.

play01:33

Just like we use samples of data to guess what the true mean or standard deviation of

play01:37

the population is, we can use a sample of data to guess what the true probability of

play01:42

an event is.

play01:43

Say you play a slot machine over and over, you’ll be able to guess the probability

play01:49

of winning the jackpot by counting the number of times you win, and dividing it by the number

play01:53

of times you played.

play01:55

If you play 100 times and win 6 times, you can be pretty sure that the probability of getting

play01:59

a jackpot is around 6/100 or 6%.

play02:03

Now this isn’t to say that you can rule out that the true probability is 5% or even

play02:06

10%, but you’re relatively sure that it’s close to 6%, and not, say 99%.

play02:12

So, the empirical probability can be a good estimation of the theoretical one, even if

play02:17

it’s not exact.

play02:18

So far we’ve been talking about the probability of just one event, but often there may be

play02:22

two or more events that we want to consider, like what if you want to know the probability

play02:26

of picking a purple OR a red skittle from a bag.

play02:29

The proportion of each color in a bag of Skittles is roughly equal, 20% for each of the 5 colors.

play02:35

So let’s say you randomly select a Skittle without looking.

play02:38

For this, we need the addition rule of probability.

play02:41

Since a Skittle can’t be two different colors at once, the color possibilities are Mutually Exclusive.

play02:45

That means the probability of a Skittle being red AND purple at the same time is 0.

play02:49

So, we can use the simplified addition rule which says that the probability of getting

play02:53

a Red or Purple Skittle is the sum of the probability of getting a Red, and the probability

play02:59

of getting a purple.

play03:00

Since we’re going to be talking a lot about probability in the next few episodes I’m

play03:03

going to introduce a little notation.

play03:05

Instead of writing out “the probability of Red” we can use the notation P(Red).

play03:11

The probability of getting a red OR purple would then be written P(Red or Purple).

play03:16

So far we know what the probability of Red or Purple is, P(Red) + P(Purple), or 0.2+0.2

play03:22

That equals 0.40 or 40%.

play03:24

I like all skittles so the probability that I will get a skittle I like is 0.2 + 0.2 +0.2

play03:30

+0.2 +0.2.

play03:31

That’s 100%.

play03:33

Good odds!

play03:33

Red and Purple Skittles are mutually exclusive, but not all the events we’re interested in are.

play03:38

For example, if you roll a die and flip a coin, the probability of getting a tails is

play03:42

not mutually exclusive of rolling a 6, since you can both roll a 6 and flip tails in the

play03:47

same turn.

play03:48

Since P(tails or 6) ≠ 0, these two events are not mutually exclusive, and we’ll need

play03:54

to adjust our addition rule accordingly.

play03:56

The full version of the addition rule states that P(tails or 6) = P(tails) + P(6) - P(tails and 6).

play04:05

When two things are mutually exclusive, the probability that they happen together is 0,

play04:10

so we ignored it, but now the probability of both these things happening is not zero,

play04:15

so we’ll need to calculate it.

play04:16

You can see here that there are 12 possible outcomes when flipping a coin and rolling a die.

play04:20

There are 6 outcomes with a tails, and 2 outcomes with a 6.

play04:23

If we add all of those together we get 8, but by looking through the chart, we can tell

play04:27

that there are only 7 possible outcomes that have either a tails or a 6.

play04:32

When we count T’s and 6’s independently, we double count the outcomes that have both

play04:37

If we didn’t subtract off the probability of (tails and 6), we would double count it.

play04:42

Let’s put these probabilities into a Venn Diagram we can see even more clearly why we

play04:46

need to subtract P(Tails and 6).

play04:48

If this Circle is all the Times we flip tails, and this circle is all the times we roll a

play04:53

6, this overlapping area is counted twice if we simply added the two circles together.

play04:59

In this simple case, we could easily see what the probability of tails and 6 is, but sometimes

play05:04

it’s not so easy to figure out.

play05:05

That’s why we have the Multiplication Rule, which helps us figure out the probability

play05:10

of two or more things happening at the same time.

play05:12

Let’s say you just found out that actor Cole Sprouse goes to your local IHOP pretty

play05:17

often, and there’s a 20% chance that he’ll be there for dinner any given night.

play05:20

And yeah, I know that’s not how people work but we’re going to say that’s how Cole

play05:24

Sprouse works.

play05:25

Anyway to top that off, your local IHOP has a promotion where they randomly select certain

play05:30

nights to be “Free Ice Cream Night” in the hopes that customers will keep coming

play05:34

back in case that night is the night.

play05:37

Each night there’s a 10% chance that it will be “Free Ice Cream Night”.

play05:41

Now, you love ice cream and you like like Cole Sprouse--as do we all--and your perfect

play05:46

night would include them both.

play05:47

So you try to calculate the probability that will happen on your visit tonight.

play05:52

Using the multiplication rule, multiply the probability that Cole Sprouse will be at IHOP,

play05:57

0.2, with the probability that it will be Free Ice Cream Night, 0.1.

play06:01

And you come to the sad realization that there’s only a 2% chance that you’ll get to see

play06:07

Cole and get free dessert tonight.

play06:09

When we want to know the probability of two things happening at the same time, we first

play06:12

need to look at only the times when one thing--Cole is at IHOP--is true, which is 20% of the time.

play06:19

Now that we reduced our options to just Cole nights, out of all these Cole times, how often

play06:24

is it free ice cream time?

play06:26

Only 10% of Cole nights.

play06:28

10% of the original 20% leaves only a 2% chance that both will happen at the same time.

play06:33

But you could always change your expectations and calculate the probability of getting either

play06:37

by using the addition rule.

play06:39

Cole or free Ice cream which, is calculated by adding the probability of Cole, to the

play06:44

probability of Free Ice Cream, minus the probability of both--so we don’t double count anything.

play06:49

You realize that there’s a 28% chance that something good will happen tonight, so you

play06:53

decide to still go, no matter what you’re going to get French Toast.

play06:56

Cole Sprouse and Free Ice Cream Night are independent.

play06:59

Cole doesn’t have any secret knowledge about when it’s Free Ice Cream Night, so it has

play07:02

never affected his decision to come.

play07:05

Two events are considered independent if the probability of one event occurring is not

play07:09

changed by whether or not the second event occurred.

play07:13

In more concrete terms, if Cole’s decision to go to IHOP is independent of IHOP’s decision

play07:18

to give out free ice cream, than the probability of Cole showing up should be the same on both

play07:23

ice cream and non ice-cream nights, since he’s just choosing randomly.

play07:27

We write conditional probabilities as P(Event 1 | Event 2).

play07:32

Conditional probabilities tell us the probability of Event 1, given that Event 2 has already happened.

play07:38

If two events are independent--like Cole and ice cream night--then we expect P(Cole | Ice Cream Night)

play07:43

to be the same as just plain ole P(Cole), since the two things are unrelated.

play07:48

If P(Cole | Ice Cream) wasn’t the same as plain ole P(Cole), then that means that Ice

play07:54

Cream night might somehow affect Cole’s decision to show up at IHOP.

play07:58

We calculate conditional probability P( Event 2 | Event 1) by dividing the probability of

play08:04

Event 1 and Event 2 by the Probability of Event 1.

play08:08

The role of conditional probabilities are particularly important when we consider medical screenings.

play08:14

For example, when screening for cervical cancer it used to be recommended that all adult women

play08:18

get screened once a year.

play08:20

But sometimes the results of the screenings are wrong.

play08:22

Either they can say there’s something abnormal when there isn’t (called a false positive)

play08:27

or that everything is all clear when it’s really not (called a false negative).

play08:31

This is exactly the kind of scenario where knowing the likelihood that something is actually

play08:35

abnormal in this case cervical cancer given that you’ve gotten positive tests results

play08:39

would be useful.

play08:41

That is P(Cancer | Positive Test).

play08:44

When looking at the data of people who DON’T have cancer, 3% will get a false positive.

play08:50

And people who DO have cancer will get false negatives 46% of the time.

play08:55

This means we miss a lot.

play08:56

And maybe freak some people who don’t need to be freaked out.

play08:59

The logic of conditional probabilities can help us make sense of why doctors have recently

play09:04

recommended that these tests be done less frequently in some cases.

play09:08

In the United States, the rate of cervical cancer is about 0.0081%, so only about 8 in

play09:15

100,000 women get cervical cancer.

play09:18

Using our rates of false negatives and positives, we can see that for every 100,000 women in

play09:23

the US, only about 4...and we’re rounding here of the about 3,004 people with positive

play09:29

tests actually had abnormal growths.

play09:32

That means the conditional probability of having cancer, given that you got a positive

play09:36

test is only 0.1%. Give or take. We're rounding.

play09:41

And these positive tests require expensive and invasive follow up tests.

play09:45

And I just want to point out that conditional probabilities aren’t reciprocal.

play09:50

That is to say P(Cancer|Pos Test) isn’t the same as P(Pos Test| Cancer) which would be about 50%.

play09:57

In real life you’re not always going to know the probability of Cole Sprouse showing

play10:01

up at the iHop--he’s unpredictable that way.

play10:05

Unpredictable like ...pretty much the rest of life.

play10:07

It can be very difficult to put a specific probability on a lot of everyday situations.

play10:12

Like how likely it is that your teacher will call-in sick today.

play10:15

Like whether or not you’re going to catch all the red lights on your way to school.

play10:18

Probabilities can--as we’ve seen--require a lot of calculations--and there’ not always

play10:23

time for that.

play10:24

But that doesn’t mean they belong only on the school-only side of your brain.

play10:28

Say you want to go out on a Friday night with friends.

play10:30

More than anything you don’t want it to suck.

play10:33

Last week you wound up on the couch watching Sandy Wexler, again.

play10:36

You know it’ll be hard to get tickets to see Black Panther so you make a backup plan

play10:40

just in case.

play10:42

You can always stream Get Out without stealing it of course.

play10:45

But if you’re determined to see Black Panther in the theater, Probability will help set

play10:49

your expectations.

play10:51

If you’ll only settle for center row tickets you’re more likely to be disappointed.

play10:56

Your chance of seeing Black Panther is going to be greater if you’re willing to settle

play10:59

for whatever tickets you can get.

play11:01

Probabilities help us understand why it makes sense to apply to more than one college.

play11:05

Why we should shouldn’t expect that the first short story you write will be get an

play11:09

A and be published in the New Yorker.

play11:12

And how likely it is that you’ll get mono.

play11:14

Given your significant other has mono.

play11:17

Probability can help you figure that out too.

play11:20

Thanks for watching. We'll see you next time.

Rate This

5.0 / 5 (0 votes)

Связанные теги
ProbabilityStatisticsPareidoliaPattern RecognitionEmpirical DataTheoretical ProbabilitySkittlesMutual ExclusivityAddition RuleMultiplication RuleConditional Probability
Вам нужно краткое изложение на английском?