Mean and variance of Bernoulli distribution example | Probability and Statistics | Khan Academy

Khan Academy
29 Oct 201008:20

Summary

TLDRThe video discusses a simple example of a Bernoulli distribution, where a population is surveyed for their opinion on the president. Respondents can either give a favorable or unfavorable rating, and the mean and variance of this discrete probability distribution are calculated. The example demonstrates how to find the expected value and variance using probability-weighted sums, despite the expected value not being a possible outcome. It concludes by introducing general formulas for the mean, variance, and standard deviation in a Bernoulli distribution, setting the stage for further exploration of the binomial distribution.

Takeaways

  • 📊 The speaker conducts a full survey of a population's opinion on the president, offering two possible responses: favorable or unfavorable.
  • 🎯 The probability distribution is discrete with two outcomes: 40% have an unfavorable view, and 60% have a favorable view.
  • 📈 The expected value (mean) of the distribution is calculated by assigning 0 to unfavorable (u) and 1 to favorable (f) views.
  • 🔢 The expected value of the distribution is 0.6, which represents a probability-weighted sum of the two options.
  • 🙅 No individual can have an actual value of 0.6; individuals will either have a favorable or unfavorable rating (0 or 1).
  • 💡 The variance of the population is the probability-weighted sum of the squared distances from the mean.
  • 🔍 Variance is calculated using the differences between each outcome (0 or 1) and the mean (0.6), resulting in a variance of 0.24.
  • 📐 The standard deviation is the square root of the variance, which in this case is approximately 0.49.
  • 🧠 The distribution is skewed to the right, with most individuals having a favorable view.
  • 📚 This specific example introduces the Bernoulli Distribution, a special case of the binomial distribution, which is further explained in future discussions.

Q & A

  • What is the purpose of surveying every single member of a population in this scenario?

    -The purpose is to gather data on the favorability rating of the president, with the aim of understanding the distribution of opinions within the population.

  • What are the two options available for the survey respondents?

    -The respondents can either have an unfavorable rating or a favorable rating for the president.

  • What percentage of the population had an unfavorable rating according to the survey?

    -According to the survey, 40% of the population had an unfavorable rating.

  • What percentage of the population had a favorable rating?

    -60% of the population had a favorable rating.

  • How is the probability distribution represented in this scenario?

    -The probability distribution is represented as a discrete distribution with two values: unfavorable (0) and favorable (1).

  • What is the expected favorability rating of a randomly picked member of the population?

    -The expected favorability rating is the mean of the distribution, which is calculated as 0.4 * 0 + 0.6 * 1 = 0.6.

  • Why is the mean of 0.6 not a value that an individual can actually take on?

    -The mean of 0.6 is not a value that an individual can take on because each person must choose either a favorable or unfavorable rating, which are represented as 1 or 0, respectively.

  • How is the variance of the distribution calculated?

    -The variance is calculated as the probability-weighted sum of the squared distances from the mean. In this case, it is 0.4 * (0 - 0.6)^2 + 0.6 * (1 - 0.6)^2 = 0.24.

  • What is the standard deviation of this distribution?

    -The standard deviation is the square root of the variance, which is approximately 0.49.

  • What does the distribution's skew to the right indicate?

    -The skew to the right indicates that the distribution is not symmetric and that there is a higher concentration of favorable ratings.

  • What is the Bernoulli Distribution mentioned in the script?

    -The Bernoulli Distribution is a discrete probability distribution that takes value 1 with success probability p and value 0 with failure probability q = 1 - p. It is the simplest case of the binomial distribution.

Outlines

00:00

📊 Understanding Population Favorability Ratings

In this paragraph, the speaker explains a scenario where every member of a population is surveyed about their opinion of the president, with only two response options: favorable or unfavorable. After surveying the entire population, 40% give an unfavorable rating, while 60% give a favorable rating. This forms a discrete probability distribution since only two values are possible. The speaker introduces the concept of expected value (mean) and explains how it’s calculated for this distribution, assigning 0 to unfavorable and 1 to favorable. The mean is found to be 0.6, but no individual has this rating; rather, it's an average across the population. This discrepancy between the actual ratings (0 or 1) and the mean (0.6) is explored.

05:03

🧮 Calculating Variance and Standard Deviation in Discrete Distributions

This paragraph introduces variance, which measures how much the data points in the distribution deviate from the mean. Using the same population distribution (40% unfavorable, 60% favorable), the speaker shows how variance is calculated as the probability-weighted sum of squared differences from the mean. The variance is calculated as 0.24, and the standard deviation (the square root of the variance) is 0.49. The speaker also notes that while it's harder to visualize standard deviation in a discrete distribution, it makes sense that the distribution is skewed to the right since more people gave a favorable rating.

Mindmap

Keywords

💡Probability Distribution

A probability distribution is a mathematical function that describes the likelihood of different outcomes in an experiment. In the video, the distribution is discrete, meaning there are only two possible outcomes: a favorable or unfavorable view of the president. The concept of probability distribution helps to quantify how likely each outcome is in a population survey.

💡Discrete Distribution

A discrete distribution refers to a type of probability distribution where the possible outcomes are distinct and separate, as opposed to continuous. In this script, the survey has only two outcomes: favorable (1) or unfavorable (0). This type of distribution simplifies the analysis of binary events like yes/no or success/failure.

💡Mean (Expected Value)

The mean, or expected value, is the average value that would result from repeated trials of an experiment. In the video, the expected value is calculated to be 0.6, representing the favorability rating, even though no individual can be 60% favorable. It’s a theoretical average based on the probability-weighted sum of the possible outcomes.

💡Variance

Variance measures how much the data points in a distribution differ from the mean. In this video, variance is calculated as the probability-weighted sum of the squared differences between each possible outcome and the mean (0.6). It quantifies the spread or dispersion of the favorability ratings from the mean value.

💡Standard Deviation

Standard deviation is the square root of the variance and indicates how much the outcomes deviate from the mean. In the video, the standard deviation of 0.49 shows how spread out the favorability ratings are around the mean. It’s a useful measure for understanding the distribution’s variability.

💡Bernoulli Distribution

A Bernoulli distribution is the simplest case of a binomial distribution, representing a random experiment with exactly two possible outcomes (like success/failure). In this script, the favorable/unfavorable view is an example of a Bernoulli trial, with probabilities 0.6 and 0.4, respectively. The entire video is centered around calculating characteristics of this distribution.

💡Binomial Distribution

A binomial distribution is a probability distribution that summarizes the likelihood of obtaining a fixed number of successes in a set of independent trials, each with the same probability of success. The Bernoulli distribution is a special case of the binomial distribution with only one trial. This concept is introduced in the video to build towards understanding how probabilities are distributed across multiple trials.

💡Weighted Sum

A weighted sum is a summation where each value contributes proportionally to its assigned weight. In this video, the probability-weighted sum is used to calculate both the mean (expected value) and the variance by multiplying each outcome by its probability before summing the results. This allows for more accurate modeling of random variables.

💡Success and Failure

In the context of the Bernoulli and binomial distributions, success and failure refer to the two possible outcomes of a trial. In the video, a 'success' is defined as a favorable rating (with a probability of 0.6) and a 'failure' is an unfavorable rating (with a probability of 0.4). These terms are crucial in binary outcome models like the one discussed.

💡Squared Distance

The squared distance refers to the square of the difference between a given value and the mean, used to calculate variance. In the script, the squared distance is calculated between 0 and 0.6 (the mean) for unfavorable ratings, and between 1 and 0.6 for favorable ratings. Squaring ensures that all deviations from the mean contribute positively to the variance.

Highlights

Surveying an entire population to measure the favorability rating of the president is typically impractical, but hypothetically possible.

The population has two response options: favorable or unfavorable rating.

In this example, 40% of the population has an unfavorable rating, and 60% has a favorable rating.

The probability distribution in this scenario is discrete, with only two possible values.

The mean (or expected value) is calculated as the probability-weighted sum of the possible values of the distribution.

Defining the unfavorable rating as 0 and the favorable rating as 1 allows for calculating the mean of the distribution.

The mean of the distribution is calculated as 0.6, representing the expected favorability rating.

Even though the mean is 0.6, no individual can have a favorability value of 0.6; individuals can only choose 1 or 0.

The mean represents the expected proportion of favorable responses in the population, not an individual outcome.

Variance is defined as the probability-weighted sum of the squared distances from the mean.

To calculate variance, the distances between the possible values (0 and 1) and the mean (0.6) are squared and weighted by their respective probabilities.

The variance is calculated to be 0.24.

The standard deviation of the distribution is the square root of the variance, which is approximately 0.49.

The distribution is skewed to the right, with the mean closer to the favorable rating (1).

This scenario demonstrates a basic case of the Bernoulli distribution, which is the simplest form of a binomial distribution.

Transcripts

play00:00

Let's say that I'm able to go out and survey every single

play00:03

member of a population, which we know is not normally

play00:07

practical, but I'm able to do it.

play00:09

And I ask each of them, what do you think of the president?

play00:13

And I ask them, and there's only two options, they can

play00:16

either have an unfavorable rating or they could have a

play00:23

favorable rating.

play00:28

And let's say after I survey every single member of this

play00:32

population, 40% have an unfavorable rating and 60%

play00:38

have a favorable rating.

play00:39

So if I were to draw the probability distribution, and

play00:44

it's going to be a discrete one because there's only two

play00:46

values that any person can take on.

play00:49

They could either have an unfavorable view or they could

play00:52

have a favorable view.

play00:55

And 40% have an unfavorable view, and let me color code

play01:01

this a little bit.

play01:02

So this is the 40% right over here, so 0.4 or maybe I'll

play01:06

just write 40% right over there.

play01:10

And then 60% have a favorable view.

play01:19

Let me color code this.

play01:21

60% have a favorable view.

play01:23

And notice these two numbers add up to 100% because

play01:26

everyone had to pick between these two options.

play01:29

Now if I were to go and ask you to pick a random member of

play01:34

that population and say what is the expected favorability

play01:38

rating of that member, what would it be?

play01:40

Or another way to think about it is what is the mean of this

play01:44

distribution?

play01:45

And for a discrete distribution like this, your

play01:48

mean or you're expected value is just going to be the

play01:51

probability weighted sum of the different values that your

play01:55

distribution can take on.

play01:56

Now the way I've written it right here, you can't take a

play01:59

probability weighted sum of u and f-- you can't say 40%

play02:02

times u plus 60% times f, you won't get

play02:05

any type of a number.

play02:06

So what we're going to do is define u and f to be

play02:09

some type of value.

play02:10

So let's say that u is 0 and f is 1.

play02:16

And now the notion of taking a probability weighted sum makes

play02:20

some sense.

play02:21

So that mean, or you could say the mean, I'll say the mean of

play02:27

this distribution it's going to be 0.4-- that's this

play02:33

probability right here times 0 plus 0.6 times 1, which is

play02:48

going to be equal to-- this is just going to be

play02:51

0.6 times 1 is 0.6.

play02:56

So clearly, no individual can take on the value of 0.6.

play03:00

No one can tell you I 60% am favorable and 40% am

play03:05

unfavorable.

play03:05

Everyone has to pick either favorable or unfavorable.

play03:09

So you will never actually find someone who has a 0.6

play03:12

favorability value.

play03:13

It'll either be a 1 or a 0.

play03:15

So this is an interesting case where the mean or the expected

play03:18

value is not a value that the distribution can

play03:20

actually take on.

play03:22

It's a value some place over here that

play03:27

obviously cannot happen.

play03:28

But this is the mean, this is the expected value.

play03:31

And the reason why that makes sense is if you surveyed 100

play03:35

people, you'd multiply 100 times this number, you would

play03:38

expect 60 people to say yes, or if you'd summed them all

play03:41

up, 60 would say yes, and then 40 would say 0.

play03:44

You sum them all up, you would get 60% saying yes, and that's

play03:47

exactly what our population distribution told us.

play03:49

Now what is the variance?

play03:50

What is the variance of this population right over here?

play03:53

So the variance-- let me write it over here, let me pick a

play03:57

new color-- the variance is just-- you could view it as

play04:02

the probability weighted sum of the squared distances from

play04:06

the mean, or the expected value of the squared distances

play04:10

from the mean.

play04:11

So what's that going to be?

play04:12

Well there's two different values that

play04:14

anything can take on.

play04:15

You can either have a 0 or you could either have a 1.

play04:19

The probability that you get a 0 is 0.4-- so there's a 0.4

play04:23

probability that you get a 0.

play04:25

And if you get a 0 what's the distance from 0 to the mean?

play04:30

The distance from 0 to the mean is 0 minus 0.6, or I can

play04:35

even say 0.6 minus 0-- same thing because we're going to

play04:37

square it-- 0 minus 0.6 squared-- remember, the

play04:42

variance is the weighted sum of the squared distances.

play04:49

So this is the difference between 0 and the mean.

play04:51

And then plus, there's a 0.6 chance that you get a 1.

play04:59

And the difference between 1 and 0.6, 1 and our

play05:03

mean, 0.6, is that.

play05:07

And then we are also going to square this over here.

play05:13

Now what is this value going to be?

play05:15

This is going to be 0.4 times 0.6 squared-- this is 0.4

play05:21

times point-- because 0 minus 0.6 is negative 0.6.

play05:24

If you square it you get positive 0.36.

play05:30

So this value right here-- I'm going to color code it.

play05:33

This value right here is times 0.36.

play05:37

And then this value right here-- let me do this in

play05:40

another-- so then we're going to have plus 0.6 times 1 minus

play05:46

0.6 squared.

play05:48

Now 1 minus 0.6 is 0.4.

play05:51

0.4 squared is 0.16.

play05:56

So let me do this.

play05:57

So this value right here is going to be 0.16.

play06:01

So let me get my calculator out to actually calculate

play06:05

these values.

play06:09

So this is going to be 0.4 times 0.36, plus 0.6 times

play06:22

0.16, which is equal to 0.24.

play06:29

So our standard deviation of this distribution is 0.24.

play06:37

Or if you want to think about the variance of this

play06:42

distribution is 0.24 and the standard deviation of this

play06:45

distribution, which is just the square root of this, the

play06:48

standard deviation of this distribution is going to be

play06:50

the square root of 0.24, and let's calculate what that is.

play06:55

That is going to be-- let's take the square root of 0.24,

play07:00

which is equal to 0.48-- well I'll just round it up-- 0.49.

play07:07

So this is equal to 0.49.

play07:11

So if you were look at this distribution, the mean of this

play07:16

distribution is 0.6.

play07:18

So 0.6 is the mean.

play07:20

And the standard deviation is 0.5.

play07:23

So the standard deviation is-- so it's actually out here--

play07:28

because if you go add one standard deviation you're

play07:30

almost getting to 1.1, so this is one standard deviation

play07:32

above, and then one standard deviation below gets you right

play07:36

about here.

play07:37

And that kind of makes sense.

play07:38

It's hard to kind of have a good intuition for a discrete

play07:43

distribution because you really can't take on those

play07:45

values, but it makes sense that the distribution is

play07:47

skewed to the right over here.

play07:49

Anyway, I did this example with particular numbers

play07:52

because I wanted to show you why this

play07:54

distribution is useful.

play07:55

In the next video I'll do these with just general

play07:58

numbers where this is going to be p, where this is the

play08:01

probability of success and this is 1 minus p, which is

play08:05

the probability of failure.

play08:07

And then we'll come up with general formulas for the mean

play08:09

and variance and standard deviation of this

play08:12

distribution, which is actually called the Bernoulli

play08:15

Distribution.

play08:15

It's the simplest case of the binomial distribution.

Rate This

5.0 / 5 (0 votes)

関連タグ
Bernoulli DistributionMeanVarianceStatisticsProbabilitySurvey DataStandard DeviationBinomial DistributionMath EducationProbability Theory
英語で要約が必要ですか?