Normal Distribution and Empirical Rule

Sir Vicz
7 Jan 202409:46

Summary

TLDRThis lesson introduces the normal distribution, also known as the Gaussian distribution. It's a symmetric probability distribution with a mean that reflects the data's center. The normal curve's characteristics are defined by its mean and standard deviation, which determine its position and shape. The empirical rule, or 68-95-99.7 rule, is highlighted, stating that nearly all data points fall within three standard deviations of the mean. Specific examples illustrate how to calculate the proportion of data within certain ranges and the proportion of outliers. The script also discusses the binomial distribution's resemblance to the normal distribution, especially when the probability of success is around 0.5 and the number of trials is large.

Takeaways

  • 📊 Normal distribution, also known as Gaussian distribution, is a probability distribution that is symmetric about the mean.
  • 📈 The normal curve is unimodal and bell-shaped, and any specific normal curve is defined by its mean and standard deviation.
  • 🔍 The mean, median, and mode of a normal distribution are all equal and located at the center of the normal curve.
  • 📚 Changing the mean moves the normal curve along the horizontal axis without altering its shape.
  • 📉 Changing the standard deviation affects the 'flatness' or 'stiffness' of the normal curve; a larger standard deviation results in a flatter curve.
  • 📉 The empirical rule, or the 68-95-99.7 rule, states that approximately 68% of the data falls within one standard deviation of the mean, 95% within two, and 99.7% within three.
  • 📐 For a normal distribution with a mean of 150 and a standard deviation of 3, about 68% of data values fall between 147 cm and 153 cm.
  • 📈 Approximately 97.5% of heights are greater than 144 cm when the mean height is 150 cm and the standard deviation is 3.
  • 📊 About 2.5% of students have a height greater than 156 cm under the same distribution parameters.
  • 🔄 The binomial distribution can resemble a normal distribution when the probability of success is around 0.5 and the number of trials is large.

Q & A

  • What is the normal distribution?

    -The normal distribution, also known as the Gaussian distribution, is a probability distribution that is symmetric about the mean. It shows that data near the mean are more frequent than data far from the mean and is represented graphically by a normal curve.

  • What are the key characteristics of a normal curve?

    -A normal curve is symmetric about the mean, has a single peak, is unimodal, and is bell-shaped.

  • How is a normal curve described?

    -A specific normal curve is described by its mean and standard deviation, which determine its location and flatness.

  • What happens when you change the mean of a normal curve?

    -Changing the mean moves the normal curve horizontally along the horizontal axis without changing its shape.

  • How does the standard deviation affect the normal curve?

    -The standard deviation affects how flat or steep the normal curve is. A larger standard deviation results in a flatter curve, while a smaller standard deviation results in a steeper curve.

  • What is the empirical rule or the 68-95-99.7 rule?

    -The empirical rule states that for a normal distribution, approximately 68% of the data falls within one standard deviation of the mean, about 95% within two standard deviations, and 99.7% within three standard deviations.

  • What proportion of data falls outside of three standard deviations in a normal distribution?

    -Only about 0.3% of the data falls outside of three standard deviations in a normal distribution.

  • In the example given, what proportion of grade n students have a height between 147 cm and 153 cm?

    -Approximately 68% of grade n students have a height between 147 cm and 153 cm, as these values are one standard deviation away from the mean of 150 cm.

  • What proportion of students have a height greater than 144 cm according to the empirical rule?

    -97.5% of the students have a height greater than 144 cm, as 2.5% are below one standard deviation from the mean and 95% are within two standard deviations.

  • How does the proportion of students with a height greater than 156 cm relate to the empirical rule?

    -Only about 2.5% of students have a height greater than 156 cm, as this value is three standard deviations away from the mean.

  • When does a binomial distribution resemble a normal distribution?

    -A binomial distribution resembles a normal distribution when the probability of success is approximately 0.5 and the number of trials (n) is large.

  • What factors determine if a binomial distribution can be approximated by a normal distribution?

    -The binomial distribution can be approximated by a normal distribution if the probability of success (p) is close to 0.5 and the number of trials (n) is large enough.

Outlines

00:00

📊 Understanding the Normal Distribution

This paragraph introduces the concept of the normal distribution, also known as the Gaussian distribution. It's a probability distribution that is symmetric about the mean, indicating that data points close to the mean are more frequent than those far away. The normal distribution is represented graphically by a bell-shaped curve. The shape of the curve is determined by the mean and standard deviation. The paragraph explains that changing the mean shifts the curve horizontally, while changing the standard deviation affects its steepness or flatness. It also mentions the empirical rule, or the 68-95-99.7 rule, which states that nearly all data points in a normal distribution fall within three standard deviations from the mean, with approximately 68% within one standard deviation, 95% within two, and 99.7% within three.

05:00

📏 Applying the Normal Distribution to Height Data

In this paragraph, the script discusses a practical example of the normal distribution by applying it to the heights of grade N students in a school. The distribution is normal with a mean height of 150 cm and a standard deviation of 3 cm. The paragraph calculates the proportion of students with heights between 147 cm and 153 cm, which is one standard deviation from the mean, and estimates this to be approximately 68%. It also addresses the proportion of students taller than 144 cm, which is two standard deviations below the mean, estimating this to be 97.5%. Lastly, it considers the proportion of students taller than 156 cm, which is two standard deviations above the mean, and estimates this to be 95%. The paragraph concludes by noting the resemblance of the binomial distribution to the normal distribution when the probability of success is around 0.5 and the number of trials is large, suggesting the use of normal distribution approximation for such cases.

Mindmap

Keywords

💡Normal Distribution

Normal distribution, also known as Gaussian distribution, is a probability distribution that is symmetric about the mean. It is characterized by a bell-shaped curve, which is the most widely recognized distribution in statistics. In the script, it is described as having data near the mean occurring more frequently than data far from the mean. This concept is central to understanding the video's theme as it lays the foundation for the discussion on how data is distributed in a normal curve.

💡Mean

The mean, often referred to as the average, is the sum of all values in a dataset divided by the number of values. It is a measure of central tendency and is crucial in normal distribution as it represents the center of the distribution. The script mentions that changing the mean moves the normal curve horizontally, illustrating its importance in determining the location of the distribution.

💡Standard Deviation

Standard deviation is a measure of the amount of variation or dispersion in a set of values. A low standard deviation indicates that the values tend to be close to the mean, while a high standard deviation indicates that the values are spread out over a wider range. In the context of the script, it is noted that changing the standard deviation affects the 'flatness' or 'stiffness' of the normal curve, thus influencing its shape.

💡Symmetric

Symmetry in the context of normal distribution means that the left side of the curve is a mirror image of the right side. This property is highlighted in the script as a defining characteristic of normal curves, emphasizing that data points near the mean are more frequent than those further away.

💡Bell-shaped Curve

A bell-shaped curve is a graphical representation of the normal distribution, where the peak of the bell represents the mean, and the tails extend to the left and right. The script uses this term to describe the graphical representation of the normal distribution, which is essential for visualizing how data is distributed.

💡Empirical Rule

The empirical rule, also known as the 68-95-99.7 rule, is a statistical concept that applies to normal distributions. It states that approximately 68% of the data falls within one standard deviation of the mean, 95% within two standard deviations, and 99.7% within three standard deviations. The script uses this rule to provide specific examples of how data is distributed in a normal curve.

💡Mode

The mode is the value that appears most frequently in a data set. In a normal distribution, the mode is the same as the mean, which is also the median. The script mentions this to emphasize the centrality of the mean in normal distributions.

💡Unimodal

Unimodal refers to a distribution having a single peak. The script describes normal distribution as unimodal, meaning it has one peak at the mean, which is a key characteristic that distinguishes it from other types of distributions that may have multiple peaks.

💡Z-Score

A Z-score represents how many standard deviations an element is from the mean. The script discusses the concept in relation to the empirical rule, explaining that data points within certain Z-scores (1, 2, or 3 standard deviations from the mean) represent specific percentages of the data set.

💡Binomial Distribution

Binomial distribution is a discrete probability distribution that models the number of successes in a fixed number of independent Bernoulli trials. The script mentions that when the probability of success is approximately 0.5 and the number of trials is large, the binomial distribution begins to resemble the normal distribution, illustrating the connection between these two types of distributions.

💡Normal Approximation

Normal approximation is a method used to approximate the distribution of a binomial random variable with a normal distribution under certain conditions. The script explains that when the number of trials is large and the probability of success is around 0.5, the binomial distribution can be approximated by a normal distribution, which is useful for simplifying calculations.

Highlights

Normal distribution is symmetric about the mean, showing that data near the mean are more frequent.

Normal curves have the same shape and are symmetric with respect to the middle value.

A normal curve is unimodal and bell-shaped.

A normal curve is completely described by its mean and standard deviation.

Changing the mean moves the normal curve horizontally.

Changing the standard deviation affects the flatness or stiffness of the normal curve.

The mean, median, and mode of a normal curve are all equal and located at the center.

The empirical rule (68-95-99.7 rule) states that almost all data falls within three standard deviations from the mean.

Approximately 68% of data falls within one standard deviation from the mean.

Approximately 95% of data falls within two standard deviations from the mean.

Approximately 99.7% of data falls within three standard deviations from the mean.

The remaining 0.3% of data falls outside three standard deviations from the mean.

The area under the normal curve equals 100%, representing the total proportion of data.

For a normal distribution with mean 150 and standard deviation 3, approximately 68% of students have heights between 147 cm and 153 cm.

For the same distribution, 97.5% of students have heights greater than 144 cm.

For the same distribution, 2.5% of students have heights greater than 156 cm.

Binomial distribution resembles a normal distribution when the probability of success is approximately 0.5 and the number of trials is large.

The closer p is to 0.5 and the larger n is, the more the binomial distribution resembles a normal distribution.

When p is close to zero or one and n is small, the normal approximation would yield inaccurate results.

Transcripts

play00:00

our lesson for today is about the normal

play00:03

distribution normal distribution by

play00:06

definition normal distribution or the

play00:08

gausian

play00:10

distribution is a probability

play00:12

distribution that is symmetric about the

play00:14

mean showing that data near the mean are

play00:17

more frequent in occurrence than data

play00:20

far from the mean it is graphically

play00:23

represented by a normal

play00:26

curve so this is an example of a normal

play00:30

curve so symmetric about the mean so

play00:34

whatever

play00:45

manues mean than

play00:49

those mean note that normal curves have

play00:53

the same

play00:54

shape

play00:56

soer symmetric with respect to the

play00:58

middle value

play01:00

it is single Peak

play01:03

unimodal and it is Bell

play01:06

shape any specific normal curve is

play01:09

completely described by giving its mean

play01:12

and standard

play01:14

deviation

play01:19

so or flatness depending on these two

play01:24

values okay your mean and

play01:27

your standard deviation so

play01:33

Al stiffness or flatness so we have here

play01:38

our normal curve okay mean let's say we

play01:42

have zero with the standard deviation

play01:44

equal to

play01:48

1ue

play01:50

mean as you can observe changing the

play01:54

value of mean moves our normal curve

play01:58

horizontally

play02:01

how

play02:03

about

play02:06

stand deviation

play02:12

mean normal

play02:18

curve

play02:20

flat normal curve as observ example the

play02:24

mean is located at the center of the

play02:27

normal curve in fact mean even the

play02:30

median and the

play02:32

mode and they are all

play02:35

equal center of the normal

play02:38

curve changing the mean without changing

play02:41

the standard deviation moves the normal

play02:43

curve along the horizontal

play02:52

axis okay and changing the standard

play02:55

deviation affects how flat or stiff the

play02:57

normal curve is so

play03:00

lesser value standard

play03:05

deviation

play03:06

larger value standard

play03:09

deviation

play03:11

flat normal

play03:15

curve for a normally distributed data we

play03:18

have this empirical rule or sometimes we

play03:21

call the three sigma rule or 6895 99.7

play03:25

rule which states that almost all obser

play03:30

data will fall within three standard

play03:32

deviations from the mean so normally

play03:36

distributed

play03:38

data

play03:41

99.7% data within three standard

play03:44

deviations okay so very small greater

play03:48

than three standard

play03:51

deviations

play03:52

specifically EMP

play03:55

rule Sigma rule okay

play04:00

it states

play04:02

that within one standard deviation from

play04:05

the mean approximately

play04:08

68%

play04:10

dat

play04:12

okay

play04:16

orus at within two standard

play04:19

deviations approximately

play04:22

95% data

play04:24

values and three standard deviations

play04:27

from the mean 99.7% % data

play04:33

values so remaining

play04:37

3% more than three standard deviations

play04:40

from the mean note that proportion is

play04:45

100% okay

play04:47

so area normal

play04:51

curve that is equal to one

play04:55

100%

play04:58

proportion

play05:00

now let's take a look at this example

play05:01

the distribution of heights of grade n

play05:03

students in a certain school is normal

play05:06

with mean equal to 150 and standard

play05:10

deviation equal to three What proportion

play05:13

of the grade n students have a height

play05:15

between 147 cm and 153

play05:20

CM or What proportion of the students of

play05:23

a height greater than 144 CM or What

play05:28

proportion of the student of a height

play05:30

greater than 156

play05:34

CM

play05:36

so proportion my height between 147 and

play05:43

153 observe that 147 and

play05:47

153 are one standard deviation from the

play05:51

mean CU 150 - 3 is 147 150 + 3 is

play05:57

153 since one St standard deviation sha

play06:00

from the mean then we say that

play06:04

approximately

play06:07

68% of the data values fall within 147

play06:12

to

play06:13

153 CM so n Heights no

play06:18

68% Heights n values 147 to 153 how

play06:24

about the proportion of students with a

play06:27

height greater than 144 C CM looking at

play06:31

our normal curve okay observe that you

play06:35

one standard deviation

play06:38

right two standard

play06:40

deviations meaning from 144 up

play06:46

to6 95% na Yan okay 95% based on the

play06:51

empirical rule however P small portion

play06:56

here okay so 95

play07:05

that is 5%

play07:13

okay5 so what is that that is

play07:16

2.5% 5% so if we have

play07:21

95 plus

play07:23

2.5

play07:26

totalan

play07:28

97.5%

play07:29

so

play07:31

97.5% of the heights are greater than

play07:35

144

play07:38

cm and finally What proportion of

play07:41

students have a height greater than

play07:44

156 okay so from

play07:48

here one standard deviation

play07:51

Yan two standard

play07:53

deviations so that is

play07:55

95% so nawan 5% Lang

play08:04

okay then this

play08:06

is

play08:08

2.5% when we deal with a binomial random

play08:11

variable with large number of

play08:14

Trials you might observe that

play08:16

calculating the probabilities will be

play08:18

too difficult or impossible without the

play08:21

use of a statistical

play08:23

tools if we investigate the shape of a

play08:26

binomial

play08:27

distribution we can observe that that it

play08:30

resembles the normal distribution when

play08:33

the probability of success is

play08:34

approximately

play08:36

0.5 and the number of Trials n is large

play08:41

enough in fact the closer p is to 0.5

play08:46

and the larger n is the more it

play08:49

resembles the shape of a normal

play08:58

distribution

play09:00

we can observe from these figures how

play09:02

the values of p and N affects the shape

play09:05

of the

play09:08

distribution however how do we determine

play09:11

if n is already large

play09:13

enough so depend a probability of

play09:17

success this is because when p is close

play09:20

to zero or one and N is small the normal

play09:24

approximation would yield in accurate

play09:28

result

play09:37

figure look at the figure

play09:39

four okay in

play09:41

approximate normal

play09:44

distribution

Rate This

5.0 / 5 (0 votes)

Ähnliche Tags
StatisticsNormal DistributionEmpirical RuleData AnalysisProbabilityGaussian CurveMeanStandard DeviationBinomial DistributionStatistical Tools
Benötigen Sie eine Zusammenfassung auf Englisch?