Standard Deviation Formula, Statistics, Variance, Sample and Population Mean

The Organic Chemistry Tutor
12 Feb 201710:21

Summary

TLDRThis educational video script explains the process of calculating standard deviation, both for a population and a sample. It introduces two formulas: one for population standard deviation (using the population mean, μ, and dividing by n) and another for sample standard deviation (using the sample mean and dividing by n-1). The script uses the example of two sets of numbers, 4, 5, 6 and 3, 5, 7, to demonstrate the calculation, highlighting that the set with more spread out numbers will have a higher standard deviation. It also touches on the concept of variance, which is the square of the standard deviation. The video concludes by inviting viewers to explore more educational content on the creator's channel and website.

Takeaways

  • 📐 The video explains how to calculate standard deviation, which measures how spread out numbers are in a set.
  • 🔢 There are two formulas for standard deviation: one for the entire population and one for a sample of the population.
  • 👫 The population standard deviation (σ) is calculated by summing the squared differences from the mean (μ), divided by the number of data points (n), and then taking the square root.
  • 🔄 For a sample standard deviation (s), the formula is similar but divides by n-1 instead of n, accounting for the sample size.
  • 📊 The video uses an example with the sets 4, 5, 6 and 3, 5, 7 to illustrate that the latter has a greater standard deviation due to its numbers being more spread out.
  • 🧮 To calculate the mean, sum all numbers and divide by the count, which for evenly spaced numbers will be the middle value.
  • 📉 The process for calculating standard deviation involves subtracting the mean from each number, squaring the result, summing these squares, dividing by n (or n-1), and taking the square root.
  • 🔍 The video demonstrates the calculation for the set 3, 5, 7, resulting in a standard deviation of approximately 1.63.
  • 📈 The variance is the square of the standard deviation and is calculated by summing the squared differences from the mean and dividing by n (or n-1), without taking the square root.
  • 🌐 The video concludes by mentioning that the process for calculating population and sample standard deviation is similar, with the main difference being the divisor (n vs. n-1).

Q & A

  • What is the population standard deviation formula?

    -The population standard deviation formula is represented by sigma (σ). It is equal to the square root of the sum of the squared differences between each data point and the population mean (μ), divided by the total number of data points (N).

  • How does the sample standard deviation formula differ from the population standard deviation formula?

    -The sample standard deviation formula differs from the population standard deviation in that it divides by (N - 1) instead of N. This adjustment is made to account for the fact that the data set is only a sample of the entire population.

  • What does standard deviation measure?

    -Standard deviation measures how spread out the numbers in a data set are in relation to each other. A higher standard deviation indicates that the numbers are more spread out, while a lower standard deviation indicates that the numbers are closer together.

  • Why does the set of numbers 3, 5, and 7 have a higher standard deviation than the set 4, 5, and 6?

    -The set of numbers 3, 5, and 7 has a higher standard deviation because the numbers are more spread out compared to the set 4, 5, and 6. The differences between each number and the mean are larger in the first set.

  • How do you calculate the mean of a data set?

    -To calculate the mean of a data set, sum all the numbers in the set and then divide by the total number of data points.

  • What are the steps to calculate the population standard deviation?

    -First, calculate the mean of the data set. Then, subtract the mean from each data point and square the result. Sum all the squared differences, divide by the total number of data points (N), and finally, take the square root of the result.

  • How would you calculate the standard deviation for the numbers 3, 5, and 7?

    -First, calculate the mean, which is 5. Then, find the differences between each number and the mean, square those differences, sum them, and divide by the number of data points (3). Finally, take the square root of the result, which gives a standard deviation of approximately 1.63.

  • What is the variance and how is it related to standard deviation?

    -Variance is the square of the standard deviation. It measures the average of the squared differences from the mean. For example, if the standard deviation is 1.63, the variance would be 1.63 squared, approximately 2.66.

  • Why do you use N - 1 in the sample standard deviation formula instead of N?

    -N - 1 is used in the sample standard deviation formula to correct for bias in the estimation of the population variance. This adjustment, known as Bessel's correction, accounts for the fact that a sample may not fully represent the entire population.

  • What subjects does the instructor offer on their website?

    -The instructor offers tutorials on algebra, trigonometry, pre-calculus, calculus, chemistry, and physics on their website.

Outlines

00:00

📊 Introduction to Standard Deviation Calculation

This paragraph introduces the concept of standard deviation, a measure used to quantify the amount of variation or dispersion in a set of values. It distinguishes between two formulas: the population standard deviation (σ) and the sample standard deviation (s). The population standard deviation is calculated as the square root of the sum of the squared differences between each data point and the population mean (μ), divided by the number of data points (n). The sample standard deviation formula is similar but divides by n-1 instead of n. The paragraph sets the stage for an example to compare the standard deviation of two sets of numbers: {4, 5, 6} and {3, 5, 7}, aiming to demonstrate which set has a greater standard deviation.

05:00

🔢 Calculating Standard Deviation: An Example Walkthrough

This paragraph delves into a step-by-step calculation of the standard deviation for the set {3, 5, 7} using the population standard deviation formula. It begins by calculating the mean of the set, which is the middle number, 5. Next, it guides through the process of finding the differences between each data point and the mean, squaring these differences, summing them, and then dividing by the number of data points (n=3). The result is then used to calculate the standard deviation by taking the square root of the sum divided by n. The calculated standard deviation for this set is approximately 1.63, illustrating a methodical approach to understanding the concept.

10:02

📉 Comparing Standard Deviations and Calculating Variance

The final paragraph compares the standard deviation of the set {4, 5, 6} with the previously calculated set {3, 5, 7}. It reinforces the concept that standard deviation measures how spread out numbers are, with the set {4, 5, 6} having a lower standard deviation due to its numbers being closer to each other. The mean for this set is also calculated as 5, and the standard deviation is computed to be approximately 0.816, which is lower than that of the set {3, 5, 7}. The paragraph concludes with an explanation of how to calculate variance, which is the square of the standard deviation, for the set {3, 5, 7}. It also directs viewers to the tutor's website for more educational content on various subjects.

Mindmap

Keywords

💡Standard Deviation

Standard deviation is a measure of the amount of variation or dispersion in a set of values. In the video, it is used to determine how spread out a set of numbers is. The video explains two formulas for calculating standard deviation: one for the entire population and one for a sample of the population. The script uses the example of the numbers 3, 5, and 7 to demonstrate the calculation of standard deviation, which is approximately 1.63, indicating the spread of these numbers from their mean.

💡Population Standard Deviation

Population standard deviation refers to the calculation of standard deviation when considering the entire population. It is represented by the Greek letter sigma (σ) and is calculated by dividing the sum of the squared differences between each data point and the population mean by the number of data points (n). The video script provides a formula and an example to illustrate how to calculate it, emphasizing its use when you have data from the whole population.

💡Sample Standard Deviation

Sample standard deviation is used when calculating the standard deviation from a subset of a larger population. It is denoted by 's' and is calculated similarly to the population standard deviation but with a slight difference: the sum of squared differences is divided by n-1 (the number of data points minus one) instead of n. This adjustment accounts for the fact that the sample may not be entirely representative of the population.

💡Population Mean

The population mean, represented by the Greek letter mu (μ), is the average of all the values in a population. In the video, the script explains that to find the population mean, you sum all the numbers in the dataset and divide by the total number of values. It is used as a reference point in the calculation of standard deviation.

💡Sample Mean

Sample mean is the average of the values in a sample taken from a larger population. It is calculated in the same way as the population mean but is used when you only have a subset of the data. The video script uses the sample mean in the calculation of sample standard deviation, emphasizing its role in estimating the population mean from sample data.

💡Variance

Variance is a measure of dispersion that indicates how much the values in a dataset vary from the mean. It is calculated as the average of the squared differences from the mean. In the video, variance is described as the square of the standard deviation, which provides a measure of the spread of the data points without the units being 'standard deviations' from the mean.

💡Sigma (Σ)

In the context of the video, sigma (Σ) is used as a symbol to represent the sum of a series of values, particularly in the calculation of standard deviation and variance. It is a mathematical symbol that instructs to add up all the terms in a sequence, which is a crucial step in both population and sample standard deviation calculations.

💡Spread

Spread refers to the extent to which the values in a dataset are dispersed or spread out from the mean. The video uses the concept of spread to explain the intuition behind standard deviation. A greater spread indicates a higher standard deviation, which means the data points are farther from the mean, as illustrated with the sets 3, 5, 7 and 4, 5, 6.

💡Number Line

A number line is a visual representation of numbers in a straight line, where each point corresponds to a number. In the video, the number line is used to graphically represent the sets of numbers to compare their spread. The script describes how the numbers 4, 5, and 6 are closer together on a number line compared to 3, 5, and 7, helping to visually demonstrate the concept of standard deviation.

💡Data Points

Data points are individual values in a dataset. The video script refers to data points when explaining how to calculate the mean and standard deviation. Each data point is considered in relation to the mean to determine the differences that contribute to the calculation of standard deviation.

Highlights

Introduction to calculating standard deviation with two formulas: population and sample standard deviation.

Explanation of the population standard deviation formula represented by sigma.

Description of the sample standard deviation formula, using 's' instead of sigma.

Clarification that sample standard deviation is calculated with n-1 instead of n.

Discussion on the concept of standard deviation as a measure of how spread out numbers are.

Visual comparison of two sets of numbers to intuitively determine which has a higher standard deviation.

Calculation of the mean for the set of numbers 3, 5, and 7.

Step-by-step calculation of the population standard deviation for the set 3, 5, and 7.

Explanation of the process to calculate standard deviation using the formula.

Calculation of the variance as the square of the standard deviation.

Instruction to pause the video for the audience to calculate standard deviation for the set 4, 5, and 6.

Calculation of the mean for the set of numbers 4, 5, and 6.

Step-by-step calculation of the population standard deviation for the set 4, 5, and 6.

Comparison of standard deviations between the two sets of numbers to illustrate the concept.

Conclusion of the video with a summary of the methods to calculate population and sample standard deviation and variance.

Promotion of the video creator's channel and website for more educational content.

Transcripts

play00:01

in this video we're going to calculate

play00:03

the standard deviation of a set of

play00:05

numbers

play00:06

now there's two formulas you need to be

play00:07

aware of the first one

play00:10

is the population

play00:11

standard deviation

play00:17

now this formula

play00:19

is represented by the letter sigma

play00:21

that's the standard deviation

play00:24

it's equal to the sum

play00:26

of all the differences between

play00:28

every point

play00:30

in the data set and the population mean

play00:33

the population mean is mu

play00:36

which is this symbol here

play00:41

and then you need to square it

play00:43

divided by

play00:45

n which is

play00:46

all of the

play00:48

numbers in the set

play00:49

and then you got to take the square root

play00:51

of the whole result

play00:52

so that's the population standard

play00:54

deviation

play00:56

the next formula is the sample standard

play00:59

deviation

play01:01

so let's say if

play01:03

you have

play01:05

just a sample of a population not the

play01:07

entire population

play01:09

if you just have a sample data out of

play01:11

the entire data

play01:13

then you want to use this formula

play01:16

s which is the standard deviation

play01:19

is equal to sigma

play01:22

the sum of all of the

play01:24

differences between every point

play01:27

and the mean that's the sample mean

play01:30

in the other equation we had the

play01:32

population mean represented by mu

play01:35

but this is the sample mean which is

play01:37

basically the average of all the data

play01:39

points in the set

play01:41

and then you have to square it but it's

play01:43

going to be divided by n

play01:45

minus 1 as opposed to n

play01:49

and so that's how you calculate the

play01:50

standard deviation of the sample

play01:55

now let's work on an example

play01:58

let's say if we have two set of numbers

play02:00

four five and six

play02:03

and also three

play02:05

five and seven

play02:07

which one has a greater standard

play02:09

deviation let's use the population

play02:11

standard deviation formula

play02:13

but if we had to guess

play02:15

which set of numbers

play02:17

has the greater standard deviation is it

play02:19

the one on the left or the one on the

play02:21

right

play02:23

what would you say

play02:24

we need to understand the basic idea

play02:26

of standard deviation you need to know

play02:28

what it measures

play02:29

standard deviation tells you

play02:31

how far apart the numbers

play02:33

are related to each other

play02:35

so the more spread out they are the

play02:37

greater the standard deviation

play02:40

four five and six are closer to each

play02:42

other than three five and seven

play02:44

and you could tell if you plot them on a

play02:45

number line

play02:47

let's put five in the middle

play02:56

so 4 5 and 6

play02:58

here they are on a number line

play03:01

now in contrast

play03:02

let's put the same numbers

play03:05

on this number line

play03:13

we're going to have 3

play03:15

5 and 7.

play03:17

so if you look at the the red points

play03:20

the points in red are further apart

play03:23

the points in blue they're very close

play03:25

together

play03:26

so therefore

play03:27

four five and six

play03:28

has a lower standard deviation in three

play03:30

five and seven

play03:33

so sigma is low

play03:36

and here the sigma value is high

play03:42

now go ahead and calculate the

play03:45

standard deviation

play03:48

for this

play03:53

for the set of numbers three five and

play03:54

seven

play03:55

so what's the first thing that we should

play03:57

do

play03:59

the first thing that we should do is

play04:00

calculate the mean

play04:03

to find the mean

play04:04

it's going to be the sum

play04:06

of all the numbers divided by 3.

play04:09

now because the three numbers are evenly

play04:11

spaced apart the mean is going gonna be

play04:13

the middle number five

play04:15

three plus five is eight eight plus

play04:17

seven

play04:18

is fifteen

play04:19

fifteen divided by three is five so

play04:22

that's the mean

play04:27

now what should we do next now that we

play04:28

have the mean

play04:32

now think of the formula

play04:35

it's going to be a sigma

play04:38

of every point minus the

play04:40

mean squared

play04:42

divided by

play04:44

n

play04:45

and then all of this is within the

play04:47

square root

play04:49

so here's how to use the equation first

play04:52

we're going to use the first point 3

play04:55

subtract it by the mean and then squared

play04:58

next we're going to take the second

play04:59

point 5

play05:00

subtract it from the mean squared

play05:02

and then it's going to be 7

play05:04

minus 5

play05:06

squared

play05:08

so each of these three points

play05:10

you're going to plug into x sub i

play05:13

and then you're going to square the

play05:14

differences between each of those values

play05:17

and the sigma represents sum so you're

play05:19

going to add every difference

play05:21

that you get

play05:23

or you can add the square of every

play05:25

difference that you get

play05:26

and now let's divide it by n

play05:29

so n is the number of numbers that we

play05:31

have in this set there are three numbers

play05:33

inside

play05:34

so n is 3

play05:35

and then we're going to take the square

play05:36

root of the entire thing

play05:39

three minus five is negative two

play05:42

negative two squared is four

play05:44

five minus five is zero

play05:47

seven minus five is two two squared is

play05:49

four

play05:52

four plus four is eight

play05:54

so we have the square root of eight

play05:56

divided by three

play05:58

and at this point we're going to use the

play06:00

calculator

play06:05

8 divided by 3 is about 2.67 and if you

play06:08

take the square root of that

play06:10

you're going to get 1.63

play06:14

so that's the standard deviation

play06:17

for 3 5 and 7.

play06:20

now let's calculate the standard

play06:22

deviation

play06:23

for the other set of numbers four five

play06:26

and six so why don't you go ahead and

play06:28

pause the video

play06:29

and try this example

play06:31

calculate the standard deviation using

play06:33

the same formula

play06:35

so let's go ahead and begin let's

play06:36

calculate the population mean

play06:39

it's going to be 4 plus 5 plus 6 divided

play06:43

by

play06:44

the number of numbers that we have which

play06:45

is

play06:46

3. four plus six is ten ten plus five is

play06:50

fifteen and we know that fifteen divided

play06:52

by three is five

play06:54

so once again any time the numbers are

play06:56

evenly spread apart the mean is going to

play06:58

be the middle number

play07:02

so now

play07:03

we can calculate the standard deviation

play07:06

so sigma

play07:07

is going to equal the square root but

play07:10

before we do that let's calculate

play07:12

the differences

play07:14

so the first difference that we have the

play07:16

first number is going to be 4

play07:18

and we're going to subtract it from the

play07:19

mean

play07:20

and then square it the next number

play07:22

is 5

play07:24

subtract it from the mean

play07:26

and then square it

play07:27

and then after that the last number is

play07:28

six

play07:29

this is going to be

play07:31

six minus five squared

play07:34

now it's divided by n

play07:36

and let's not forget to take the square

play07:38

root of the entire thing

play07:41

four minus five is negative one negative

play07:44

one squared is simply one

play07:46

five minus five is zero

play07:48

six minus five is one

play07:51

and it's all divided by three one plus

play07:53

one is two

play07:55

so we have the square root of two

play07:57

divided by 3.

play08:00

now 2 divided by 3 as a decimal is about

play08:02

0.67

play08:03

and the square root of 0.67

play08:07

is 0.816

play08:12

so as you can see the standard deviation

play08:14

is less because

play08:16

these numbers are closer to each other

play08:19

they're not far apart from the mean

play08:21

in the other example

play08:22

three five and seven

play08:24

they're further apart from the mean

play08:26

which is five

play08:27

three is two units away from five

play08:30

four

play08:31

is only one unit away from five and

play08:33

that's why the standard deviation is so

play08:35

much less

play08:37

now let's go back to the first example

play08:39

we said that the population standard

play08:41

deviation

play08:42

is approximately 1.63

play08:45

so given this information

play08:47

how can you calculate

play08:49

the variance

play08:52

v a r i a-n-c-e

play08:56

how can we find the variance

play08:58

the variance is simply the square

play09:00

of the standard deviation

play09:03

so 1.63 squared

play09:06

is equal to now keep in mind this is a

play09:09

rounded answer

play09:10

i don't remember what the exact answer

play09:12

was but

play09:14

once you square it it's about 2.66

play09:16

so that's how you can calculate the

play09:18

variance

play09:20

the formula for variance is basically

play09:23

the sum

play09:24

of all the square differences

play09:26

between every point and the population

play09:28

mean

play09:29

divided by n

play09:30

it's basically the same formula without

play09:32

the square root symbol

play09:36

well that's it for this video so now you

play09:38

know how to calculate

play09:40

the population standard deviation and

play09:42

also the sample standard deviation even

play09:44

though we did just one of them the

play09:46

process is the same

play09:48

of finding the other one

play09:50

the only difference is you have n minus

play09:52

one instead of n

play09:54

you also know how to calculate the

play09:55

variance as well

play09:57

so that concludes this video by the way

play10:00

if you want to find more of my videos

play10:01

you can check out my channel or visit my

play10:03

website video.tutor.net and you can find

play10:06

playlists on algebra trade pre-calculus

play10:09

calculus

play10:11

chemistry and physics

play10:13

so those are the subjects that i

play10:14

currently offer right now

play10:16

and if you're interested just feel free

play10:18

to check that out

play10:19

so thanks again for watching

Rate This

5.0 / 5 (0 votes)

Ähnliche Tags
Standard DeviationVariance CalculationStatistical AnalysisMath TutorialData SpreadPopulation MeanSample MeanMathematics EducationCalculation TechniquesEducational Content
Benötigen Sie eine Zusammenfassung auf Englisch?