Application of Mean, Median and Mode [In Real Life] [Uses in Real Life]

Learning Puree
23 Jun 202013:02

Summary

TLDRThis video explores the appropriate use of mean, median, and mode in data analysis. It explains these measures of central tendency and their applications in real-life scenarios, using examples to illustrate when each is most effective. The video emphasizes the importance of data distribution and the presence of outliers in choosing the right statistical measure, offering practical tips for personal and professional decision-making.

Takeaways

  • πŸ“Š Mean, median, and mode are summary statistics used to analyze and summarize data sets.
  • πŸ”’ The mean is the arithmetic average of all values, but it can be skewed by outliers.
  • πŸ“ˆ Median is the middle value in a data set and is less affected by extreme values.
  • 🍦 Mode is the most frequently occurring value in a data set, useful for categorical data.
  • πŸ“š It's important to choose the right measure of central tendency based on the data's distribution and characteristics.
  • πŸ“‰ Outliers can significantly affect the mean, making it less representative of the overall data set.
  • 🌾 In skewed data sets, like exam scores or agricultural output affected by drought, the median can provide a better summary.
  • 🍹 For evenly distributed data, such as monthly sugar usage in a manufacturing process, the mean can be a reliable summary statistic.
  • πŸ‘  Retailers can use mode to identify the most popular shoe sizes to stock based on sales data.
  • 🏠 Mode can also be applied in everyday life, such as organizing household items based on frequency of use.
  • πŸ“‹ The script provides a comparison table to help decide when to use mean, median, or mode based on the data's characteristics.

Q & A

  • What are the three main summary statistics discussed in the video?

    -The three main summary statistics discussed in the video are mean, median, and mode.

  • Why might the mean not be the most appropriate measure to summarize data?

    -The mean might not be the most appropriate measure to summarize data when there are extreme values or outliers, as it can skew the average and provide a misleading representation of the data set.

  • What is the definition of the median in statistics?

    -The median is the middle value in a data set when all the observations are arranged in ascending or descending order. It divides the data into two equal halves.

  • Can you provide an example from the script where the mean would overestimate the performance of a group?

    -An example from the script is the math class where the mean score of 43 overestimates the performance of the class, as most students scored below 40, which is the passing percentage.

  • What is the mode and how is it useful in certain scenarios?

    -The mode is the value that occurs most frequently in a data set. It is useful in scenarios where you need to identify the most common response or the most popular item, such as in consumer surveys or inventory management.

  • How can the median be a better choice than the mean in certain situations?

    -The median can be a better choice than the mean when the data is skewed or has outliers, as it represents the middle value and is not affected by extreme data points.

  • What thumb rules are suggested in the video for using mean and median?

    -The thumb rules suggested are: use the mean when data is evenly distributed without extreme values, when all data points are important, and there are no special circumstances affecting the data. Use the median when data is skewed or has outliers.

  • In what type of data set can the mode be more useful than the mean or median?

    -The mode can be more useful in data sets with categorical variables or when you need to identify the most frequently occurring item or response, as it is not affected by extreme values.

  • What is a potential application of the mode in a retail setting?

    -A potential application of the mode in a retail setting is to determine the most popular shoe size or clothing size to stock more of, based on past sales data.

  • How can understanding the concepts of mean, median, and mode help in personal finance?

    -Understanding these concepts can help in personal finance by allowing you to analyze expenses, set budgets, and monitor spending patterns, such as using the mean to budget for monthly household expenses.

  • What is the importance of observing the range and distribution of data when choosing a measure of central tendency?

    -Observing the range and distribution of data is important because it helps determine the presence of outliers or skewness, which can influence the choice between mean, median, or mode for an accurate representation of the data.

Outlines

00:00

πŸ“Š Understanding Mean, Median, and Mode in Data Analysis

This paragraph introduces the video's focus on when and where to use mean, median, and mode in real-life scenarios. The speaker addresses the common tendency to use the mean but questions its appropriateness in all situations. A quick recap of these statistical measures is provided, with a reference to a previous video for further understanding. The importance of choosing the right measure of central tendency based on the data's distribution and presence of outliers is emphasized through examples, such as students' exam scores and food grain production data.

05:03

πŸ“ˆ Appropriate Use of Median and Mean in Data Summarization

This section discusses the appropriate application of median and mean in summarizing data. It uses the example of a math class's exam scores to illustrate how the mean can overestimate performance, while the median provides a more accurate representation. Similarly, the food grain production example shows how the mean can underrepresent performance due to extreme values. The speaker provides thumb rules for using mean and median, suggesting the mean is suitable for evenly distributed data without outliers, while the median is beneficial when data is skewed or has outliers.

10:05

🍦 Practical Applications of Mode in Consumer Preferences and Inventory Management

The final paragraph explores the use of mode in determining the most popular ice cream flavor through consumer research and in managing a retail shoe store's inventory. It explains how mode can identify the most frequently occurring item in a dataset, aiding in decision-making for product manufacturing and inventory stocking. The speaker also highlights that mode is particularly useful for categorical data and less applicable for continuous data without repeating values. The paragraph concludes with a tip on using mode to organize household items and a call to action for viewers to engage with the content.

Mindmap

Keywords

πŸ’‘Summary Statistics

Summary statistics are numerical measures that summarize and describe the general features of a dataset. In the video, summary statistics like mean, median, and mode are discussed as tools to analyze data and comment on expected results. They are essential for understanding the central tendency of the data, which is a core theme of the video.

πŸ’‘Mean

Mean, also known as the arithmetic average, is calculated by dividing the sum of all values in a dataset by the number of observations. The video discusses the mean as a common measure used to summarize data but also points out its limitations, such as being influenced by outliers, as seen in the example of the math class marks.

πŸ’‘Median

Median is the middle value of a dataset when the observations are arranged in ascending or descending order. The video illustrates the use of the median in scenarios where the data is skewed, such as the example of the farm produce data, where the median provides a more accurate representation of the central tendency than the mean.

πŸ’‘Mode

Mode refers to the value that appears most frequently in a dataset. The video explains the application of mode in situations where one is interested in the most common or popular item, such as choosing a popular ice cream flavor based on consumer preferences.

πŸ’‘Skewed Data

Skewed data refers to a dataset where the distribution of values is not symmetrical and leans towards one extreme. The video uses the term to describe datasets with outliers that can distort the mean, such as the math exam scores example, where one student's exceptionally high score skews the mean.

πŸ’‘Outliers

Outliers are data points that are significantly different from other observations in the dataset. The video mentions outliers as extreme values that can affect the mean, making it a less reliable measure of central tendency in certain scenarios, such as the food grain production example.

πŸ’‘Categorical Data

Categorical data consists of values that are categories or groups rather than numerical values. The video discusses the use of mode with categorical data, such as the ice cream flavor preferences, where mode is a suitable measure to determine the most popular choice.

πŸ’‘Continuous Data

Continuous data refers to numerical values that can take on any value within a range. The video explains that mode is less useful with continuous data, as it is difficult to find repeating values, and suggests using mean and median instead.

πŸ’‘Budgeting

Budgeting in the context of the video refers to the process of estimating future expenses or production costs based on past data. The video uses the example of a soft beverage manufacturer using the mean of past sugar utilization to budget for production expenses.

πŸ’‘Distribution

Distribution in statistics refers to the way values in a dataset are spread out. The video emphasizes the importance of observing the distribution of data when choosing a measure of central tendency, as evenly distributed data is more suitable for the mean, while skewed data may require the median or mode.

πŸ’‘Professional and Personal Growth

The term is used by the channel 'Learning Puri' to describe its aim of providing tips and tutorials to help viewers grow in both professional and personal aspects of life. The video's content on statistical measures for data analysis contributes to this goal by teaching viewers how to make informed decisions based on data.

Highlights

Summary statistics like mean, median, and mode are often used to analyze data and comment on expected results.

People often default to using the mean, but it may not always be the most appropriate measure.

Mean is the arithmetic average of all values, useful when data is evenly distributed without outliers.

Median is the middle value in a data set and can be more representative when data is skewed.

Mode is the most frequently occurring value and is helpful for categorical data or when identifying popular items.

In a math class example, using the mean overestimates the class performance due to a skewed distribution.

For food grain production data, the mean under-represents performance due to extreme values from drought years.

Median provides a more accurate representation of central tendency in skewed data sets.

In the case of sugar utilization data for a soft beverage manufacturer, the mean is a suitable measure due to even data distribution.

Thumb rules for using mean and median are provided to help decide which measure to apply in different scenarios.

Mean can be used to study household expenses for budgeting and setting expense targets.

Mode is used to identify the most popular ice cream flavor based on consumer preferences.

A retail shoe store owner uses mode to determine the most sold shoe sizes and optimize stock.

Mode is less useful for continuous data where repeating values are less likely, unlike mean and median.

Mode can help in organizing a household by categorizing items based on frequency of use.

The video provides a comparison table for choosing between mean, median, and mode based on data characteristics.

Practical applications of mean, median, and mode are demonstrated through real-life examples for better understanding.

Transcripts

play00:00

whenever we are asked to analyze data

play00:02

and comment on the expected result or

play00:04

the most common response we tend to find

play00:07

a number that would summarize the result

play00:09

we tend to use summary statistics like

play00:11

mean median or mode most often people

play00:15

jump at using mean or the arithmetic

play00:17

average but is it the most appropriate

play00:20

measure to use so what are the

play00:22

applications of mean median and mode in

play00:24

real-life scenarios recently I had

play00:26

subscribers asking me about when and

play00:28

where to use mean median and more and

play00:31

that's what we are going to do in this

play00:33

video so let's get rolling I have

play00:40

explained the concept of mean median and

play00:42

mode in this video over here you can

play00:45

watch it to know the fundamentals of

play00:47

mean median and mode however since the

play00:50

applications of mean median and mode in

play00:52

real-life scenarios were not adequately

play00:54

addressed I decided to do the honors I'm

play00:57

finished and you're watching my channel

play00:58

learning puri where you will get tips

play01:00

and tutorials to help you grow faster in

play01:02

a professional and personal life so if

play01:05

you're new here consider subscribing to

play01:07

the channel click the Malayan to get

play01:09

notified every time I post a video your

play01:12

small gesture tells YouTube to push this

play01:14

video to more like-minded people alright

play01:17

before we proceed here is a quick recap

play01:19

of what these measures of central

play01:21

tendency mean mean is an arithmetic

play01:24

average of all the values where we

play01:26

divide the sum of all the values by the

play01:29

number of observations median is the

play01:32

middle value when all the observations

play01:34

are lined up in an ascending or a

play01:36

descending order and mode is the value

play01:39

with the most number of occurrences to

play01:41

get a detailed understanding of the

play01:43

basics of descriptive statistics you

play01:46

should watch this video over here for

play01:48

which the link is posted in the

play01:49

description below and infocard above I

play01:52

would encourage you to watch the current

play01:54

video till the end to discover some

play01:56

useful tips and interesting features I

play01:59

have discussed about these metrics along

play02:01

the way now that is done and s-type

play02:03

let's look at where we can use them

play02:06

whenever we are asked to analyze data

play02:08

and comment on the expected result or

play02:10

the most common response

play02:13

to find a number that would summarize

play02:14

the result we tend to use summary

play02:17

statistics like mean median or mode most

play02:20

of the people jump at using mean or the

play02:23

arithmetic average but is it the most

play02:26

appropriate measure to use let's look at

play02:28

this example in a math class the marks

play02:31

obtained by 13 students during the exam

play02:33

are as follows the teacher wanted to

play02:36

know how the class was faring overall

play02:38

the immediate tendency is to take a mean

play02:41

or an arithmetic average of the marks

play02:43

obtained by students in this case it

play02:46

will be forty three point one for

play02:47

approximately four three this is

play02:50

obtained by dividing 3:02 which is the

play02:52

sum of all the box divided by the number

play02:54

of students obviously all except one

play02:57

student have scored below 40 which is

play03:00

the passing percentage and they seem to

play03:02

be having a tough time in the class if

play03:05

the teacher used this average of 43 to

play03:07

describe the performance of the class

play03:08

then she would be overestimating the

play03:11

performance of the class on the higher

play03:13

side based on how the class actually

play03:15

performed this clear it cannot be true

play03:17

right in this data set

play03:19

there is one number that is 99 which is

play03:22

polarized to one extreme here the data

play03:25

is described as not evenly distributed

play03:27

or skewed now let's take another example

play03:30

the government was studying food grain

play03:32

output of one state in a country across

play03:35

seven years from 1969 to 1975 the output

play03:39

measured in metric tonnes is in this

play03:40

table across seven years from 1969 to

play03:43

1975 if we summarize the performance for

play03:46

this data set using mean then we would

play03:48

have thirty three point five metric tons

play03:50

of average food grade production across

play03:53

the seven years like usual this average

play03:56

is obtained by dividing two 34.5 the sum

play03:59

of production across seven years by

play04:01

seven using this average figure the

play04:03

performance for five years appears above

play04:06

average barring the two years that is

play04:08

1971 and 1972 where it is below average

play04:12

this again under represents the

play04:15

performance in both these cases we

play04:17

discussed we observed two peculiar

play04:19

features in the data first the data is

play04:23

skewed to one end that is there are

play04:24

extreme values or outliers

play04:26

the data and second the arithmetic

play04:29

average assigns equal weight edge to

play04:32

each data point in the data set that is

play04:34

we divide each data point in both the

play04:37

data sets by seven which is the number

play04:39

of observations however here's the

play04:41

caveat in the first example of marks in

play04:44

the math exam we have data set that is

play04:46

polarized or skewed to one end in the

play04:49

second data set of food grain production

play04:51

years 1971 and 1972 were two years that

play04:55

the state faced an extreme drought

play04:56

resulting in lower produced both these

play04:59

two cases have Extra Ordinary scenarios

play05:02

in the second case the data is not only

play05:05

skewed but also equal importance is

play05:07

being given to two years of drought

play05:09

against five years of good rainfall

play05:12

during the comparison it's like

play05:15

comparing apples to oranges now let's

play05:17

look at doing something different for

play05:20

the math exam performance if the teacher

play05:22

wanted to summarize the performance she

play05:24

could use the median when we use the

play05:26

median we get thirty five as a middle

play05:29

figure for the class performance there

play05:31

are exactly 50% of students above and

play05:34

below this figure hmm this seems to work

play05:41

for the farm produced data using median

play05:44

45 works out to be the middle figure 45

play05:47

provides an acceptable average

play05:49

production across years despite the two

play05:52

years of drought so there are instances

play05:55

when not jumping at taking the

play05:57

arithmetic average or the mean is

play05:59

actually beneficial now that we are

play06:02

aware when not to jump at using the

play06:04

arithmetic average here is one more

play06:06

example let's say you're working for a

play06:09

manufacturer of soft beverage like Pepsi

play06:11

or coke who uses refined sugar in the

play06:15

manufacturing process seven months of

play06:17

data of sugar utilization in their

play06:19

factory is laid out in this table and

play06:21

now you are tasked to budget for likely

play06:25

production expenses in the year to come

play06:27

so with our newfound understanding let's

play06:29

work it out we observe that unlike the

play06:31

early to cases the data is not skewed to

play06:34

one or any end the data is evenly spread

play06:37

or distributed across the seven months

play06:40

that means that there are no extreme

play06:42

values to deal with in this data set the

play06:45

arithmetic average of this data set

play06:46

comes to 2 1 9 metric tons obtained by

play06:49

dividing 1/5 free--free in the sum of

play06:51

the data by 7 on a simple comparison

play06:55

with the data set this value appears to

play06:57

be very acceptable the company can

play06:59

safely use this figure of 2 benign

play07:01

metric tons as an average sugar

play07:03

utilization for manufacturing in

play07:06

budgeting for their production expenses

play07:08

to help you figure out when to use mean

play07:10

and median I have summarized a few thumb

play07:13

rules for using mean and median over

play07:16

here in this table we can safely

play07:18

conclude that we can use the arithmetic

play07:20

average or mean to summarize our data in

play07:23

the following instances first when we

play07:26

have evenly distributed data that is the

play07:29

data is not skewed or does not have

play07:31

extreme values second when we need to

play07:34

use all the data available and thirdly

play07:36

there are no special circumstances like

play07:38

the drought and agricultural produce

play07:40

affecting the importance given to each

play07:43

data point and in case if you are

play07:44

violating the conditions 1 & 2 we can

play07:47

consider using the medium as promised

play07:49

here is a quick tip do you know we can

play07:52

use the mean to study expenses like food

play07:54

and electricity consumption in the

play07:56

household to budget for expenses every

play07:58

month this way you can monitor and set

play08:00

expense targets to plan for your savings

play08:03

before we move on to discussing the

play08:05

application for mode if you have found

play08:07

value in this information so far please

play08:09

like and share the video with your

play08:11

friends and acquaintances this motivates

play08:14

me to create more good content for you

play08:16

on the other hand it makes you look cool

play08:18

when you help your friends to improve

play08:20

their understanding I know yes the

play08:22

manufacturer of the soft beverage

play08:24

company is reminding you to subscribe to

play08:25

this channel and click on the bell icon

play08:27

to get notified every time a video is

play08:30

posted on this channel now in this next

play08:34

example we have a manufacturer of ice

play08:36

cream who wants to choose a popular

play08:38

flavor of ice cream for manufacturing he

play08:41

employed a consumer research firm and

play08:43

conducted a survey of hundred people

play08:45

they were asked to choose the most light

play08:48

flavor from amongst four flavors of ice

play08:50

cream the choice given to the consumers

play08:53

worth shop

play08:53

vanilla strawberry and peach so how do

play08:57

you choose the flavor that would most

play08:59

appeal to a group of people well from

play09:02

this data it's a no-brainer

play09:04

that many people chose the chocolate

play09:06

flavor so when we choose the most common

play09:09

response or the most frequently

play09:11

occurring response we are using the mode

play09:13

did you find this amazingly simple then

play09:16

here's one more case a retail shoe store

play09:18

owner has observed that every month he

play09:21

has many shoe sizes that are not sold at

play09:23

all or stay in the stock for a long time

play09:26

he often must carry out an unseasonal

play09:28

sale offer to clear the stocks

play09:30

maintaining a long-standing stock or

play09:33

dead stock has become commercially

play09:35

unviable for him he had observed that

play09:38

sales for each shoe size were fairly

play09:40

consistent across each month so he

play09:43

decided to take the average of past 12

play09:45

months he undertook a statistical

play09:47

analysis of various shoe sizes sold by

play09:50

him and created the following table the

play09:53

table depicts the average monthly sales

play09:55

in number of shoes sold for each shoe

play09:58

size the analysis indicated to him that

play10:01

size is 6 7 & 8 are the most sold sizes

play10:05

these where the modes of the data set we

play10:07

can therefore see that we can have more

play10:10

than one mode in a data set here's

play10:13

something interesting using the

play10:14

frequency of the sold items we observe

play10:17

that even size 10 is sold in

play10:20

sufficiently large numbers therefore we

play10:23

can advise the retail shoe store manager

play10:25

to even stock size 10 mode is a useful

play10:28

measure in case we wanted to decide on

play10:30

more than one summary item not just that

play10:33

it can also help you choose when you

play10:35

have data on a categorical variable like

play10:38

the one we saw in the case of ice cream

play10:40

flavor preferred like the median mode

play10:43

does not get affected by extreme values

play10:46

however when you have continuous values

play10:48

like one twenty two point two thirty

play10:50

four point five thirty seven point nine

play10:53

and so on there is less likelihood of

play10:55

obtaining a repeating value and finding

play10:58

a mode is either difficult or impossible

play11:01

mode is a useless measure in such

play11:04

instances therefore continuous values

play11:07

are best

play11:07

my mean and median as against mood so

play11:10

here's the comparison table updated for

play11:12

mood you can look up this video on basic

play11:15

statistics to gain an understanding on

play11:17

categorical and continuous data the link

play11:20

for this is posted in the description

play11:22

below as well as in the info card above

play11:24

so all right here's an important tip so

play11:27

next time you're confused on which

play11:29

measure of central tendency or average

play11:32

to use first consider observing the

play11:35

range and the distribution of the data

play11:37

this will give you a far better

play11:39

understanding combining with the thumb

play11:41

rules in the comparison table I have

play11:43

shared with you to decide which measure

play11:45

of average to apply in a real life

play11:48

scenario before I share another quick

play11:50

tip with you if you have not yet

play11:51

subscribed do consider subscribing and

play11:54

if you did find value in this video so

play11:57

far don't forget to like and share the

play11:59

video with your friends and

play12:00

acquaintances all your gestures motivate

play12:03

me to keep on churning out more good

play12:05

content for you so as promised here is

play12:08

another quick tip do you know you can

play12:10

categorize the items and tools use in

play12:13

your household based on mood to help you

play12:16

organize the household the most used

play12:18

items can stay in the most reachable

play12:21

places as against the less used items

play12:23

similarly you can even clear the

play12:25

household of unused and less used items

play12:28

by selling them off and replacing them

play12:30

with something more useful this will

play12:33

help you clear an awful amount of

play12:35

clutter in your house so leave a less in

play12:37

the comment below if you have used more

play12:39

to help you organize your household also

play12:42

do let me know in the comments below by

play12:43

tidying yes if you have used mode in any

play12:46

other place other than your profession

play12:48

and yes where have you used it to know

play12:51

more about how data is collected you

play12:54

could watch this video over here thanks

play12:56

for watching see you in the next video

play12:58

till the next time stay healthy and stay

play13:00

peaceful

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
Data AnalysisCentral TendencyStatistics TipsMean MedianMode ApplicationDescriptive StatsSkewed DataOutliers ImpactReal-life StatsBudgeting AdviceConsumer Research