Data Collection and Presentation | Statistics

Learn With Mayora
20 Jan 202216:19

Summary

TLDRThis video delves into the crucial aspects of data collection and presentation in statistics. It distinguishes between primary and secondary data, detailing methods like direct experiments, interviews, surveys, and observations for gathering primary data. The script also emphasizes the importance of proper citation when using secondary data. For data presentation, it outlines textual, tabular, and graphical methods, including frequency distribution tables and various graphs like histograms, bar graphs, line graphs, and pie charts, to effectively communicate data insights.

Takeaways

  • πŸ” Data collection is essential for statistics and involves gathering primary (first-hand) and secondary (existing) data.
  • πŸ§ͺ Primary data collection methods include direct experiments, interviews, surveys, observations, and registrations.
  • πŸ“Š Tools for data collection can range from experiment logs and interview guides to attendance sheets.
  • πŸ“š Secondary data is borrowed from other researchers and requires proper citation to credit the original source.
  • ✍️ Data presentation involves transforming raw data into formats that are understandable and accessible to an audience.
  • πŸ“ Textual presentation of data involves using narrative and paragraphs to describe the data.
  • πŸ“‹ Tabular presentation, such as frequency distribution tables (FDT), organizes data into classes and frequencies for easy analysis.
  • πŸ“Š Graphical methods like histograms, bar graphs, line graphs, and pie charts provide visual representations of data for quick understanding.
  • πŸ“ˆ Histograms are used for quantitative data and show continuous data flow without gaps between bars.
  • πŸ“Š Bar graphs are suitable for categorical data, allowing gaps between bars to represent different categories.
  • πŸ“ˆ Line graphs, also known as time series, depict data trends over time, highlighting patterns and the steepness of data changes.
  • πŸ• Pie charts illustrate the proportion of components that make up a whole, useful for showing parts of a total like sales or budget allocations.

Q & A

  • What are the two main types of data discussed in the video?

    -The two main types of data discussed in the video are primary data and secondary data. Primary data refers to data collected by the researchers themselves, while secondary data comes from other researchers, which may include published or unpublished materials.

  • What is an example of a tool used for collecting primary data through direct experiment?

    -An example of a tool used for collecting primary data through direct experiment is an experiment log or observation log, which is used to record observations and results during the experiment.

  • How is an interview guide defined in the context of the video?

    -An interview guide, as defined in the video, is an outline of the topics to be discussed with an interviewee. It may contain questions or simple topics that serve as the flow of the interview.

  • What is the purpose of a survey in data collection?

    -A survey is used for collecting primary data through self-administered questionnaires, which can be in the form of checklists or rating scales distributed to respondents to gather their perceptions or data.

  • Can you explain the process of creating a frequency distribution table (FDT) for qualitative data?

    -To create an FDT for qualitative data, list the different categories or responses in the first column as row headers, and then count the frequency or number of occurrences for each category, placing these counts in the corresponding row.

  • What are the common parts of a statistical table?

    -The common parts of a statistical table include the table number and title, column headers, row headers, and the body, which is the intersection of rows and columns containing the data.

  • How does a histogram differ from a bar graph in terms of data representation?

    -A histogram represents quantitative data and does not have gaps between the rectangles, showing a continuous flow of data. In contrast, a bar graph represents categorical or qualitative data and allows gaps between the bars, as these represent different categories.

  • What is the significance of the slope in a line graph?

    -The slope of a line graph indicates the rate of change in the data over time. A steep slope signifies a rapid increase or decrease, while a gentle slope indicates a slower change.

  • Why is a pie graph used for data presentation?

    -A pie graph is used to show how different components make up a whole. It is effective for displaying data that can be divided into subparts, such as the allocation of resources or the composition of sales.

  • What are the three main methods of data presentation mentioned in the video?

    -The three main methods of data presentation mentioned in the video are textual, tabular, and graphical methods. Textual is used for narratives, tabular for key points, and graphical for audience-friendly visual representations.

Outlines

00:00

πŸ“Š Introduction to Data Collection and Presentation

This paragraph introduces the topic of data collection and presentation, emphasizing the importance of data in statistics. It distinguishes between primary and secondary data, explaining that primary data is collected by the researcher, while secondary data is sourced from other researchers' work. Various methods and tools for collecting primary data are outlined, including direct experiments, interviews, surveys, observations, and registration. The paragraph also touches on the necessity of presenting raw data in a way that is understandable to the audience, hinting at different presentation formats to be discussed later.

05:01

πŸ“ˆ Presenting Data: Textual and Tabular Formats

The second paragraph delves into the presentation of data, starting with textual presentation where data is narrated through paragraphs and narrations. It provides an example of how to present data from a class test, detailing the scores and performance of students. The paragraph then transitions to tabular presentation, focusing on frequency distribution tables (FDT). It explains the structure of statistical tables, including table number, title, column headers, row headers, and the body containing the data. An example is given for creating a qualitative FDT for students' comfort food preferences, illustrating how to list categories, count frequencies, and calculate totals.

10:03

πŸ“Š Graphical Data Presentation: Histograms and Bar Graphs

This paragraph explores graphical methods of data presentation, beginning with histograms derived from quantitative frequency distribution tables. It describes the process of creating a histogram using age distribution data, emphasizing the need for continuous flow and no gaps between rectangles. The paragraph then contrasts histograms with bar graphs, which are used for categorical data and allow gaps between bars. It discusses the orientation of bar graphs, whether vertical or horizontal, and how they differ in axis representation. The paragraph also provides guidelines for interpreting histograms and bar graphs, such as looking for continuous data flow and the absence of gaps in histograms.

15:04

πŸ“‰ Advanced Graphical Presentations: Line Graphs and Pie Charts

The final paragraph covers additional graphical data presentation methods, including line graphs and pie charts. It explains that line graphs, similar to histograms, do not touch the zero point on the y-axis and are used to display trends over time. The paragraph advises looking for patterns and noting the slope of the line to understand data behavior. Pie charts are introduced as a way to show how components make up a whole, with examples given for student allowance allocation and business sales components. The paragraph concludes by summarizing the different data presentation methods, highlighting the importance of choosing the right method to make data easily understandable for the audience.

Mindmap

Keywords

πŸ’‘Data Collection

Data collection refers to the process of gathering and measuring information on variables of interest, which is essential for conducting research and analysis. In the video, data collection is discussed in the context of both primary and secondary data. Primary data is collected directly by researchers, often through methods like direct experiments, interviews, and surveys. Secondary data is obtained from existing sources, such as previously conducted research. The video emphasizes the importance of proper data collection as the foundation for accurate and meaningful data analysis.

πŸ’‘Primary Data

Primary data is original information collected directly from the source, such as through experiments, surveys, or interviews. The video explains that primary data is authentic and first-hand, providing a direct insight into the subject of study. It is contrasted with secondary data, which is data that has been collected by others. Examples from the script include using experiment logs or observation logs in a laboratory setting to collect primary data.

πŸ’‘Secondary Data

Secondary data consists of information that has been collected by others and is used for further analysis. The video discusses how secondary data can be obtained from published or unpublished materials, and it highlights the importance of proper citation to give credit to the original data owners. This type of data is valuable for researchers who may not have the resources to collect all data themselves.

πŸ’‘Data Presentation

Data presentation involves the methods used to display data in a way that is clear, understandable, and engaging for the audience. The video covers various data presentation techniques, including textual, tabular, and graphical methods. The goal of data presentation is to make the data accessible and interpretable, allowing the audience to quickly grasp key insights and trends.

πŸ’‘Textual Presentation

Textual presentation is a method of conveying data through written language, such as paragraphs and narratives. The video uses an example of describing student test scores in a class to illustrate textual presentation. This method is particularly useful for providing context and details that may not be as easily conveyed through tables or graphs.

πŸ’‘Tabular Presentation

Tabular presentation organizes data into rows and columns within a table, making it easy to compare and analyze different variables. The video explains how to create a frequency distribution table (FDT) for both qualitative and quantitative data. This method is particularly useful for presenting data that can be categorized or counted.

πŸ’‘Graphical Presentation

Graphical presentation uses visual representations, such as graphs and charts, to display data. The video discusses various types of graphs, including histograms, bar graphs, line graphs, and pie charts, each of which serves a different purpose in data visualization. Graphical presentation is highlighted as an audience-friendly method that allows for quick comprehension of data trends and patterns.

πŸ’‘Histogram

A histogram is a type of graph that represents the distribution of a quantitative variable. The video describes how to create a histogram from a frequency distribution table, with examples showing age groups and their corresponding frequencies. Histograms are useful for visualizing the shape of the data's distribution, such as whether it is skewed to the left or right.

πŸ’‘Bar Graph

A bar graph is a graphical representation of categorical data, where each bar represents a category and its length corresponds to the frequency or value of that category. The video distinguishes between vertical and horizontal bar graphs and explains that they are used to compare different categories. Bar graphs are mentioned in the context of showing the frequency of comfort food choices among students.

πŸ’‘Line Graph

A line graph, also known as a time series graph, is used to display data points connected by straight lines, showing trends over time. The video describes how to interpret line graphs by looking for trends and the slope of the line, which indicates the rate of change. An example from the script includes a line graph showing the trend of student enrollment over several years.

πŸ’‘Pie Chart

A pie chart is a circular graph that shows the relative proportions of parts to a whole. The video mentions pie charts as a way to represent data that can be divided into subparts, such as the allocation of a student's allowance or the components of a business's sales. Pie charts are effective for demonstrating the composition of a whole.

Highlights

Introduction to data collection and presentation in statistics.

Explanation of primary data as data collected by the researchers themselves.

Definition of secondary data as data obtained from other researchers' work.

Direct experiment as a method for collecting primary data with tools like experiment logs.

Interview as a method for primary data collection, using an interview guide as a tool.

Surveys as a self-administered method for collecting primary data through questionnaires.

Observation as a method for collecting primary data using checklists.

Registration as a simple method for collecting primary data, such as attendance sheets.

The necessity of proper citation when using secondary data from other researchers.

Raw data requires presentation methods to be understandable to an audience.

Textual presentation of data through narratives and descriptions.

Tabular presentation as a method for organizing data in tables with classes and frequencies.

Frequency Distribution Table (FDT) as a tool for organizing raw data.

Components of a statistical table including table number, title, column header, row header, and body.

Creating a qualitative FDT by listing categories and counting frequencies.

Quantitative data requires a different approach for constructing FDT, with a separate video provided for details.

Graphical methods as a visual way to present data for quick understanding.

Histogram as a graphical representation of quantitative data from FDT.

Bar graph as a graphical representation for categorical data with gaps between bars.

Line graph for showing trends over time with attention to patterns and slopes.

Pie graph as a way to represent parts of a whole, suitable for data that can be divided into subparts.

Summary of data presentation methods: textual, tabular, and graphical for different audience needs.

Transcripts

play00:00

hello everyone in this video let us talk

play00:02

about data collection and presentation

play00:04

without further ado let's get this

play00:06

lesson started

play00:10

[Music]

play00:16

data collection methods and tools of

play00:18

course when we are talking about

play00:19

statistics we are all dealing with data

play00:22

data set the numbers that needs to be

play00:25

interpreted

play00:26

we cannot interpret something that we

play00:28

have not yet collected so the following

play00:31

are the different data and the methods

play00:33

and tools on how we can collect them

play00:36

when we talk about primary data based on

play00:38

the previous video that i have presented

play00:41

primary data refers to the data that the

play00:43

researchers collected themselves these

play00:46

are authentic first-hand data

play00:50

and on the other hand we have secondary

play00:52

data these are data that came from other

play00:54

researchers

play00:55

it may be from published or unpublished

play00:58

materials

play01:00

when we are dealing with primary data

play01:02

the following are the methods and the

play01:03

tools that we can use to collect them

play01:06

first is by direct experiment say for

play01:10

example we have different set up

play01:12

in laboratories or in classroom

play01:15

and we are observing and taking down

play01:18

notes of the different results or

play01:20

behavior of the things that we are

play01:23

experimenting so what we can use as a

play01:26

tool in experiment are the experiment

play01:28

blogs or observation logs

play01:32

another method of collecting primary

play01:34

data is through interview it can be over

play01:37

the phone or

play01:39

through video conferencing or

play01:41

face-to-face interview

play01:43

and a tool can be the interview guide

play01:46

when we say interview guide it is an

play01:48

outline of the topic that you will be

play01:50

discussing with your interviewee it may

play01:53

contain questions or just simple topics

play01:56

that will be the flow of the interview

play02:00

next we have survey this is more of the

play02:03

self-administered questionnaires

play02:06

it can be in a form of checklist or

play02:08

rating scales which you will distribute

play02:11

to your respondents to get their

play02:13

perception or to get data from them

play02:17

observation say for example you are

play02:20

seated

play02:21

and

play02:22

immersed in a certain setting and you

play02:24

want to observe how the different

play02:27

people interact with each other so you

play02:29

can provide a checklist

play02:32

and you will just put a check on the

play02:34

things that you have noticed

play02:37

and the most simple

play02:40

way of

play02:41

collecting primary data is through

play02:42

registration for example you just want

play02:45

to know how many students attended the

play02:47

synchronous class so you can generate an

play02:50

attendance sheet or a registration sheet

play02:53

at the end or before the activity or

play02:56

before the class

play02:59

next secondary date as i mentioned a

play03:01

while ago these are the data that

play03:04

the other researchers

play03:06

collected so when you are going to

play03:10

borrow these data you have to

play03:13

search no published and unpublished

play03:15

materials

play03:17

and you have to observe proper citation

play03:20

to give credit to the original owner of

play03:22

the data

play03:24

data gathered are raw data they are not

play03:26

yet ready to be presented to the

play03:30

audience so you have to be familiar with

play03:33

the different ways of presenting data

play03:35

first and i think the very simple is the

play03:39

textual presentation

play03:41

from the word itself text while it is

play03:44

presenting the data using paragraphs

play03:47

narration

play03:48

you are narrating the data gathered from

play03:50

the respondents or the participants so

play03:53

for example here in a class of 40

play03:55

students five of them obtained the

play03:58

perfect score of 50 in their third

play04:00

quarter long test in statistics 18

play04:03

students got a score of at least 40

play04:06

while two students scored 19 and below

play04:08

generally the students performed well in

play04:11

the test with 30 or 75 of them obtaining

play04:15

a passing score of at least 38 so that

play04:18

is one good way to present the data

play04:22

there are also other ways like tabular

play04:24

form

play04:25

from the word itself table or frequency

play04:28

distribution table fdt

play04:30

when we talk about fdt this is

play04:33

organizing raw data in table using

play04:35

different classes and frequencies when

play04:37

we say classes it can be

play04:40

qualitative or categorical

play04:42

like

play04:43

city or country where a person lives and

play04:46

when we say frequency it is the number

play04:48

of data

play04:49

that is corresponding to a certain class

play04:53

the count of the data

play04:55

corresponding to the certain class

play04:58

so the following are the common parts of

play05:01

statistical tables

play05:03

first part is the table number and title

play05:06

when we say title it is a short phrase

play05:09

that describes what the table is all

play05:10

about and this part is found above the

play05:14

table

play05:16

column header it

play05:18

talks about the label of each column

play05:20

what does each column represent

play05:24

row header is the label per row what is

play05:27

that row referring to

play05:29

and of course the body the body itself

play05:31

is the intersection of a certain row and

play05:35

a certain column that contains the data

play05:40

we have different fdt depending upon the

play05:43

nature of the data when we have a

play05:46

categorical or nominal or qualitative

play05:51

data something like this now 10 students

play05:53

were asked regarding their comfort food

play05:55

and the data below are their answers

play06:00

we are tasked to find the frequency

play06:02

distribution for the data so what we can

play06:05

do to

play06:07

create the fdt for this is we will just

play06:11

list down the different comfort food

play06:14

that will take up the first column

play06:17

this is the column header for the first

play06:20

column comfort food and the entries on

play06:23

the first column will serve as the row

play06:25

header

play06:26

so we just list down all the possible

play06:29

answers

play06:31

and for the frequency we just count how

play06:33

many students answered that comfort food

play06:36

so for ice cream we have four that's why

play06:39

we have four here

play06:41

french fries we have two donuts we have

play06:44

one and milk tea we have three so at the

play06:47

end of the table of course we put the

play06:49

total 10 students

play06:51

and as you may notice the table also

play06:53

contains the number in the title which

play06:56

is found above it

play06:58

that is how a qualitative or categorical

play07:01

or nominal frequency distribution table

play07:04

looks like

play07:06

of course we know that aside from

play07:08

qualitative data we also have

play07:11

quantitative

play07:12

data so in order to construct the fdt of

play07:16

quantitative data i have made a separate

play07:19

video and you can access it here in the

play07:22

upper right corner you just click on

play07:24

this link

play07:25

i made a separate video about that

play07:28

because it has certain steps that you

play07:30

need to follow

play07:32

the third

play07:34

method of presenting data is the

play07:37

graphical method when we talk about

play07:40

graphs these are visual representations

play07:43

that helps the audience or the viewers

play07:45

understand the data at a single glance

play07:47

so at first look they can get overview

play07:51

of the data is the data increasing or

play07:54

decreasing over time is the data

play07:57

divided into different subparts those

play08:01

are the important key points that the

play08:03

data set may have and can be highlighted

play08:06

by graphical representation

play08:10

first we have the histogram histogram is

play08:14

generated from

play08:16

quantitative frequency distribution

play08:18

table the one that has different class

play08:21

boundaries so for example let's have an

play08:25

excerpt of that fdt quantitative ftt let

play08:29

us just take the columns that we need

play08:32

which are the class boundaries and the

play08:34

frequency let's make a histogram for

play08:37

this

play08:38

so this is actually the distribution for

play08:40

the viewers ages

play08:41

so you may notice we prepare

play08:44

this blank

play08:46

graph

play08:47

for the y-axis we have the frequencies

play08:51

and for the x-axis we have the

play08:53

boundaries

play08:55

we just write here the possible values

play08:58

found in the class boundaries

play09:01

first

play09:02

is 19.5 to 29.5 that is 8

play09:08

so we can draw a rectangle from 19.5

play09:12

until 29.5 and its height should be 8

play09:17

because that corresponds to the

play09:19

frequency of that class for the next

play09:22

class which is 29.5

play09:25

until

play09:26

39.5 its frequency is 9 that's why its

play09:30

height should be 9.

play09:34

for the next class 39.5 until 49.5

play09:39

its frequency is 6 that's why its height

play09:43

is until

play09:44

6.

play09:45

and finally we have 2

play09:47

as the frequency or the height for 49.5

play09:51

to 59.5

play09:52

so the following are the things that you

play09:54

have to take note when dealing with

play09:56

histogram

play09:58

first the histogram does not touch zero

play10:02

the

play10:03

leftmost part or leftmost line of the

play10:07

histogram should not touch the

play10:10

frequencies or the y-axis because it

play10:13

will always start with

play10:15

the lower class bounded here

play10:19

second the histogram shows a continuous

play10:22

flow of data or in other words there are

play10:26

no gaps between the rectangles again let

play10:29

me repeat that for histogram there

play10:32

should be no gaps between the rectangles

play10:36

because as you can see on the class

play10:38

boundaries the data is also continuous

play10:42

the first class ends with 29.5

play10:46

and the second class starts with 29.5

play10:50

so it seems like the ending point of the

play10:52

first class

play10:54

is the starting point of the next one

play10:59

so that is for the histogram it's quite

play11:02

similar to bar graph however bar graph

play11:05

is dealing with categorical or

play11:08

qualitative ftt

play11:11

unlike in the histogram it deals with

play11:15

quantitative fdt

play11:18

since bar graph is derived from

play11:21

qualitative or categorical fdt as you

play11:25

may notice we have here the bars that

play11:28

has gaps between them

play11:31

yes bar graph allows gaps between the

play11:34

rectangles because these are not

play11:37

continuous data

play11:39

different

play11:40

bars refers to different categories

play11:44

bar graph can either be

play11:47

vertically oriented like this one

play11:50

frequency of comfort food

play11:52

it can also be horizontally oriented

play11:54

just like the bar graph on the right the

play11:58

first quarter sales by sales person in

play12:01

us dollars

play12:03

depending upon what is more appropriate

play12:06

with the data set you can either use the

play12:09

horizontal or the vertical bar graph

play12:12

of course when we are dealing with

play12:14

vertical bar graph the y-axis are the

play12:18

frequencies

play12:19

and the x-axis are the different

play12:22

categories

play12:23

the access will be switched

play12:25

if we are dealing with a horizontal bar

play12:28

graph

play12:29

the y-axis will now be the categories

play12:32

and

play12:32

the x-axis will now be the sales or the

play12:36

frequency

play12:39

another type of graphical data

play12:41

presentation is line graph or time

play12:44

series

play12:45

just like histogram line graph does not

play12:49

touch any number on the y-axis or the

play12:52

zero part because it starts with the

play12:54

first entry which is the year

play12:57

also line graph talks about the trend of

play13:01

data over time how the different values

play13:04

behave over time when you are presented

play13:07

with a time series or line graph

play13:09

the following are the things that you

play13:11

have to look

play13:13

first

play13:14

look for trends or pattern

play13:16

is the line decreasing or increasing

play13:20

well of course it has something to do

play13:22

with the behavior of the data if the

play13:25

line is decreasing meaning the data is

play13:28

also decreasing

play13:31

or if the line is rising to the right

play13:34

then the data is also increasing

play13:38

another thing that you have to take note

play13:40

is the slope of the line or the

play13:43

steepness of the line

play13:44

a steep slope

play13:47

talks about a rapid decrease or increase

play13:50

of the data set

play13:52

just like what you can see in this line

play13:54

graph or time series

play13:58

from 2016 to 2017 it's decreasing but

play14:01

not that rapid because the slope is not

play14:04

that steep

play14:06

there was

play14:07

a rapid decrease

play14:09

from 2017 to 2018

play14:11

then it again increased from 2018 to

play14:14

2019 and there was also a sudden drop

play14:18

of enrollees from 2019 to 2020

play14:25

so again for line graph or time series

play14:28

look for pattern and take note of the

play14:31

slope of the line

play14:35

finally we have pie graph pie graph

play14:38

talks about the components and how they

play14:40

make up the whole like what we can see

play14:43

in this pie graph

play14:45

so 30

play14:47

of the students shows milk tea as the

play14:50

comfort food 40 chose ice cream

play14:54

uh 20 chose french fries and 10

play14:57

shows donut so always remember that pie

play15:00

graph is used if the data can be divided

play15:04

into different subparts for example the

play15:07

allocation of allowance of a certain

play15:10

student

play15:11

how much does he allocate for food how

play15:15

much

play15:16

does he allocate for

play15:18

mobile plan

play15:20

for transportation

play15:23

another pie graph can be about the

play15:27

components of sales of a certain

play15:30

business

play15:32

how much of the sales came from a

play15:35

certain business how much came from

play15:37

investment and so on

play15:41

so those are the different ways on how

play15:43

we can present data

play15:45

we have textual which is used more for

play15:49

narratives

play15:50

we have tabular if we need to look at

play15:54

key points

play15:55

that needs to be reported as well and

play15:58

third presentation is the graphical

play16:02

method which is more audience friendly

play16:05

because

play16:06

the viewers can understand the data at a

play16:08

single glance

play16:14

[Music]

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
Data CollectionPresentation MethodsPrimary DataSecondary DataStatisticsData InterpretationResearch ToolsGraphical DataEducational ContentData Analysis