ENGINEERING DATA ANALYSIS LESSON 1 TYPES OF DATA

Engr. Eking Explains
1 Sept 202013:21

Summary

TLDRThis video script delves into the realm of statistics, tracing its origins to 1749 and highlighting its evolution from state information to a broader analytical discipline. It defines statistics as the science and art of handling numerical data, encompassing collection, presentation, analysis, and interpretation. The script categorizes data into quantitative and qualitative types, further dividing them into continuous, discrete, attribute, and open data, with nominal and ordinal as subsets of attribute data. The importance of understanding these data types for research and analysis is emphasized.

Takeaways

  • πŸ“š The history of statistics can be traced back to 1749, with the original meaning referring to information about states, and it has evolved to include both the collection and analysis of data.
  • πŸ” Statistics is defined as the science and art of dealing with figures and facts, encompassing the collection, presentation, analysis, and interpretation of numerical data from various sources.
  • πŸ“ˆ Statistical data is a sequence of observations made on a sample from a population, and can be either quantitative or qualitative, serving as the foundation for further measurement and analysis.
  • πŸ“Š Quantitative data is measured by values or counts and is expressed as numbers, while qualitative data is non-numerical and characterized by descriptions or observations.
  • πŸ“‰ Continuous data, also known as variable data, can take the form of decimals or continuous values, such as height measurements that can have infinite precision.
  • 🏠 Discrete data, on the other hand, is data that is represented as whole numbers and cannot be divided into smaller units, like family size or class enrollment.
  • πŸ‘• Attribute data is countable and can be recorded for analysis, such as the size of a t-shirt, categorized as small, medium, large, etc.
  • 🌐 Open data is qualitative and allows for a wide range of responses without a specific set of values, such as personal opinions or predictions about the future.
  • πŸ‘₯ Nominal data is used to categorize without implying an order, such as gender, race, or political affiliation, where the categories are distinct and not ranked.
  • πŸ… Ordinal data indicates a ranking or order among categories, such as levels of awareness or skill levels, where there is a clear hierarchy.
  • πŸ“ The process of statistics involves the collection of data, followed by its representation, analysis, and interpretation, which are key steps in understanding and applying statistical concepts.

Q & A

  • When is the history of statistics said to have started?

    -The history of statistics is said to have started around 1749.

  • What does the original meaning of 'statistics' refer to?

    -In early times, the original meaning of 'statistics' was restricted to information about the states.

  • How has the meaning of 'statistics' evolved over time?

    -Over time, the meaning of 'statistics' has evolved to include not only the collection of information about states but also analytical work requiring statistical data.

  • What is the formal definition of statistics?

    -The formal definition of statistics is the science and art of dealing with figures and facts.

  • What are the four main components of the process of statistics?

    -The four main components of the process of statistics are the collection, representation, analysis, and interpretation of numerical data.

  • What is considered as statistical data?

    -Statistical data are a sequence of observations made on a set of objects included in the sample drawn from a population, or it can be defined as quantitative or qualitative values of a variable.

  • What are the two main classifications of data?

    -The two main classifications of data are quantitative and qualitative.

  • How is quantitative data defined?

    -Quantitative data is defined as data that measures values or counts and is expressed as numbers.

  • What is the difference between continuous and discrete data?

    -Continuous data, also known as variable data, can take the form of decimals or continuous values of varying degrees of precision. Discrete data, on the other hand, cannot be represented as decimals and are represented as whole numbers.

  • What are the two types of qualitative data?

    -The two types of qualitative data are attribute data and open data. Attribute data can be counted for recording and analysis, while open data is not given a specific value and can have a wide range of responses.

  • How are attribute data further subdivided?

    -Attribute data is further subdivided into nominal and ordinal data. Nominal data allows making statements only of quality or difference, while ordinal data has a rank or order.

Outlines

00:00

πŸ“Š Introduction to Data Collection and Statistics

This paragraph introduces the video's focus on data collection, explaining the types of data, methods of collection, and a brief history and definition of statistics. It mentions that the history of statistics dates back to 1749, with changes in the interpretation over time. Statistics is defined as the science and art of dealing with figures and facts, involving the collection, presentation, analysis, and interpretation of numerical data from various sources.

05:01

πŸ”’ Understanding Quantitative and Qualitative Data

The second paragraph delves into the classification of data, distinguishing between quantitative and qualitative data. Quantitative data is measured by values or counts and is represented numerically, such as continuous data that can take decimal values like height, and discrete data that is represented as whole numbers like family size. Qualitative data, on the other hand, is non-numerical and describes or characterizes data, collected through observation and interviews. It is further divided into attribute data, which can be counted and analyzed, and open data, which allows for a wide range of responses without specific values.

10:02

πŸ“š Further Classification of Qualitative Data

The final paragraph further breaks down qualitative data into nominal and ordinal categories. Nominal data allows for statements of quality or difference without an inherent order, such as gender, race, or political affiliation. Ordinal data, however, implies a ranking or order, like awareness levels or skill proficiency. The paragraph emphasizes the importance of understanding these data types for proper analysis and interpretation in research.

Mindmap

Keywords

πŸ’‘Data Collection

Data Collection refers to the process of gathering and compiling information from various sources for analysis. In the context of the video, it is the first step in the statistical process, where different types of data are collected using various methods. The video emphasizes the importance of data collection as the foundation for any statistical analysis, highlighting its role in understanding the subject matter.

πŸ’‘Statistics

Statistics is defined in the video as the science and art of dealing with figures and facts. It encompasses the collection, presentation, analysis, and interpretation of numerical data from different sources. The term has evolved from its early meaning, which was restricted to information about states, to its modern interpretation that includes analytical work requiring statistical data. The video uses the history and formal definition of statistics to establish its significance in data analysis.

πŸ’‘Quantitative Data

Quantitative data is characterized by measurements of values or counts that are expressed as numbers. The video script explains that this type of data can be represented as decimals or continuous values, such as height or weight, providing a degree of precision. Quantitative data is a key aspect of statistical analysis, as it allows for numerical comparisons and calculations.

πŸ’‘Qualitative Data

Qualitative data, as described in the video, is non-numerical in nature and is collected through observation, interviews, and other methods that describe or characterize the data. This type of data is important for understanding the attributes or qualities of the subjects being studied, such as opinions, experiences, or descriptions.

πŸ’‘Continuous Data

Continuous data, also known as variable data, can take the form of decimals or have continuous values with varying degrees of precision. The video provides the example of a person's height, which can be measured to a high degree of accuracy, such as 120.15 centimeters, illustrating the concept of continuous data in statistical analysis.

πŸ’‘Discrete Data

Discrete data, in contrast to continuous data, is data that is represented as whole numbers and cannot be divided into smaller units. The video uses examples such as family size or enrollment size, where the number of individuals is counted as whole entities, and cannot be expressed as fractions or decimals.

πŸ’‘Attribute Data

Attribute data is a type of qualitative data that can be counted and recorded for analysis. The video script mentions criteria such as the size of a person's waist or the size of a t-shirt, which can be categorized into small, medium, large, etc., and are used to classify and analyze data based on specific attributes.

πŸ’‘Open Data

Open data is qualitative data that is not given a specific value and allows for a wide range of responses. The video provides the example of a survey question asking how one sees their life 50 years from now, which can elicit an infinite variety of responses, making it an example of open data.

πŸ’‘Nominal Data

Nominal data is a subtype of attribute data that allows for statements of quality or difference but does not imply an order. The video uses examples such as gender, race, and political affiliation, which are categories that distinguish between different groups without a hierarchical order.

πŸ’‘Ordinal Data

Ordinal data is another subtype of attribute data where the members of a group are ranked in a specific order. The video gives the example of ranking in a game or level of awareness, where there is a clear order from the least aware to the most aware, illustrating the concept of ordinal data in statistical analysis.

Highlights

The history of statistics begins around 1749, evolving from state information to a broader analytical field.

Statistics is defined as the science and art of dealing with figures and facts, encompassing collection, presentation, analysis, and interpretation of numerical data.

The term 'statistics' originally referred to information about states, leading to the term 'easter egg statistics'.

Statistical data is a sequence of observations made on a sample drawn from a population, or defined as quantitative or qualitative values of a variable.

Data types are classified based on collection methods into quantitative and qualitative data.

Quantitative data measures values or counts and is expressed as numbers.

Qualitative data is non-numerical, characterized by observation or description, and collected through interviews and similar methods.

Quantitative data is further divided into continuous (variable data) and discrete (discontinuous data).

Continuous data can take the form of decimals or continuous values, such as height measurements.

Discrete data is represented as whole numbers, such as family size or enrollment numbers.

Qualitative data is divided into attribute data, which can be counted and recorded for analysis, and open data, which allows for a wide range of responses.

Attribute data includes nominal data, which allows for statements of quality or difference without order, such as gender or race.

Ordinal data is defined by an operation that allows for a rank order, such as awareness levels or skill rankings.

The importance of data in research studies is emphasized as the foundation for measurement and analysis.

The video will continue to discuss how data is presented after collection in subsequent content.

The video concludes with an invitation to enjoy learning more about statistics.

Transcripts

play00:01

[Music]

play00:14

hey

play00:16

[Music]

play00:26

okay so on this video this question we

play00:29

are going to

play00:30

discuss uh the uh

play00:34

topic about data collections in which we

play00:36

are going

play00:37

to know what are the types of data

play00:41

the methods of collecting data

play00:45

and also we are going to take a little

play00:48

review on the history and the definition

play00:51

of

play00:56

statistics

play01:07

okay so we have the beginnings of

play01:09

statistics so the history of

play01:11

statistics can be said to start around

play01:14

1749

play01:16

although over time there have been

play01:18

changes to the interpretation

play01:20

of the world statistics in early times

play01:22

the meaning was restricted to

play01:24

information about the states in modern

play01:27

terms

play01:27

statistics means both sets of collected

play01:30

information

play01:31

as in national accounts and temperature

play01:33

records and analytical work which

play01:36

requires

play01:36

statistical data so the beginnings of

play01:39

statistics

play01:40

is said to be around 1749 and the

play01:44

previous uh meaning of the word is only

play01:46

about the information about the states

play01:48

so that's why we have the term easter

play01:50

egg

play01:51

statistics okay but later on

play01:54

the uh name or the meaning of the

play01:57

statistics uh go beyond the

play02:01

uh collection of information about the

play02:03

states but also the

play02:04

analytical work which requires specific

play02:06

statistical reference or the

play02:08

interpretation of the information

play02:10

that are collected from the different

play02:12

states hence we have now

play02:14

the word statistics

play02:21

so the formal definition of statistics

play02:24

is the science

play02:25

and art of dealing with figures and

play02:28

facts

play02:29

so statistics is well defined as

play02:31

collection presentation

play02:33

analysis and interpretation of numerical

play02:35

data collected from different sources

play02:38

so is statistics

play02:43

so statistics is a science

play02:46

so that is and also an art so which

play02:49

deals about

play02:50

figures and facts

play02:53

that is the most important terms in this

play02:57

indus first definition

play02:59

of the word statistics and also

play03:02

so the the process of

play03:05

uh statistics or the flow of statistics

play03:08

starts from the collection of data

play03:11

the representation analysis

play03:15

and the interpretation so

play03:18

that is how estatistics

play03:21

works and that is mainly

play03:24

one of the the concepts that we are

play03:28

going to discover or

play03:31

we are going to understand in our

play03:34

subject

play03:35

[Music]

play03:39

okay so we have a statistical data so in

play03:42

the previous

play03:44

slides we learned the definition that uh

play03:47

statistics is the science and art of

play03:50

dealing with figures and data

play03:51

and those figures and data can

play03:54

can be put into one word or in this case

play03:58

two words and that is statistical data

play04:01

so a sequence of observation

play04:04

made on a set of objects included in the

play04:06

sample drawn from

play04:08

our population so that is the first

play04:09

definition of

play04:11

statistical data it can be also defined

play04:13

as quantitative or qualitative value of

play04:16

a variable

play04:17

so meaning could be number images words

play04:19

figures

play04:20

facts or ideas so it is the lowest unit

play04:25

of

play04:25

information from which other measurement

play04:27

and analysis

play04:29

can be done so that is also another

play04:32

definition and data is one of the most

play04:34

important and vital

play04:35

aspects of any research

play04:38

study so those are the definitions of

play04:41

statistical data

play04:46

then we have data types so data types

play04:49

according to

play04:52

the how they are collected so for

play04:55

example

play04:56

the raw data itself so they are

play05:00

classified into different types

play05:04

so if you are going to have a

play05:07

3 3 diagram of that so we'll start with

play05:10

data

play05:11

so the very first two qualification

play05:14

qualification

play05:15

classification of data is quantitative

play05:18

and qualitative so if we are going to

play05:22

define what is quantitative data so it

play05:24

is

play05:25

data are measures of values or

play05:28

counts and expressed as numbers so

play05:32

meaning those data are expressed or

play05:34

represented

play05:35

as numbers so that is for quantitative

play05:38

data

play05:41

for qualitative data so defined as the

play05:44

data that approximates or characterized

play05:47

so meaning it only approximates

play05:50

the data itself or also just

play05:53

characterize or describe the data it is

play05:55

non-numerical in nature

play05:57

and collected through methods of

play05:59

observation one-on-one interviews

play06:01

and similar methods

play06:06

then we can also divide the quantitative

play06:10

data

play06:11

into two groups so we have continuous

play06:15

and discrete so the continuous data also

play06:18

known as the variable data is data that

play06:20

can

play06:21

can take the form of decimals or

play06:24

continuous

play06:25

values of varying degrees of precision

play06:28

so meaning the data can be represented

play06:30

in

play06:31

in numbers but they are we could

play06:35

go to the realm of or to

play06:38

we could express them as decimals so

play06:41

that we could have

play06:42

a degree of precision so meaning example

play06:45

in that sample of a continuous data is

play06:48

height

play06:48

so we could not say that the height of a

play06:50

person is exactly 120 centimeter

play06:54

or equal or

play06:57

just 130 centimeter 120 130 160 150

play07:01

no every person has different height

play07:04

so there are some exactly 120 but the

play07:07

other one

play07:09

might be have a height of 120.15

play07:12

centimeter

play07:13

so that is a type of a continuous data

play07:16

also the other example for holy stick

play07:18

that is the

play07:19

weight here we will go to the script

play07:24

so this script so these are data

play07:27

whose form can take a uh

play07:31

form or the form is cannot be

play07:34

represented

play07:35

as decimals or um they are just

play07:40

represented as whole numbers so for

play07:41

example family size enrollment size so

play07:44

in a family size if you are going

play07:46

if you will be asked the size of your

play07:48

family so we see

play07:49

six persons made up my families

play07:52

so you could not say that six and one

play07:54

half percent

play07:55

or 6.5 percent so that is not

play07:58

how family size is represented suppose

play08:01

is enrollment size you could not say

play08:03

that

play08:04

uh in in a class there are 45.5 students

play08:09

so it is always whole number and that is

play08:12

considered as discrete

play08:14

data or discontinuous data

play08:20

then in the qualitative it is also

play08:23

divided into two

play08:25

so we have attribute and open

play08:28

so we say attribute so data that can be

play08:32

counted

play08:33

for recording and analysis so it could

play08:37

be counted for example

play08:40

the uh the criteria for the height of a

play08:44

person

play08:45

tall

play08:48

small so that is actually the data the

play08:52

size of the waist so or the size of

play08:55

the t-shirt of a person so we could have

play08:58

small video large extra large global xl

play09:00

triple xl

play09:01

so that is an attribute data it can be

play09:04

counted and can be also

play09:06

recorded for analysis now the

play09:10

opposite of this type is the open data

play09:13

so the open data

play09:14

is uh is depending on the sample and

play09:18

that

play09:18

are not given a specific value value

play09:26

so it is not given and plus a specific

play09:30

value the possible set of responses or

play09:32

answers

play09:33

so it is open free so

play09:36

if you are going to give your response

play09:39

so you could have

play09:40

your response anything under the sun

play09:44

so for example if you are going to have

play09:46

a survey form

play09:47

and you are asked that um

play09:51

what uh how do you see

play09:54

your life 50 years from now so that is

play09:58

um considered to be as an open data

play10:01

because

play10:03

there are um maybe infinite

play10:06

number or infinite uh types

play10:09

of responses and that is considered as

play10:11

open so the problem if this was

play10:13

we could not analyze this properly

play10:16

is sometimes um put in a survey form in

play10:20

order

play10:20

for the researcher to have a graph on

play10:24

uh on the nature of the

play10:27

uh of of the respondents

play10:30

okay

play10:34

then the up the attribute data is um

play10:38

further subdivided into two so abnominal

play10:42

and ordinal so the

play10:45

nominal data is defined

play10:48

by an operation which allows making

play10:50

statements

play10:52

only quality or difference so meaning

play10:56

they are just criterias nominal data i

play10:59

just created as for example

play11:00

gender so gender male female race

play11:04

american african european

play11:08

religion so we have our roman catholic

play11:12

islam cristo so on and so forth and have

play11:15

political affiliation so if you are

play11:19

a atmos if you are a democrat or a

play11:22

republican if you are living in the

play11:24

united states

play11:26

so that is nominal data so there is no

play11:29

order

play11:29

only the criteria the quality of the

play11:32

data

play11:33

either it is uh either they are equal

play11:36

or different okay

play11:41

then we have ordinal so ordinal so we

play11:44

could have the

play11:45

um the data defined application whereby

play11:49

the members are grouped

play11:52

uh of a particular group are rough so

play11:54

there is iran game for example awareness

play11:57

so you have those people that are very

play11:59

aware

play12:00

not so aware i feel

play12:04

then for example the

play12:09

level of your dexterity so are you

play12:13

um are you

play12:16

very versatile not so versatile so

play12:21

the the most common example of that is

play12:24

for

play12:25

for this uh for this generation is the

play12:28

ranking on the ml so you can have the

play12:31

mythic one meeting two

play12:33

are you unknown so those ranking is

play12:36

also an ordinal data so there is a wrap

play12:40

there is

play12:41

a order so that is ordinal

play12:44

data so these are the types of data

play12:49

the raw data that are collected

play12:52

then after those data are collected we

play12:54

could also

play12:56

um group them

play12:59

according to how they are now presented

play13:03

in a certain way and that will be

play13:05

discussed on

play13:07

the next video so thank you for watching

play13:10

and as always

play13:11

enjoy learning

play13:21

you

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
StatisticsData CollectionQuantitative DataQualitative DataContinuous DataDiscrete DataAttribute DataOpen DataNominal DataOrdinal DataStatistical Analysis