Statistical and Critical Thinking

Nick Conti
12 Jan 202227:42

Summary

TLDRThis video script emphasizes the importance of critical thinking in statistics, highlighting the need to complement textbook material with supplementary resources. It underscores the significance of active learning, such as writing definitions by hand and analyzing examples. The script warns against common pitfalls like confusing correlation with causation and the unreliability of self-reported data. It also addresses the impact of question order and phrasing on responses, the challenges of non-response and low response rates, and the potential bias in voluntary response samples. Additionally, it provides practical examples to illustrate the correct use of percentages and the distinction between statistical and practical significance.

Takeaways

  • 📚 The video is a supplement to textbook material, not a replacement, emphasizing the importance of reading and understanding textbook content.
  • ✍️ Writing out definitions and examples by hand is recommended for better memory retention and understanding.
  • 🔍 Diagrams and charts should be copied by hand to help organize information and facilitate recall.
  • 🧐 The video highlights potential pitfalls in statistical analysis, such as confusing correlation with causation.
  • ⚠️ Correlation does not imply causation, a key concept to remember when interpreting statistical data.
  • 🤔 The accuracy of self-reported data can be questionable, emphasizing the need for measured data over reported data.
  • 🗳️ Loaded questions can lead to biased responses, so it's important to frame questions neutrally to avoid influencing outcomes.
  • 🔄 The order of questions can impact responses, suggesting that the sequence in which options are presented matters.
  • 🔒 Non-response and voluntary response samples can introduce bias, so it's crucial to consider who is responding and why.
  • 📉 Low response rates can lead to unreliable data, so strategies to increase engagement and response are important.
  • 🔢 Understanding the mechanics of percentages is crucial for accurate statistical analysis and interpretation.

Q & A

  • What is the primary purpose of the lecture videos mentioned in the transcript?

    -The lecture videos are supplementary to the textbook material, meant to highlight important concepts, provide examples, and clarify potential confusion points, rather than replacing the textbook.

  • Why is it recommended to write out definitions by hand when studying from a textbook?

    -Writing definitions by hand is believed to help the brain process and remember them more effectively.

  • What is a potential pitfall when dealing with correlation in statistics as discussed in the transcript?

    -A potential pitfall is confusing correlation with causation. The transcript emphasizes that correlation does not imply causation, meaning that two variables being correlated does not necessarily mean one causes the other.

  • How can the way a question is worded impact the responses in a survey?

    -The transcript points out that loaded questions, which are biased towards a certain answer, can sway responses. Additionally, the order in which options are presented can influence the choices people make.

  • What is a non-response in the context of a survey and why is it a concern?

    -A non-response occurs when respondents ignore a question or a survey altogether, which can introduce bias into the data by excluding certain perspectives and potentially skewing the results.

  • Why might self-selected samples in a survey be unreliable?

    -Self-selected samples can be unreliable because they are only from people who volunteer to respond, which may introduce bias if those who choose to respond have different characteristics than those who do not.

  • What strategies can be used to prevent low response rates in surveys?

    -To prevent low response rates, surveys should present an engaging argument for their importance, be quick and easy to complete, and potentially offer a reward for participation.

  • What is the difference between statistical significance and practical significance as outlined in the transcript?

    -Statistical significance is achieved when study results are unlikely to be due to random chance, often using a threshold of a 5% chance or less. Practical significance, on the other hand, refers to whether the findings make a meaningful enough difference to be considered useful or worth implementing.

  • How can percentages be misleading in statistical analysis as discussed in the transcript?

    -Percentages can be misleading if they are not accurately representing the actual data or if they are presented without context, such as not considering the total number of subjects in a study.

  • Why is it important to understand the mechanics of converting between fractions, decimals, and percentages in statistics?

    -Understanding these conversions is crucial for accurate data analysis and interpretation. It ensures that data is presented and understood correctly, avoiding misrepresentations that can lead to incorrect conclusions.

Outlines

00:00

📚 Textbook Study and Video Lectures

This paragraph emphasizes the importance of textbook study as a foundational step in learning statistical and critical thinking. It suggests writing out definitions by hand to aid memory, reading examples to reinforce understanding, and copying diagrams or charts to organize information. The role of video lectures is to supplement textbook material by highlighting key points and providing examples, rather than replacing the textbook. The paragraph also introduces the concept of pitfalls in statistics, such as the common mistake of confusing correlation with causation.

05:01

⚠️ Pitfalls in Data Analysis

The second paragraph delves into potential pitfalls in data analysis, particularly the misuse of correlation and causation. It uses the example of wet parking lots and rain to illustrate that correlation does not imply causation. The paragraph also warns against relying on self-reported data, which can be inaccurate, and stresses the importance of physical measurements for more reliable data. Additionally, it touches on the issue of loaded questions and the order of questions in surveys, which can influence respondents' answers.

10:05

🔍 Further Data Analysis Pitfalls

This paragraph continues the discussion on data analysis pitfalls, focusing on the dangers of non-response and voluntary response samples, which can introduce bias into the data. It also addresses the issue of low response rates and suggests strategies to prevent them, such as making surveys engaging, quick, and offering rewards for participation. The paragraph concludes with a warning about misleading percentages and the importance of understanding how percentages are used and calculated in statistical contexts.

15:06

👀 Observational Data and Bias

The fourth paragraph contrasts the reliability of observational data versus self-reported data, using the example of hand washing habits in public restrooms. It highlights the potential for bias in self-reported data due to social desirability and suggests that observing behavior directly can yield more accurate results. The paragraph also examines the potential for bias in research funding sources and the importance of considering the source of study funding when evaluating study outcomes.

20:08

📊 Statistical Significance vs. Practical Significance

This paragraph introduces the concepts of statistical significance and practical significance. It explains that statistical significance refers to the likelihood that study results are not due to chance, while practical significance pertains to whether the results are meaningful enough to be useful in real-world applications. The paragraph also discusses the importance of understanding the difference between these two types of significance and provides examples to illustrate the concepts.

25:08

🔢 Fractions, Decimals, and Percentages

The sixth paragraph focuses on the mathematical relationships between fractions, decimals, and percentages. It provides an example of converting a fraction to a decimal and then to a percentage, emphasizing the importance of understanding these conversions in statistical analysis. The paragraph also discusses the interpretation of percentages in the context of people or items, suggesting that results should be rounded to the nearest whole number when dealing with discrete entities.

❓ Suspicious Percentage Claims

The final paragraph addresses the potential for suspicious or incorrect percentage claims in research findings. It uses an example of a treatment's effectiveness in a small group to illustrate how percentages can be misused or misreported. The paragraph emphasizes the importance of understanding the mechanics of percentages to correctly interpret and question research findings.

Mindmap

Keywords

💡Statistical Thinking

Statistical thinking refers to the approach of analyzing and interpreting data using statistical methods. In the video, it's emphasized as a crucial skill for understanding patterns and making informed decisions. The video discusses how statistical thinking can help identify pitfalls in data analysis, such as confusing correlation with causation.

💡Correlation

Correlation is a statistical term that describes the degree to which two variables are linearly related. The video clarifies that a correlation between two variables does not imply that one causes the other, a common pitfall in interpreting statistical data. An example given is the correlation between wet parking lots and rain, where the wetness does not cause the rain.

💡Causation

Causation refers to a cause-and-effect relationship between events or variables. The video stresses that while correlation can suggest a connection, it does not prove causation. This is important in statistical analysis to avoid drawing incorrect conclusions about what causes what, as seen in the discussion about the relationship between sour cream consumption and motorcycle fatalities.

💡Pitfalls

Pitfalls in the context of the video are errors or difficulties that can arise in statistical analysis. The video outlines several pitfalls, such as confusing correlation with causation, relying on reported data versus measured data, and the influence of question wording on survey responses. These pitfalls highlight the need for careful consideration in data collection and interpretation.

💡Sample

A sample is a subset of a larger population that is studied to make inferences about the whole. The video explains the importance of understanding the difference between a sample and a population, and how samples should be representative to draw accurate conclusions. An example is given where 1048 adults are surveyed to infer about hand-washing habits among all adults.

💡Population

Population in statistics refers to the entire group that is the subject of a study. The video script uses the term to contrast with 'sample', indicating that while a sample is a smaller, manageable group, the population is the larger group that the sample is meant to represent. Understanding this concept is key to generalizing findings from a sample to a population.

💡Voluntary Response Sample

A voluntary response sample is a type of sample where participants choose to respond, potentially leading to biased results. The video warns about the unreliability of such samples because those who volunteer may have different characteristics than those who do not, thus skewing the data. An example is given where a survey posted on the USA Today website receives responses that may not be representative of all internet users.

💡Non-Response

Non-response refers to the absence of data from individuals who were supposed to participate in a study but did not. The video discusses how non-response can introduce bias into a study, as those who choose not to respond may differ systematically from those who do. It's a critical consideration in ensuring the representativeness and reliability of survey data.

💡Question Order Effect

The question order effect is a phenomenon where the way questions are ordered can influence the responses. The video provides an example where the order in which traffic and industry are mentioned in a question about air pollution affects which is seen as more contributing. This illustrates how the design of a survey can impact the results obtained.

💡Percentage

A percentage is a way of expressing a proportion or a rate per hundred. The video explains the importance of understanding percentages in the context of data interpretation, emphasizing the need for accuracy in calculations and reporting. The video also highlights potential issues with misleading percentages, such as a researcher claiming a treatment works in 53% of a group of 20 subjects, which mathematically does not add up.

Highlights

Videos serve as supplementary material to textbooks, not replacements.

Importance of reading textbook definitions and examples thoroughly.

Writing out diagrams and charts by hand to aid memory and understanding.

Videos help highlight textbook information and provide examples.

Analyzing data while avoiding pitfalls such as confusing correlation with causation.

Correlation does not imply causation, exemplified by wet parking lots and rain.

Physical evidence is required to justify causality, not just statistical analysis.

Potential inaccuracy in self-reported data versus measured data.

Loaded questions can lead to biased responses.

Order of questions can influence responses, as shown in traffic versus industry pollution example.

Non-response and voluntary response samples can introduce bias.

Low response rates can create bias and affect the usefulness of data.

Strategies to prevent low response rates include engaging surveys, brevity, and rewards.

Misleading percentages can occur if not properly understood or calculated.

Understanding the difference between statistical significance and practical significance.

The flaw in assuming causation from correlation, as seen in the sour cream and motorcycle fatalities example.

Converting fractions to decimals and percentages with examples.

Interpreting percentages in context, such as rounding to nearest whole numbers for people.

Suspicion raised by a researcher's claim of a treatment working in 53% of a small group.

Transcripts

play00:02

this video is about statistical and

play00:03

critical thinking and these lecture

play00:06

videos are meant to be supplementary to

play00:08

the textbook material and so they're not

play00:10

meant to replace

play00:12

anything you might be able to get from

play00:13

the textbook but just go alongside it to

play00:16

help highlight some important things

play00:17

give some examples and so let's be

play00:20

also very clear that the first step we

play00:22

do in any section or topic should be to

play00:25

read the textbook now that involves

play00:28

probably a few steps that is write out

play00:30

all definitions by hand

play00:32

when you write them with your hand your

play00:34

brain does better processing remembering

play00:36

them

play00:37

and then secondly we need to read all

play00:39

the examples and note or write any

play00:41

questions or points of confusion often a

play00:43

textbook will have all kinds of examples

play00:45

in them we should be reading them to

play00:46

understand and reinforce the definitions

play00:49

or topics being discussed

play00:51

in the section and then thirdly write

play00:53

out

play00:54

that is copy by hand any diagrams or

play00:56

charts often these can help organize

play00:59

information and so we have to have those

play01:00

written out so that our brain remembers

play01:02

them and knows where we can go back and

play01:04

find them

play01:05

and so that's always a step we should do

play01:08

so this this kind of like taking notes

play01:10

from the textbook again is the

play01:13

the main task in any topic or section

play01:16

and then these videos will help

play01:18

highlight some of the information in the

play01:20

textbook

play01:21

so then secondly let's talk about some

play01:24

important highlights in this topic or

play01:25

section

play01:26

now we're going to talk about analyzing

play01:28

data

play01:30

and specifically some potential pitfalls

play01:32

that could occur okay pitfalls are like

play01:33

dangers right

play01:35

and in the case of statistics

play01:37

we have to be very careful about how we

play01:39

handle certain things such as

play01:41

correlation

play01:43

and

play01:44

causation

play01:46

so specifically here correlation

play01:49

does

play01:51

not

play01:53

imply causation

play01:55

okay so first of all these are a lot of

play01:57

fill in the blanks kind of definitions

play01:59

and examples that you'll find in these

play02:01

video notes and the the note guided or

play02:04

templates that you can use to fill in

play02:05

the blank should be available so you can

play02:07

actually follow along and fill in

play02:08

yourself

play02:10

but they do

play02:11

purpose to highlight and emphasize some

play02:13

of these definitions now correlation

play02:16

is different than causation as noted

play02:19

here in this first potential pitfall for

play02:22

example

play02:24

wet parking lots may correlate

play02:28

that is have a connection with rain but

play02:31

they don't cause the rain right if you

play02:33

drive by and see a parking lot that is

play02:35

totally wet

play02:37

it may be reasonable to conclude that it

play02:39

rained recently and there may be other

play02:41

clues that could help

play02:43

you know inform that conclusion but

play02:46

just because the parking lot's wet

play02:48

doesn't mean the parking lot caused the

play02:50

rain it simply goes with the fact that

play02:52

it may have range right

play02:54

and so we have to be careful not to

play02:56

confuse these it's

play02:58

in statistics often important to find

play03:00

correlations or things that have

play03:02

connections

play03:03

but we can't jump too much from that

play03:06

to some big you know causal conclusion

play03:09

here that may or may not be true and so

play03:11

as our author notes here secondly

play03:14

statements about causality that is

play03:16

what's causing something else can be

play03:18

justified by physical evidence

play03:21

but not by statistical analysis and so

play03:24

we just have to be very careful not to

play03:26

lean a little too far over our skis on

play03:28

the statistical side and

play03:31

take from a correlation which may

play03:33

actually occur

play03:34

and conclude from it some

play03:36

specific causal relationship they may

play03:39

not actually

play03:41

be impliable

play03:44

secondly

play03:45

we also have to be careful

play03:48

so first was that correlation does not

play03:50

cause or does not imply causation

play03:53

secondly

play03:55

we could have a potential pitfall if

play03:57

sample data is

play03:59

reported

play04:04

reported instead of

play04:10

measured

play04:11

okay so one example of this would be if

play04:14

you're trying to do some kind of

play04:15

statistical analysis that involves

play04:16

getting

play04:17

people's weights so let's say you know

play04:19

the weights of everyone in a classroom

play04:21

or everyone in a building or something

play04:23

well if you simply ask people to report

play04:26

their weight

play04:28

as you can probably imagine we all kind

play04:30

of want to look and sound a little

play04:32

better than we actually are so you may

play04:34

get inaccurate numbers

play04:36

people may round down or round up

play04:38

depending on the scenario right

play04:40

whereas if you were to measure it such

play04:43

as have an organized study where every

play04:45

participant is like standing on a scale

play04:48

the same scale and it's being measured

play04:50

by a third party independent you know

play04:52

person

play04:53

looking at the weight on the scale you'd

play04:55

end up with a much more accurate

play04:57

set of of measurements or data

play05:00

and then thirdly we also have to be

play05:02

careful of what could be called

play05:05

loaded questions

play05:11

okay and loaded questions are

play05:14

ones which give away or kind of lean

play05:17

in a certain direction

play05:19

and our author gives us a really good

play05:21

example of one so there was a

play05:22

set of questions given where the first

play05:25

one said

play05:26

should the president have the line item

play05:29

veto to eliminate waste

play05:31

now 97 percent of respondents said yes

play05:36

okay but then when the question was

play05:38

phrased should the president have the

play05:40

line item veto or not

play05:44

only 57 said yes

play05:46

so notice the difference between the way

play05:48

these two questions were asked the first

play05:50

one went a little further it was loaded

play05:53

with some extra information right should

play05:54

the president have line item video veto

play05:57

to eliminate waste

play05:59

okay well you're you're loading the

play06:01

question with a specific

play06:04

purpose or application that may sway

play06:06

people more than the initial part of the

play06:09

question right maybe people say should

play06:11

they see this question and think well

play06:13

yeah of course we should eliminate waste

play06:14

so i'm going to say yes to that whereas

play06:16

the second question

play06:18

simply said should the president have

play06:19

line item veto or not that's a totally

play06:22

different way of looking at that same

play06:24

kind of question but it's not loaded

play06:26

with any specific application and you

play06:28

can see there's a difference in

play06:29

responses right that to the second one

play06:31

only 57 yes so so maybe that was getting

play06:34

more at the responses pertaining to the

play06:36

line item veto uh ability as as opposed

play06:39

to the you know desire to eliminate

play06:41

waste

play06:46

and next we should discuss or consider

play06:48

the order

play06:52

of questions

play06:54

that are provided

play06:55

so again our author gives us another

play06:57

example the first question reads

play07:00

would you say that traffic contributes

play07:03

more or less to air pollution than

play07:05

industry

play07:07

well in response to that question 45

play07:09

percent blames traffic

play07:12

and notice that was the first

play07:15

possible factor mentioned in the

play07:17

question

play07:18

whereas

play07:20

only 27 percent

play07:22

blamed industry and that was the last

play07:25

thing or second thing mentioned and so

play07:27

the order in which things are mentioned

play07:28

in a question has the potential to

play07:31

impact the way people respond to them

play07:34

or you'll notice in the second

play07:37

instance of this

play07:39

this time when the question was asked

play07:41

would you say that industry contributes

play07:45

more or less to air pollution than

play07:47

traffic well notice this time

play07:50

the order was reversed so this time

play07:52

industry was was mentioned first

play07:55

and then

play07:56

traffic was mentioned second and guess

play07:59

what the actual responses were reversed

play08:02

this time

play08:03

24 blame traffic 57 blamed industry and

play08:07

so it could be the case in in this

play08:10

scenario this example that the order of

play08:12

the items mentioned the question

play08:14

actually impacted the responses and so

play08:17

you know we have to be clear when

play08:20

gathering and working with data and we

play08:22

want to just be aware for these pitfalls

play08:23

that could give us totally different

play08:26

outcomes and analyses simply based on

play08:28

the way a question may be worded

play08:32

next we have to consider

play08:34

non-response

play08:40

if we have respondents ignoring a

play08:42

question or ignoring a survey

play08:45

then it's quite possible

play08:47

that having these non-responses may

play08:49

already filter

play08:51

the data by what types of people respond

play08:54

or not right if people are less likely

play08:56

even to respond then we're not gathering

play08:58

or capturing

play09:00

their authentic response and that may

play09:02

influence

play09:04

the overall results in a study or

play09:07

a data gathering process

play09:09

and secondly recall as in the textbook

play09:11

here something called voluntary response

play09:13

sample or self selected sample

play09:16

can be unreliable or biased for the

play09:19

exact same reason so a self

play09:20

self-selected sample would be something

play09:23

where you're getting

play09:25

feedback and it could be authentic

play09:27

feedback

play09:28

but it's only feedback from people who

play09:30

volunteered

play09:32

to respond to the question well the very

play09:34

fact that they may have volunteered

play09:36

could have been influenced by some other

play09:38

characteristic that's going to skew the

play09:40

data or give it some bias so we have to

play09:42

be very careful about non-responses

play09:44

which is another way of saying voluntary

play09:46

response sample as mentioned by our

play09:49

author

play09:52

next we should be aware or be careful

play09:55

with

play09:56

low

play09:58

response

play10:00

rates

play10:05

so again if you're trying to gather data

play10:06

from people and you're requiring some

play10:08

kind of response to communicate that

play10:11

if you have a very low response rate

play10:14

um then that's a fair reason to to

play10:17

wonder whether the the the information

play10:20

gathered it's going to be useful or not

play10:22

and so

play10:23

as noted by our author this can create

play10:25

bias in the results but to help prevent

play10:28

low response rates there are a couple or

play10:31

a few strategies

play10:32

that could be used so um first of all we

play10:35

could serve a survey should present an

play10:37

engaging argument for its importance um

play10:40

surveys are ignored more often than

play10:43

they're embraced likely very likely

play10:46

and so if we can get an engaging or you

play10:49

know context or motivation for the

play10:52

survey that can actually help a lot

play10:54

or secondly a survey should should not

play10:56

be very time consuming right quick and

play10:59

easy is what people like and that's a

play11:01

more likely to get a response

play11:04

than if it's some you know 30 page

play11:06

or even just three page survey we want

play11:09

to be

play11:10

conducive to people getting it done

play11:12

quickly

play11:13

or thirdly

play11:15

it's helpful to provide a reward for

play11:17

completing a survey and this is often

play11:18

done in studies

play11:20

where

play11:21

sometimes respondents may even get paid

play11:23

for their feedback it just depends on

play11:25

the scenario but these are things to

play11:26

keep in mind uh when we're trying to

play11:28

gather data that low response rates can

play11:31

cause problems so we should be careful

play11:33

with them uh and by the way what is

play11:35

actually low or not could depend on the

play11:37

context and so those are all

play11:39

also just considerations

play11:41

that we'll be going over as we encounter

play11:44

these

play11:46

next we should consider

play11:48

misleading percentages

play11:56

misleading percentages now

play11:59

the mechanics of percentages is not

play12:02

something uh

play12:04

taught exclusively in this class so

play12:06

likely at this point you've already

play12:08

worked with percentages we'll review

play12:09

some of it

play12:10

with some examples here shortly but

play12:12

percentages are important now note that

play12:14

a percent

play12:17

really means divided by a hundred this

play12:18

is just a strict mathematical definition

play12:22

four percent we have to be careful that

play12:25

that's how it's being used in any

play12:27

statistical context

play12:29

and secondly let's recall that

play12:31

converting between decimals percentages

play12:34

and fractions

play12:35

is important sometimes we'll have to

play12:36

actually do that in statistics

play12:39

and we might want to make sure it's done

play12:40

correctly

play12:43

now let's spend a little time going over

play12:45

some examples now these examples will go

play12:47

with the content of this topic or

play12:48

section and

play12:50

some of it may you know touch on things

play12:52

that weren't explicitly highlighted in

play12:54

this video lecture so far there may have

play12:56

been things from the textbook that

play12:58

you've already been reading or taking

play12:59

notes on and so this will help you know

play13:01

give you some more to think about and

play13:03

process in these topics all right so

play13:06

this is coming from our author and

play13:08

our first question says in a survey of

play13:10

1048 adults conducted by bradley

play13:13

corporation

play13:15

subjects were asked how often they wash

play13:17

their hands when using a public restroom

play13:20

and seventy percent of the respondents

play13:22

said always

play13:24

okay so we want two parts here for part

play13:26

a identify the sample and the population

play13:29

with the definitions of sample and

play13:31

population given in the textbook i

play13:33

assume you've already written them out

play13:34

by hand in your notes and you know

play13:36

thought through some of the examples

play13:37

printed in the book

play13:39

but we want to understand these clearly

play13:40

so a sample would be

play13:42

a potentially smaller amount of people

play13:45

than a population it doesn't have to be

play13:47

smaller but it typically is smaller than

play13:49

the larger or full

play13:51

population a sample like you may imagine

play13:54

if you go to a restaurant or something

play13:56

and you sample some food you're just

play13:58

getting a small portion of maybe what

play14:00

they offer right

play14:01

so let's identify in this case what the

play14:03

sample and population are well in this

play14:05

case a survey was given to

play14:08

1046 adults so the sample

play14:12

in this case would be 1046.

play14:16

that's how many adults were surveyed

play14:18

right

play14:19

but the population

play14:23

would be

play14:24

all

play14:25

adults

play14:29

or you know you could say any adults

play14:31

right well obviously there are a lot

play14:32

more adults than 1046

play14:35

so

play14:36

the sample is in this case much smaller

play14:38

but it's supposed to just get a smaller

play14:40

representative

play14:42

slice of what the overall population

play14:45

may be or maybe like

play14:48

and then secondly

play14:50

with this question why would better

play14:51

results be obtained by observing

play14:54

the hand washing instead of asking about

play14:57

it well this goes up to one of our

play14:59

potential pitfalls we noted above which

play15:01

is the the reporting as opposed to

play15:04

measured

play15:06

so when people report that they wash

play15:08

their hands

play15:09

there's probably an inclination to say

play15:13

you did because it's it's probably a

play15:15

little more noble looking or or

play15:18

respectable looking to say you wash your

play15:20

hands right and so that may sway the

play15:23

results where people are maybe more

play15:25

inclined

play15:27

to to say they did just because it looks

play15:29

or sounds better so

play15:31

why would better results be given by

play15:33

observing because reporting

play15:35

results

play15:37

can skew them as opposed to observing as

play15:39

we noted above

play15:41

secondly determine whether a given

play15:42

source has potential to create bias in a

play15:45

statistical study

play15:48

in an article

play15:51

in journal of nutrition okay so this is

play15:53

a real publication here noted that

play15:56

chocolate is rich in flavonoids

play16:00

the article notes quote regular

play16:03

consumption of foods rich in flavonoids

play16:06

may reduce the risk of coronary heart

play16:08

disease end quote

play16:10

the study received funding from mars

play16:12

incorporated the candy company and the

play16:14

chocolate manufacturers association

play16:17

okay so think about this for a second

play16:19

right

play16:20

do these candy companies have an

play16:22

interest in people thinking that their

play16:24

candy might actually be nutritious well

play16:27

of course right that may

play16:30

make people more inclined to buy and eat

play16:31

it well if those people got if those

play16:35

candy companies funded

play16:37

this study

play16:39

and the study made that conclusion that

play16:42

these flavonoids found in their candy

play16:44

may actually be good for you

play16:46

there could be reason to wonder if

play16:48

there's some bias in these results right

play16:50

because it's it's more likely that they

play16:52

wanted those results and they they

play16:54

funded the study

play16:57

thirdly let's determine whether the

play16:58

sampling method appears to be sound or

play17:01

is flawed

play17:02

okay so first in a survey of

play17:05

1368 subjects the following question was

play17:07

posted on the usa today website in your

play17:11

view are nuclear plants safe

play17:14

end quote the survey subjects were

play17:16

internet users who chose to respond to

play17:19

the question posted on the electronic

play17:22

edition of usa today

play17:24

so we're trying to determine does this

play17:26

seem to be appear to be sound or flawed

play17:28

well

play17:29

if you understand and kind of imagine

play17:31

yourself in this context

play17:33

these were people who chose to respond

play17:35

right

play17:36

to this electronic question posted in

play17:39

this specific place this electronic

play17:41

edition of usa today and so this is what

play17:43

you would call self-reporting or

play17:45

voluntary response sample

play17:48

and so this could be dangerous and it

play17:51

could cause biased results as noted in

play17:53

our pitfalls above

play17:56

or part b in a survey of social media

play17:59

usage the pew research center randomly

play18:00

selected 2002 adults in the united

play18:03

states

play18:04

well since it was randomly selected

play18:07

this kind of construction of this sample

play18:10

actually would be more or more reliable

play18:14

we could say it appears to be sound

play18:19

next let's describe the difference

play18:20

between statistical significance and

play18:23

practical

play18:24

significance now to be clear we are

play18:26

going to study statistical significance

play18:29

in a more rigorous way and definition

play18:32

later on in the course this is just an

play18:34

introductory you know section on the

play18:36

topic of understanding statistical

play18:38

thinking so

play18:39

the definitions provided in the textbook

play18:41

which you should have written out and by

play18:43

hand in your notes

play18:45

do give just kind of a general sense of

play18:47

it and so statistical significance is

play18:49

achieved when a study produces results

play18:52

that are unlikely to occur due to random

play18:55

chance okay now that's the general

play18:57

meaning of it a commonly used criterion

play18:59

is that five percent chance or less of

play19:02

random occurrence is acceptable again

play19:05

this is going to be studied in more

play19:07

depth later on but the point is

play19:09

statistically significant has a

play19:12

mathematical meaning to it

play19:14

and it involves the results get uh given

play19:18

in the study

play19:20

not likely being just a chance like oh

play19:22

we just flipped a coin and that's how it

play19:24

ended up right it's more likely that it

play19:25

was not chance it was due to some other

play19:27

actual factor

play19:29

now practical significance occurs when a

play19:31

treatment or finding makes enough of a

play19:34

difference to justify its use

play19:36

so for example an arduous diet that only

play19:40

sheds a couple of pounds may not be

play19:42

practically significant to people right

play19:44

and so you could say well this diet

play19:48

showed that it had statistical

play19:49

significance to reduce you know weight

play19:51

in people they lost two pounds or

play19:53

something well okay even if it's

play19:55

statistically significant practically

play19:57

significant means people actually care

play19:59

about the result enough to try it so

play20:01

want to make sure and understand the

play20:02

difference between those two things

play20:05

next let's identify what's wrong with

play20:07

the following quote

play20:09

in recent years there has been a strong

play20:11

correlation between per capita

play20:12

consumption of sour cream and the

play20:15

numbers of motorcycle riders killed in

play20:17

non-collision accidents therefore

play20:20

consumption of sour cream causes

play20:23

motorcycle fatalities

play20:25

okay hopefully this one's jumping out at

play20:27

you loud and clear if not you can go up

play20:29

and review one of our pitfalls called

play20:31

correlation does not

play20:35

imply causation that was one of the

play20:37

things we noted above

play20:38

and in this case it's exactly what's

play20:40

happening right there's a correlation

play20:42

apparently between sour cream

play20:44

consumption and these fatal motorcycle

play20:46

accidents so okay what is the actual

play20:49

cause of that well the statistics can't

play20:51

really tell us

play20:52

but what we can say is that

play20:55

the correlation or association between

play20:58

two variables does not mean that one of

play20:59

the variables is the cause of the other

play21:03

correlation does not imply causation

play21:08

next let's talk about or review some

play21:11

work with fractions and decimals and

play21:13

percentages here so we have an example

play21:15

that says convert

play21:17

559 out of 1165

play21:21

to the following forms

play21:25

we want to write it as a fraction a

play21:26

decimal and a percentage well as a

play21:29

fraction we can take the phrase out of

play21:32

and use that kind of like a fraction bar

play21:35

and so if it says 559 out of 1165 we can

play21:39

say

play21:41

559

play21:43

is the numerator

play21:45

and then the

play21:46

1165 is the denominator so out of

play21:50

basically means over in a fraction set

play21:53

up right

play21:55

now we can convert this to a decimal

play21:57

simply by using a fraction as division

play22:00

right that's what a fraction means and

play22:01

so we could take this fraction 559

play22:04

over or out of 1165

play22:08

and in a calculator perhaps write that

play22:10

as 559 divided by

play22:12

1165

play22:14

now we weren't given any specific

play22:16

instructions for rounding in this

play22:18

example sometimes you may see actual

play22:20

instructions for it and we're actually

play22:22

going to talk about certain rounding

play22:24

standards and conventions more as we go

play22:26

on in later sections so since it wasn't

play22:28

mentioned i'm going to just say for this

play22:30

one let's round to 100

play22:32

and the decimal point form when we

play22:34

actually crunch this fraction in a

play22:36

calculator

play22:37

it's a long decimal but if we round it

play22:39

to a hundredth uh so we'll say

play22:41

approximately

play22:43

since it's rounded

play22:44

uh point or zero point

play22:47

four

play22:48

eight

play22:49

if rounded to a hundredth

play22:51

uh you could you know round it to

play22:52

different values

play22:53

just based on what was uh requested now

play22:56

we convert a decimal to a percentage by

play22:59

understanding decimal place values right

play23:02

two places to the right of a decimal

play23:04

which is where our value stops here is

play23:06

called the hundredths place

play23:08

well that's exactly what a percent

play23:11

means is out of 100 and so 0.48 or 0.48

play23:16

is the definition of 48

play23:21

and so again this is a rounded decimal

play23:22

and so i'm that consequently the

play23:24

percentage would also be rounded um

play23:27

but nonetheless these are how these two

play23:29

or three definitions relate fractions

play23:31

decimals and percentages of course you

play23:33

can go back and forth or different

play23:35

orders from one of the other depending

play23:36

on what you're given and where you're

play23:38

trying to go with it but this is how

play23:40

they're connected

play23:42

similarly for number seven the question

play23:44

says how would we calculate and

play23:46

interpret quote 48

play23:48

of 342 people end quote

play23:52

well we have to convert sometimes um

play23:55

english to mathematics so when you say

play23:57

48

play23:58

of

play24:00

342

play24:02

that's like saying

play24:04

the decimal form of 48 which is

play24:08

0.48

play24:10

of

play24:10

342 could mean times

play24:14

342. of in this context would be

play24:16

multiply and so you could crunch these

play24:19

two numbers

play24:21

and the result that we could get after

play24:23

just multiplying 0.48 times 342 on a

play24:26

calculator would be

play24:27

164.16

play24:31

okay so now that was the calculate part

play24:33

but then the next part said and

play24:35

interpret well if you're talking about

play24:37

people which is the context of this this

play24:40

number here

play24:41

you can't represent or reasonably

play24:45

describe 0.16

play24:47

of a person

play24:48

0.16 so

play24:50

rather we just round up to the nearest

play24:52

whole number when we're talking about

play24:55

people or you know items that can't be

play24:57

divided up so obviously animals would be

play25:00

something similar too but so in this

play25:02

case you would describe it or interpret

play25:04

as 164

play25:06

people just rounding to the nearest

play25:07

whole number

play25:13

and then

play25:14

lastly here what is suspicious about a

play25:16

researcher suggesting a treatment works

play25:19

in 53

play25:21

of a group of 20 subjects

play25:25

well the more we work with percentages

play25:27

the more comfortable i think we'll get

play25:29

with them but in this case when you're

play25:30

talking about a

play25:32

a set of 20 subjects

play25:34

let's remember first of all for a

play25:36

percent

play25:38

a percent is defined by a part

play25:42

over or out of

play25:45

a hole so part out of the whole thing

play25:47

right and you can actually crunch these

play25:50

numbers out to get a percentage

play25:52

a percentage for it so

play25:54

in this case

play25:55

there were 20 subjects which means the

play25:57

whole

play25:58

was 20. so if i have 20 as my

play26:01

denominator

play26:02

then the part would be you know how many

play26:04

respondents out of 20 i'm trying to

play26:06

represent well in this case

play26:08

we're suggesting that for those you know

play26:11

for whom this treatment works that would

play26:12

be the part we're talking about here

play26:15

we're suggesting that somehow the part

play26:17

number out of 20

play26:19

should equal 53 or perhaps rounded to it

play26:22

but the point is

play26:23

it would be roughly 0.53 right well if

play26:27

we start messing around with some some

play26:28

percentages if i had

play26:30

10 out of 20

play26:33

that would be zero point or just point

play26:36

five

play26:37

zero 0.5

play26:39

or 0.50 you can always add a zero onto

play26:41

the right without changing the number of

play26:43

a decimal okay so point 50 that would be

play26:46

50 that's close to our target of 53

play26:49

percent but it's too low so let's go one

play26:52

higher well the next higher would be if

play26:54

it worked for 11 out of 20 uh subjects

play26:57

right you can't have 10 and a half so go

play26:59

to 11 out of 20. well it turns out

play27:01

that's

play27:03

0.55

play27:04

so interestingly we skipped right over

play27:08

the reported result of 53 percent and

play27:12

we're suggesting that the numbers can't

play27:14

even give you that result

play27:15

you can get a 50 percent or 55 percent

play27:19

but not this 53

play27:21

given so that's reason to be a little

play27:24

suspicious about where the

play27:26

53 or the 0.53

play27:29

came from you know were they rounding

play27:30

something else what's going on with

play27:32

these percentages so understanding the

play27:33

mechanics of percents helps us use them

play27:36

and

play27:37

interpret them and be wary of them

play27:39

correctly

Rate This

5.0 / 5 (0 votes)

Etiquetas Relacionadas
StatisticsCritical ThinkingData AnalysisCorrelation vs CausationSampling MethodsBias in ResearchSurvey DesignData InterpretationStatistical SignificancePercentage Calculation
¿Necesitas un resumen en inglés?