T-test, ANOVA and Chi Squared test made easy.

Global Health with Greg Martin
5 Apr 202215:06

Summary

TLDRThis educational video explores various statistical tests and their appropriate applications. It delves into the t-test, covering single sample, two-tailed, one-tailed, and paired tests, using real-world examples to elucidate the concepts. The video then moves on to ANOVA, demonstrating how it allows for comparing means across three or more populations. Additionally, the chi-squared test is introduced, addressing both goodness of fit and test of independence scenarios, enabling analyses of categorical variables and proportions. Throughout the video, the presenter emphasizes the importance of understanding the research question and selecting the appropriate statistical test accordingly, making the material highly accessible and engaging.

Takeaways

  • 🔑 The key to understanding statistical tests is to first understand the question being asked.
  • 👤 For t-tests, we are examining the difference in means between two populations, or between one population at different points in time.
  • 🤔 The null hypothesis assumes there is no difference in means, and we calculate the probability of observing our sample data if the null hypothesis were true.
  • ✅ If this probability is below a predetermined threshold (typically 0.05), we reject the null hypothesis and conclude the observed difference is statistically significant.
  • 📐 There are different types of t-tests: single sample, two-tailed, one-tailed, and paired t-tests, depending on the specific question being asked.
  • ⚖️ ANOVA (Analysis of Variance) is used when comparing means across three or more populations.
  • 🔍 After an ANOVA identifies a significant difference, multiple comparisons can pinpoint which populations differ from each other.
  • 📊 Chi-squared tests are used to test for differences in proportions of categorical variables.
  • ✔️ The chi-squared goodness-of-fit test examines if observed proportions match expected proportions.
  • 🔗 The chi-squared test of independence checks if the proportions of one categorical variable depend on the values of another variable.

Q & A

  • What are the three main statistical tests covered in the video?

    -The three main statistical tests covered in the video are the t-test, ANOVA (analysis of variance), and the chi-squared test.

  • What is the purpose of the t-test?

    -The t-test is used to test the difference in means or averages between two populations, between one population at different points in time, or between a sample mean and a presumed or hypothesized mean.

  • What is the difference between a one-tailed and two-tailed t-test?

    -A two-tailed t-test is used when we are asking if there is a difference in any direction, while a one-tailed t-test is used when we are asking if there is a difference in a particular direction.

  • When is a paired t-test used?

    -A paired t-test is used when there are paired observations in each population, meaning that for each observation in one population, there is a corresponding observation in the other population.

  • What is the purpose of ANOVA?

    -ANOVA (analysis of variance) is used to compare the means of three or more populations. It tests the null hypothesis that there are no differences in the means of these populations.

  • How can you determine which specific populations are driving the difference in means after conducting an ANOVA?

    -After conducting an ANOVA and rejecting the null hypothesis, a multiple comparison of means (such as Tukey's test) can be performed to identify which specific populations have statistically significant differences in their means.

  • What is the chi-squared goodness of fit test used for?

    -The chi-squared goodness of fit test is used to determine if the observed proportions of a categorical variable are significantly different from the expected proportions based on a hypothesized distribution.

  • What is the purpose of the chi-squared test of independence?

    -The chi-squared test of independence is used to determine if there is a significant relationship between two categorical variables, i.e., if the proportions of one variable are independent of the other variable.

  • What is the null hypothesis being tested in a chi-squared test?

    -In a chi-squared test, the null hypothesis is that there is no difference in proportions (for goodness of fit) or that the variables are independent of each other (for test of independence).

  • How is the decision to reject or accept the null hypothesis made in hypothesis testing?

    -The decision to reject or accept the null hypothesis is made by comparing the calculated p-value to a predetermined significance level (typically 0.05). If the p-value is less than the significance level, the null hypothesis is rejected, indicating a statistically significant result.

Outlines

00:00

🎥 Introduction to Statistical Tests

This paragraph introduces the topic of the video, which is an explanation of different statistical tests: t-test, ANOVA (Analysis of Variance), and chi-squared test. It sets the context by stating that understanding the question being asked is crucial to determining the appropriate test and interpreting the results correctly. The paragraph also provides an overview of the specific types of t-tests (single sample, two-tailed, one-tailed, and paired) and chi-squared tests (goodness of fit and test of independence) that will be covered.

05:02

✏️ Exploring the T-Test

This paragraph delves into the details of the t-test, using real data and examples to illustrate different scenarios and applications. It explains that the t-test is used to analyze the difference in means or averages between two populations, one population at different points in time, or a sample mean compared to a hypothesized or presumed average. The concept of hypothesis testing is introduced, where the null hypothesis assumes no difference in means, and the alternative hypothesis states that the observed difference is statistically significant. The paragraph walks through the process of determining statistical significance, considering the p-value and predetermined threshold (often 0.05).

10:02

📊 ANOVA and Chi-Squared Tests

This paragraph covers two additional statistical tests: ANOVA (Analysis of Variance) and the chi-squared test. For ANOVA, it explains that it is used when comparing the means of three or more populations, with the null hypothesis being no difference in means across populations. The paragraph uses real data and visualizations to illustrate the ANOVA process and interpretation of results, including multiple comparisons and confidence intervals. For the chi-squared test, the paragraph introduces the goodness of fit test and the test of independence, both used to analyze categorical variables and proportions. It explains the process of hypothesis testing, setting the null hypothesis of no difference in proportions, and using the p-value and predetermined threshold to determine statistical significance.

Mindmap

Keywords

💡T-test

The t-test is a statistical hypothesis test used to determine if there is a significant difference between the means of two groups or samples. In the video, the t-test is explained as a method to assess if the difference between means from a sample and a presumed population mean, or between two population means, is statistically significant. Examples given include comparing life expectancy in Africa to a presumed mean, and comparing life expectancies between Africa and Europe.

💡Null hypothesis

The null hypothesis is the default assumption or statement being tested in a statistical analysis. It typically represents a claim of no difference or no effect. In the context of the video, the null hypothesis is that there is no difference between the means or proportions being compared. For example, the null hypothesis could be that the life expectancy in Africa and Europe is the same, or that the proportions of small, medium, and large flowers are equal.

💡P-value

A p-value is a probability value used in statistical hypothesis testing to determine the significance of the results. It represents the probability of obtaining results as extreme as the observed data, assuming the null hypothesis is true. In the video, the p-value is compared to a predetermined threshold (often 0.05) to decide whether to reject or fail to reject the null hypothesis. A small p-value (less than the threshold) suggests that the observed difference is statistically significant.

💡ANOVA (Analysis of Variance)

ANOVA is a statistical method used to analyze the differences among the means of three or more independent groups or populations. The video explains that when comparing more than two means, ANOVA is used instead of the t-test. The null hypothesis in ANOVA is that all group means are equal, and the alternative hypothesis is that at least one group mean differs from the others. ANOVA is demonstrated using data on life expectancy across Europe, the Americas, and Asia.

💡Chi-squared test

The chi-squared test is a statistical test used to evaluate the goodness of fit between observed data and expected data, or to assess the independence of two categorical variables. The video covers two types of chi-squared tests: the goodness of fit test (comparing observed proportions to expected proportions) and the test of independence (determining if two categorical variables are related or independent). These tests are demonstrated using examples of flower species and sizes.

💡Hypothesis testing

Hypothesis testing is a systematic method for making statistical decisions using data. It involves formulating a null hypothesis (the default assumption) and an alternative hypothesis, collecting data, and using statistical tests to determine whether the null hypothesis should be rejected or not rejected based on the observed data and a predetermined significance level. The video extensively discusses hypothesis testing in the context of various statistical tests (t-test, ANOVA, chi-squared).

💡Paired t-test

A paired t-test is a type of t-test used when the data consists of matched pairs or when the same subjects or entities are measured twice (e.g., before and after an intervention). In the video, the paired t-test is explained using the example of life expectancy in Africa in 1957 and 2007, where each country or observation in 1957 has a corresponding observation in 2007. The paired t-test accounts for the correlation between the pairs.

💡One-tailed and two-tailed tests

One-tailed and two-tailed tests refer to the direction of the alternative hypothesis in hypothesis testing. A one-tailed test is used when the alternative hypothesis specifies a direction (e.g., life expectancy in Africa is less than Europe), while a two-tailed test is used when the alternative hypothesis does not specify a direction (e.g., life expectancy in Ireland and Switzerland are different). The video explains and illustrates the use of one-tailed and two-tailed tests in the context of t-tests.

💡Confidence interval

A confidence interval is a range of values used to estimate a population parameter (e.g., mean, proportion) with a certain level of confidence. The video discusses confidence intervals in the context of ANOVA, where multiple comparison tests are performed to identify which group means differ. The confidence interval indicates the plausible range of values for the difference between two group means, and if it includes zero, it suggests the difference is not statistically significant.

💡Statistical significance

Statistical significance refers to the likelihood that an observed result or effect is not due to chance alone. It is determined by comparing the p-value (probability of obtaining the observed result if the null hypothesis is true) to a predetermined significance level (e.g., 0.05). If the p-value is less than the significance level, the result is considered statistically significant, and the null hypothesis is rejected. The video emphasizes the importance of statistical significance in interpreting the results of various statistical tests.

Highlights

When doing statistical tests, it's important to understand the question you're asking, and then it becomes easy to understand and interpret the results and decide which test to do.

For the t-test, there are single sample, two-tailed, one-tailed, and paired tests.

When doing a t-test, we're asking about the difference in means or averages between two populations, one population at different points in time, or between a sample mean and a presumed mean.

In hypothesis testing, we assume the null hypothesis (no difference in means), and if the probability of getting our sample data is very low under this assumption, we reject the null and accept that the difference is statistically significant.

For a single sample t-test, we compare the sample mean to a presumed population mean to see if the difference is statistically significant.

For a two-sample t-test, we can do a two-tailed test (is there a difference in any direction?) or a one-tailed test (is the difference in a particular direction?).

A paired t-test is used when there are pairs of observations, one from each population, matched together (e.g., life expectancy in Africa in 1957 and 2007).

For comparing means of three or more populations, we use ANOVA (analysis of variance).

After finding a significant difference with ANOVA, we can use multiple comparison tests to tease out which specific population means differ.

The chi-squared test is used for categorical variables and proportions, testing if the proportions across categories differ from expected values.

The chi-squared goodness of fit test checks if the observed proportions differ significantly from expected equal proportions.

The chi-squared test of independence checks if the proportions of one categorical variable depend on the values of another categorical variable.

In all tests, if the p-value is below a predetermined threshold (usually 0.05), we reject the null hypothesis and conclude the observed difference or relationship is statistically significant.

The key principles are the same across tests: assuming a null hypothesis of no difference, calculating the probability of getting the observed data under that assumption, and rejecting the null if that probability is very low.

Proper statistical testing requires determining the threshold for significance beforehand, to avoid p-hacking (adjusting criteria based on results).

Transcripts

play00:00

welcome back to this global health

play00:01

youtube channel in this video we're

play00:02

going to be talking about statistical

play00:03

tests which test to do when it's not

play00:06

complicated and the key is to understand

play00:09

what question it is that you're asking

play00:11

and when you understand the question it

play00:12

becomes very easy to understand and

play00:14

interpret the results and to decide

play00:16

which tests to do when so don't go away

play00:18

stick with me you're going to love this

play00:20

we're going to cover three things in

play00:21

this video the t-test anova which is

play00:24

analysis of variance and the chi-squared

play00:25

test for the t-test there'll be the

play00:27

single sample there's going to be

play00:29

two-tailed one-tailed and paired right

play00:30

we're going to do all four of those

play00:32

things and for the chi squared there's

play00:33

the goodness of fit and of course

play00:35

there's also the test of independence

play00:37

right you're going to find all of this

play00:38

super duper easy to understand so stick

play00:40

with me don't go away let's do this

play00:43

let's talk about the t-test right and

play00:44

i've got real data and real examples

play00:46

here and we're going to look at four

play00:47

different scenarios four different

play00:49

applications of the t-test looking at

play00:51

this data when we do a t-test we're

play00:53

asking a question about the difference

play00:55

in means or averages difference between

play00:57

two populations or difference between

play00:59

one population at different points in

play01:00

time or the difference in an average

play01:02

mean that we're seeing compared to some

play01:04

sort of hypothesized or presumed average

play01:06

right or presumed mean and we're asking

play01:08

the question is our sample can from our

play01:11

sample diet can we make inference about

play01:13

the wider population is this somehow

play01:15

representative of the truth that's out

play01:17

there representative of the population

play01:20

from which the sample was taken in other

play01:22

words is this statistically significant

play01:25

and this is how hypothesis testing works

play01:27

we assume the opposite the

play01:29

counterfactual the antithesis we assume

play01:31

that there's no difference in means

play01:32

between these two populations

play01:35

if that were true and we call that the

play01:37

null hypothesis if that were true

play01:40

then how likely would it be that we

play01:42

would get a sample

play01:43

from the population that shows a

play01:45

difference that we're seeing

play01:47

or greater what is the probability of

play01:49

that if we find that that is very

play01:52

improbable and we decide up front by the

play01:54

way we decide upfront what we consider

play01:56

what the threshold for what we would

play01:58

consider to be very improbable

play02:01

what do we mean by very and we we often

play02:03

use five percent if it's five percent or

play02:05

less in terms of likelihood or

play02:07

probability if we cons if it would be

play02:10

very improbable to get a sample

play02:12

like we've gotten

play02:14

if the null hypothesis were true

play02:17

and it would be very improbable to get

play02:18

the sample but we have gotten the sample

play02:21

we can then make the inference that the

play02:22

null hypothesis must in fact be

play02:24

incorrect that this assumption that

play02:27

there's no difference that that's

play02:28

incorrect we can reject that and we can

play02:30

accept the fact that there is in fact a

play02:32

difference and that our sample data is

play02:35

statistically significant and that's how

play02:37

hypothesis testing works so the first

play02:40

example here at the top on the left is a

play02:42

single sample t-test in other words

play02:44

we've just got one sample just in this

play02:45

case we've just got life expectancy in

play02:48

africa so we've just taken africa we've

play02:49

got life expectancy uh we don't have two

play02:51

populations we've got one population

play02:53

it's just africa and we've got a

play02:56

presumed life expectancy or presumed

play02:58

mean it could be for any old reason

play02:59

there could be any reason why we believe

play03:01

that life expectancy should be 50 or

play03:03

should be 55 it could be any number and

play03:05

we would ask the question is that the

play03:07

sample that we've got

play03:08

48.9 years is that statistically

play03:11

significantly different from that

play03:13

presumed mean or that presumed

play03:14

population mean and then you get you

play03:16

basically get a p-value if that p-value

play03:18

is less than point zero five or five

play03:20

percent if that if that's the number

play03:22

that you've chosen as a threshold it

play03:24

could be different

play03:25

uh then if if it's if it's less than

play03:27

that threshold and it's usually 0.00.05

play03:30

then we can reject the null hypothesis

play03:32

that the average is whatever it is that

play03:34

we assumed it to be and we can accept

play03:35

the fact that the the the the the sample

play03:38

mean that we've gotten is statistically

play03:40

significant okay that's the easiest

play03:42

example if you understand that one

play03:43

you'll understand the rest of these

play03:44

super duper easy let's keep going in

play03:47

these two examples and i've got two here

play03:48

just to illustrate the fact that there's

play03:50

two possible ways of asking this next

play03:51

kind of question we've got we've got

play03:53

life expectancy in africa and and in

play03:55

europe in the top right hand corner and

play03:57

we've got life expectancy in ireland and

play03:59

in switzerland at the bottom bottom left

play04:01

over here now the reason i've got two

play04:03

here is just to highlight the fact that

play04:04

there's two ways of doing this we could

play04:06

ask the question

play04:08

is there

play04:09

a difference without specifying in which

play04:12

direction so is there a difference for

play04:13

example between the life expectancy in

play04:15

ireland and the life expectancy in

play04:16

switzerland and we might say look we

play04:19

don't know in which direction the

play04:20

difference might be we're just asking

play04:21

are these are they the same or are they

play04:22

different is the difference that we're

play04:24

seeing here statistically significant

play04:26

would we expect to see a difference of

play04:28

this magnitude

play04:30

or more if it were the case that in you

play04:33

know in the scenario of the null

play04:34

hypothesis that in fact ireland and

play04:36

switzerland have the exact same

play04:38

life expectancy and if and if that

play04:40

probability is less than 0.05 if that's

play04:42

our threshold then we would say that it

play04:44

is statistically significant we reject

play04:46

the null hypothesis we reject the notion

play04:48

that they've got the same life

play04:50

expectancy and we accept the fact that a

play04:52

life expectancy that that this is

play04:54

statistically significant we could

play04:56

approach the same

play04:58

the exact same problem in a slightly

play04:59

different way and let's look let's use

play05:01

africa and europe for that we could say

play05:03

look we want to ask the question

play05:05

is it statistically significant that

play05:08

that africa is

play05:10

that that africa has a life expectancy

play05:13

less than europe of this magnitude so we

play05:15

might go into the question saying we

play05:17

know that africa has got a life

play05:18

expectancy less than europe

play05:20

we're asking

play05:21

is a difference of this magnitude

play05:24

statistically significant so we're not

play05:26

asking is there a difference in any

play05:28

direction

play05:29

we're saying there is a we think there

play05:31

is a difference

play05:32

we think that the difference we think

play05:34

that africa has a a different

play05:37

we're asking is africa is the life

play05:39

expectancy in africa less than europe

play05:42

by this magnitude or more and is that

play05:44

statistically significant and under

play05:46

those circumstances you do a one-tailed

play05:49

t-test does that make sense two-tailed

play05:51

if you're saying is there a difference

play05:52

in any direction one tailed if you're

play05:54

saying is there a difference in a

play05:55

particular direction

play05:57

okay you got it and in both cases the

play05:59

null hypothesis is that both populations

play06:02

have got the same mean that that there's

play06:04

no difference between the life

play06:06

expectancy in the two populations now

play06:09

here's the last example and this is this

play06:11

is what i want you to understand this is

play06:12

a paired tea test so we've got a one a

play06:15

one tailed and a two-tailed over there a

play06:17

paired t-test and this illustrates it

play06:19

quite nicely

play06:20

we've got a life expectancy in africa in

play06:23

1957 and then the life expectancy in

play06:26

africa in 2007.

play06:29

so it's

play06:30

for each sample

play06:32

for each observation in

play06:35

the 1957 there is a counterpart in 2007

play06:39

right so

play06:40

one in africa in 1957 one of the

play06:43

examples would be south africa there'd

play06:44

be a life expectancy that's one of the

play06:46

observations but in 2007 there would

play06:48

also be an observation that was south

play06:50

africa right and there'd be an a life

play06:53

expectancy there's malawi in both cases

play06:56

zimbabwe in both in both in both samples

play06:58

in other words there are these pairs

play07:00

there is a counterpart in each

play07:02

population and under those circumstances

play07:04

you do a paired t test and all of the

play07:07

principles the exact same principles

play07:09

apply right you can it can be one-sided

play07:12

two-sided

play07:13

and and of course the p-value of less

play07:16

than a threshold that you just determine

play07:18

up front

play07:19

anything less than that threshold if

play07:21

it's if it's if it's five percent means

play07:23

that the null hypothesis gets rejected

play07:25

the null hypothesis is that both of

play07:26

these have got the exact same life

play07:29

expectancy the exact same mean and if

play07:31

that's not the case if we reject that we

play07:33

can accept that the difference that

play07:35

we're seeing is

play07:36

statistically significant all right got

play07:38

it let's keep going next we're going to

play07:40

talk about anova so here we've got

play07:43

the we've got two means right if we want

play07:46

to add a third what do we do we can't do

play07:48

we can't do three means with the t-test

play07:50

we would do an analysis of variance

play07:52

anova all right so let's look at that

play07:54

next

play07:56

analysis of variance we're trying to

play07:58

basically answer the same kind of

play07:59

question that we were with the t-test

play08:01

but except now we've got

play08:03

three populations and we want to com

play08:06

three or more populations and we wanted

play08:07

to compare the means right the null

play08:09

hypothesis is still the same that the

play08:12

null hypothesis is that there are no

play08:14

differences in the means of these

play08:15

populations

play08:16

and the alternative hypothesis is that

play08:18

in fact there is a difference that the

play08:19

difference we're seeing is statistically

play08:22

significant right in this case i've used

play08:24

box plots and density uh density plots

play08:27

to to illustrate the difference in means

play08:30

in three populations we've got europe

play08:31

the americas and asia this is taken from

play08:33

the gap-minded data this is real data so

play08:36

it's super duper interesting and we're

play08:38

seeing here that the data is showing a

play08:41

difference in the means across these

play08:42

different population groups right the

play08:44

question is is that difference real and

play08:47

if we do a an anova test it will show a

play08:49

p-value that's very small but once

play08:51

you've done that right you've done the

play08:53

anova test you've shown that there's a

play08:54

small p value we can reject the null

play08:56

hypothesis which is that there's no

play08:57

difference in the means we can accept

play08:59

the fact that there is some sort of

play09:00

statistical difference uh how then do we

play09:03

tease out where that what's driving that

play09:06

difference right because all that

play09:09

conclusion all we can conclude from that

play09:11

is that

play09:12

one of

play09:14

these populations is different from the

play09:16

others and sometimes there's that you

play09:17

know you may have more than three here

play09:19

and you can do there are ways of

play09:21

determining

play09:22

uh

play09:23

where that what's driving that

play09:24

difference

play09:25

and what i've done here is i've fed our

play09:27

model into what's called a two key

play09:29

multiple comparison of means and it's

play09:31

taken each of the options you know asia

play09:35

and america europe and america and

play09:36

europe and asia and looked at them

play09:39

individually right and if you look at

play09:40

the results of it it's quite interesting

play09:42

because you can sort of see well

play09:43

between asia and america

play09:46

there is a difference of

play09:47

minus two or the difference of 2 it

play09:49

doesn't really matter the magnitude

play09:50

doesn't matter but the confidence

play09:52

interval for that difference is between

play09:55

-6 and 0.72 now and here you can see it

play09:58

diagrammatically that confidence

play09:59

interval crosses the zero threshold in

play10:02

other words the confidence interval the

play10:03

95 confidence interval includes the

play10:06

possibility of a zero which a zero means

play10:09

there's zero difference in other words

play10:10

it includes the possibility that there's

play10:12

no difference between those two

play10:14

population means however europe and

play10:16

america right we've got a difference of

play10:18

four the confidence interval does not

play10:21

include zero it's from zero point three

play10:23

to seven point two

play10:24

and uh and in europe to asia again same

play10:27

story quite a big difference and the

play10:29

difference does not include the

play10:30

possibility of zero of no difference and

play10:33

as you would expect the p values the

play10:35

adjusted p values uh

play10:37

bears that out so in the asia to america

play10:40

where the confidence interval included

play10:41

zero the p value does not cross the

play10:42

threshold of of a five percent or less

play10:46

it's 0.14 but the other two where the

play10:49

confidence interval does not include

play10:50

zero does not include the possibility of

play10:52

no difference they both have small p

play10:54

values less than point zero

play10:56

point zero

play10:57

point zero five okay got it

play11:00

now let's talk about the chi-squared

play11:02

test there's two of them right there's

play11:03

the goodness of fatigue test and there's

play11:05

the chi squared test of independence now

play11:09

really what we're looking at here is

play11:11

categorical variables and proportions of

play11:13

categorical variables and this is a

play11:15

great test to kind of test the notion

play11:18

right we're testing whether or not there

play11:20

in fact is a difference in the

play11:22

proportions across the different

play11:23

categories okay so let's have a look at

play11:25

that right so here we've got

play11:27

some flowers these happen to be irises

play11:29

and we know that they come in we've

play11:31

categorized them as small medium and

play11:33

large

play11:34

and in the first instance we could ask

play11:35

the question are the proportion of

play11:38

flowers that are small medium large the

play11:39

same do we expect to see the same number

play11:41

of small medium and large flowers in a

play11:43

random sample that we take from the

play11:44

population and we answer that question

play11:46

by doing a chi-squared goodness of fit

play11:48

test let's have a look at that

play11:50

and again we're talking about hypothesis

play11:52

testing in other words

play11:54

if if it were the case

play11:57

that there wasn't a difference in

play11:58

proportions that would be our null

play12:00

hypothesis right our null hypothesis is

play12:02

that there's no difference in proportion

play12:04

in the proportion of small medium and

play12:06

large

play12:07

flowers

play12:08

right if that were true and we took a

play12:11

random sample and that random sample

play12:13

happened to show a difference in

play12:15

proportions as large as or bigger than

play12:17

the difference we're seeing now we would

play12:19

consider if if that if if that

play12:21

eventuality was considered to be

play12:23

extremely unlikely then we could reject

play12:26

the idea that they're all the same

play12:27

proportions and we could accept the fact

play12:29

that what we're seeing in the data is in

play12:31

fact statistically significant what do

play12:33

we mean by extremely unlikely well we've

play12:35

got a cut-off point called the alpha

play12:36

value that's the probability you know

play12:39

how small this how small must that

play12:41

probability be for us to consider it to

play12:43

be

play12:45

not like like

play12:47

unacceptably smaller to the point that

play12:48

we wouldn't really believe that we

play12:50

wouldn't accept the null hypothesis to

play12:52

be true

play12:52

right and and usually we use 0.5 as

play12:54

we've talked about so many times in

play12:56

these videos

play12:57

so we do a chi-square test we take the

play12:59

starter we make a table put it into the

play13:00

chi-square test and we get a p-value

play13:03

that p-value is that probability if that

play13:05

p-value is less than the threshold we

play13:07

talked about usually 0.05 but it could

play13:10

be anything depending on what it is that

play13:11

you're trying to measure and how

play13:12

important your sort of discrimination is

play13:15

if that p-value is less than the

play13:16

threshold and we reject the null we

play13:18

accept the alternative and we say that

play13:19

this difference that we're seeing in the

play13:20

data is in fact statistically

play13:23

significant and the exact same principle

play13:25

applies to the chi-square test of

play13:27

independence we're asking the question

play13:29

are the proportions of

play13:31

our species

play13:33

are they in any way dependent or are

play13:35

they independent of the size of the

play13:37

flowers is knowing the value

play13:40

you knowing this is does knowing the

play13:41

size of the flower tell us anything

play13:43

about the probability of a particular

play13:45

flower being in one of these species

play13:47

right looking at these graphs it seems

play13:49

that that is the case but we need to

play13:50

demonstrate that statistically so we do

play13:53

a

play13:53

chi-square test of independence and we

play13:55

get a p-value if the p-value is very

play13:58

very small beyond a pre-determined

play14:01

threshold remember it has to be

play14:02

predetermined you can't do it rest

play14:04

retrospectively that's p hacking bad

play14:06

science

play14:08

uh if it's if the p value is very small

play14:10

in other words the probability of a

play14:11

sample showing a a difference in

play14:14

proportions or a a relationship which is

play14:18

demonstrated by difference in

play14:19

proportions of this magnitude or more

play14:21

the probability of that being the case

play14:23

in the event that the null hypothesis

play14:25

was was true in other words that there

play14:26

was no difference if that proper

play14:28

probability is extremely small we reject

play14:31

the notion that these things are all the

play14:33

same that the proportions are the same

play14:35

and accept the fact that in fact what

play14:37

we're seeing in the data this difference

play14:39

that we're seeing this relationship that

play14:40

we're seeing is in fact statistically

play14:43

significant

play14:44

right that's the chi-squared test of

play14:46

independence now stay and watch another

play14:48

video share this video with people that

play14:50

you think might find it useful subscribe

play14:52

to this channel if you haven't hit the

play14:53

bell notification if you want

play14:54

notification of future videos

play14:56

take care

play14:57

don't do drugs always do your best don't

play14:58

ever change speak to you again soon

Rate This

5.0 / 5 (0 votes)

Besoin d'un résumé en anglais ?