How to calculate p-values

StatQuest with Josh Starmer

22 Mar 202025:15

Summary

TLDRIn this StatQuest episode, Josh Starman explains the concept of p-values, emphasizing the difference between one-sided and two-sided p-values, with a focus on the latter. He uses the example of coin flips to demonstrate how p-values are calculated by adding up probabilities and how they are used to test the null hypothesis. The video also covers the importance of considering equally rare and more extreme outcomes when calculating p-values. It further illustrates the process with examples involving coin tosses and height distributions, highlighting the significance threshold of 0.05 and the potential pitfalls of one-sided p-values.

Takeaways

😀 P-values are used to determine the significance of an observed outcome by calculating the probability of that outcome under a null hypothesis.
🔍 Two types of p-values exist: one-sided and two-sided, with two-sided p-values being more common and one-sided potentially misleading if not used carefully.
🎰 The calculation of p-values involves adding up probabilities of observing the given data or something more extreme, considering the null hypothesis.
🤔 The null hypothesis typically assumes no difference or effect, and a small p-value (usually less than 0.05) suggests rejecting it in favor of an alternative hypothesis.
📊 P-values consider not just the probability of the observed outcome but also outcomes that are equally rare or more extreme, which helps in assessing the significance of the results.
🌟 In the context of coin flips, p-values can be calculated by considering all possible outcomes and their probabilities, such as getting two heads in a row.
📈 For continuous data like height, statistical distributions are used to calculate p-values, which involve adding up areas under the curve that represent probabilities.
🚫 One-sided p-values focus only on one direction of change (e.g., recovery times being shorter) and can be dangerous if not interpreted correctly, as they may fail to detect negative effects.
🧐 The example of a new drug's effect on recovery times illustrates how p-values can be used to test whether the drug has a significant impact, with one-sided p-values potentially overlooking harmful effects.
📉 The script uses relatable examples, like coin tosses and flower rarity, to explain the concept of p-values and their importance in statistical analysis.

Q & A

What is the main focus of the StatQuest video?
-The main focus of the StatQuest video is to explain how to calculate p-values, which are used in statistics to determine the significance of observed data.
Why are two-sided p-values more common than one-sided p-values?
-Two-sided p-values are more common because they consider all possible deviations from the null hypothesis, making them more robust. One-sided p-values, which only consider deviations in one direction, are rarely used and can be potentially misleading.
What is the null hypothesis in the context of the coin flipping example?
-In the coin flipping example, the null hypothesis is that the coin is not special and behaves like a normal coin, meaning it has no difference from a standard coin in terms of the probability of landing heads or tails.
How is the probability of getting two heads in a row calculated in a fair coin flip?
-The probability of getting two heads in a row in a fair coin flip is calculated by considering the independent probabilities of each flip. Since each flip has a 50% chance of heads, the probability is 0.5 * 0.5 = 0.25.
What does a p-value represent in statistical testing?
-A p-value represents the probability of observing the data (or something more extreme) under the assumption that the null hypothesis is true. It is used to determine the significance of the results, with a common threshold of 0.05 for rejecting the null hypothesis.
Why are the probabilities of getting one head and one tail treated as the same regardless of the order?
-The probabilities of getting one head and one tail are treated as the same because the order does not affect the probabilities of the outcomes. Each flip is independent, so the result of the first flip does not influence the second.
How is the p-value calculated for the scenario of getting four heads and one tail in five coin flips?
-The p-value for getting four heads and one tail in five flips is calculated by adding the probabilities of all outcomes that are as extreme or more extreme than the observed outcome. This includes the probabilities of getting four heads and one tail (5/32), one head and four tails (5/32), and five heads or five tails (2/32), resulting in a p-value of 0.375.
What is a statistical distribution and how is it used to calculate p-values for continuous data?
-A statistical distribution is a model that describes the probabilities of different possible outcomes in a population. It is used to calculate p-values for continuous data by considering the area under the curve that corresponds to the observed data and more extreme values.
Why are one-sided p-values considered potentially dangerous?
-One-sided p-values are considered potentially dangerous because they only consider deviations in one direction from the null hypothesis, which can lead to incorrect conclusions if the data shows an effect in the opposite direction or if there is an unexpected effect. They should be used with caution and typically only by experts who understand their implications.
How does the StatQuest video illustrate the concept of 'rare or more extreme' outcomes in the context of p-values?
-The video illustrates 'rare or more extreme' outcomes by using examples such as flipping coins and measuring heights. It explains that p-values include not only the probability of the observed outcome but also the probabilities of other outcomes that are as rare or rarer, emphasizing the importance of considering all possible extreme outcomes when assessing statistical significance.

Outlines

plate

Cette section est réservée aux utilisateurs payants. Améliorez votre compte pour accéder à cette section.

Améliorer maintenant

Mindmap

plate

Cette section est réservée aux utilisateurs payants. Améliorez votre compte pour accéder à cette section.

Améliorer maintenant

Keywords

plate

Cette section est réservée aux utilisateurs payants. Améliorez votre compte pour accéder à cette section.

Améliorer maintenant

Highlights

plate

Cette section est réservée aux utilisateurs payants. Améliorez votre compte pour accéder à cette section.

Améliorer maintenant

Transcripts

plate

Cette section est réservée aux utilisateurs payants. Améliorez votre compte pour accéder à cette section.

Améliorer maintenant

Voir Plus de Vidéos Connexes

p-values: What they are and how to interpret them

Hypothesis Testing: One Sided vs Two Sided Alternative | Statistics Tutorial #14 |MarinStatsLectures

Hypothesis Test vs. Confidence Interval | Statistics Tutorial #15 | MarinStatsLectures

Multiple Regression, Clearly Explained!!!

What Is A P-Value? - Clearly Explained

Using Linear Models for t-tests and ANOVA, Clearly Explained!!!

Rate This

★

★

★

★

★

5.0 / 5 (0 votes)

Étiquettes Connexes

StatisticsP-ValuesHypothesis TestingCoin FlipNull HypothesisData AnalysisProbabilityStatistical DistributionHealthcare DataEducational Content

Besoin d'un résumé en anglais ?