MIN-LESSON 8b: (Power Laws) Q& A

N N Taleb's Probability Moocs
4 Jun 202113:36

Summary

TLDRIn this insightful lecture, the speaker addresses the prevalence of power laws in various aspects of life, such as wars and pandemics, which exhibit an alpha close to one-half, indicating heavy tails. The speaker criticizes the use of small sample sizes to make general claims, emphasizing the importance of avoiding sampling errors in scientific analysis. They also explore the concept of infinity in the context of power laws, explaining how it differs from finite observations and how it contributes to the unpredictability of large-scale events. The talk concludes with a discussion on the origins of power laws, including the Matthew effect and social contagion, and how constraints can alter their distribution.

Takeaways

  • πŸ“š The speaker discusses the concept of power laws and their relation to events like wars and pandemics, emphasizing the importance of understanding their statistical properties.
  • πŸ•ŠοΈ Stephen Pinker's view on the decline of violence is critiqued, with the speaker pointing out the limitations of drawing conclusions from a small sample size and the potential for sampling errors.
  • ⏱️ Wars and pandemics are highlighted as having power law distributions with an alpha close to one half, indicating heavy tails and the possibility of extreme events.
  • πŸ” The speaker emphasizes the need for caution when making statistical claims based on historical data, especially given the unreliability of historical records and the potential for exaggeration over time.
  • πŸ“‰ The concept of 'infinite' in the context of power laws is explained, describing it as a situation where the mean or upper bound is not well-defined due to the nature of the data.
  • πŸ“ˆ Power laws are contrasted with Gaussian distributions, with the former being characterized by heavy tails and the latter by a more balanced distribution around the mean.
  • 🌐 The speaker introduces the idea of maximum entropy distributions as a way to understand why power laws occur and under what conditions they might not.
  • πŸ’‘ The 'Matthew effect' and 'preferential attachment' are mentioned as mechanisms that can lead to power law distributions, where successful entities become more successful over time.
  • πŸͺ An example of how power laws can emerge is given through the scenario of storefront allocations, where success leads to greater opportunities and further success.
  • πŸ“Š The speaker discusses the transformation of data to fit power law distributions, such as converting the maximum possible deaths in a conflict to an equivalent 'infinity' for statistical analysis.
  • 🚫 The existence of constraints, like energy or biological limits, is presented as a reason why not all phenomena follow power law distributions, with human height being an example of a constrained variable.

Q & A

  • What is the main topic discussed in the transcript?

    -The main topic discussed in the transcript is the concept of power laws, particularly in the context of wars, pandemics, and their statistical properties.

  • Who is Stephen Pinker and what is his claim regarding violence?

    -Stephen Pinker is a journalist and author who wrote a book claiming that violence has declined over history, based on his analysis of 100 observations.

  • What is the issue with using a small sample size to derive general properties, according to the speaker?

    -The issue with using a small sample size is that it introduces sampling errors and the risk of being misled by randomness, which goes against the scientific method of avoiding noise and not being fooled by randomness.

  • What is the significance of the 'alpha' value in the context of power laws?

    -The 'alpha' value in power laws represents the steepness of the tail of the distribution. Wars and pandemics have an alpha close to one half, indicating a thick tail, which means rare but extreme events are more likely than what a normal distribution would predict.

  • What is the average inter-arrival time for a conflict causing more than 50 million deaths, according to the speaker?

    -The average inter-arrival time for a conflict causing more than 50 million deaths is approximately 80 years.

  • Why does the speaker argue that it is not scientifically valid to claim the world is a 'better place' without wars based on a short period of observation?

    -The speaker argues that to make a statistically significant claim about the absence of large-scale wars, one must wait for a period at least three times the average inter-arrival time to account for the variability in such rare events.

  • What is the issue with historical data when it comes to statistical inference?

    -The issue with historical data is that historians are not rigorous scientists; they often cite other people and provide numbers without a clear methodology, leading to potential inaccuracies in statistical inference.

  • How does the speaker address the problem of variability in historical death tolls due to conflicts?

    -The speaker addresses this by building over 500 events with high and low estimates of death tolls, creating 100,000 different histories to test hypotheses and account for variability.

  • What is the concept of 'infinite' in the context of power laws and statistical analysis?

    -In the context of power laws, 'infinite' means that the mean or other metrics do not converge to a specific value but rather fluctuate widely, indicating that the upper bound is not defined.

  • What are the two types of distributions mentioned by the speaker, and how do they relate to power laws?

    -The two types of distributions mentioned are one-tailed distributions, which are skewed to the left or right, and two-tailed distributions, which have both positive and negative tails. Power laws are typically one-tailed, with a heavy emphasis on the right side (positive values).

  • How does the speaker explain the formation of power laws in real-world phenomena?

    -The speaker explains the formation of power laws through mechanisms like the Matthew effect (rich get richer), preferential attachment, and social contagion effects, which can lead to a snowball effect and the creation of power-law distributions.

  • Why does the speaker believe that the world is naturally power law distributed, except under certain constraints?

    -The speaker believes that the world is naturally power law distributed because many phenomena follow this pattern unless there are specific constraints like growth limitations, energy constraints, or biological constraints that prevent a power law distribution.

Outlines

00:00

πŸ“š Questioning the Decline of Violence and Wars

The speaker addresses the question about the decline in violence and wars, referencing Stephen Pinker's book that suggests a decrease in violence over time. The speaker challenges this view by discussing the power law distribution of wars and pandemics, noting that they have a significant 'alpha' value close to one half, indicating a fat tail and the potential for large-scale events. The speaker criticizes the scientific approach of drawing conclusions from a small sample size and emphasizes the importance of avoiding sampling errors and not being misled by randomness. They also discuss the statistical significance of making claims about the absence of large-scale wars, suggesting that a longer time frame is necessary for such assertions. Additionally, the speaker touches on the unreliability of historical data, particularly when it comes to the numbers reported by historians, and proposes a method of building multiple histories to test hypotheses and account for this variability.

05:02

πŸ” Understanding Power Laws and Infinite Observations

This paragraph delves into the concept of power laws, particularly focusing on the idea of infinity in the context of observations. The speaker explains that infinite in this sense does not mean an actual infinite observation but rather a theoretical construct that helps in understanding the distribution of data. They clarify that power laws are characterized by a single tail, which is either skewed to the left or right, and that variables of interest, such as wars and pandemics, typically have one tail. The speaker also discusses the transformation of data to accommodate the concept of infinity, allowing for the application of power laws in statistical inference. The paragraph concludes with a brief mention of the origins of power laws, hinting at a deeper exploration in the context of entropy and maximum entropy distributions.

10:03

🌐 The Emergence of Power Laws in Social Phenomena

The speaker explores the emergence of power laws in social phenomena, such as the 'Matthew effect' where the rich get richer, and the concept of preferential attachment. They provide an illustrative example of how a store with more visitors is likely to expand and attract even more visitors, leading to a power law distribution over time. The speaker also mentions social contagion effects, where the actions of a few can influence the behavior of many, further reinforcing the power law distribution. They contrast this with situations where constraints, such as energy or biological limitations, prevent the formation of power laws, as seen in the distribution of human height. The speaker concludes by emphasizing that power laws are a natural distribution in the absence of constraints, and they express a personal belief in the prevalence of power law distributions in the world.

Mindmap

Keywords

πŸ’‘Power Laws

Power laws are mathematical relationships between two quantities, where a relative change in one quantity results in a proportional relative change in the other quantity, independent of the initial size of those quantities. In the context of the video, power laws are discussed in relation to the distribution of events like wars and pandemics, where the frequency of such events follows a specific mathematical pattern. The speaker uses power laws to analyze the likelihood and impact of rare but extreme events, such as large-scale wars that result in a significant number of deaths.

πŸ’‘Stephen Pinker

Stephen Pinker is a cognitive scientist, psychologist, and author known for his books on language, mind, and human nature. In the video, the speaker refers to Pinker's work on the decline of violence, challenging the idea that the world is becoming a better place by pointing out the limitations of Pinker's methodology and the potential for large-scale conflicts that could contradict this narrative.

πŸ’‘Alpha

In the context of power laws, the 'alpha' refers to the exponent in the mathematical formula that describes the relationship between two quantities. A lower alpha value indicates a 'fatter' tail in the distribution, meaning that there is a higher likelihood of extreme events occurring. The speaker mentions that wars and pandemics have an alpha close to one half, suggesting that these events have a significant potential for large-scale impact.

πŸ’‘Black Swan

The term 'Black Swan' is used to describe an event that is highly improbable but has severe consequences and is typically only understood in hindsight. The speaker refers to their work on the 'Black Swan' to highlight the unpredictability of events like wars and pandemics, which have thick tails in their distribution, indicating a higher likelihood of extreme outcomes.

πŸ’‘Sampling Error

Sampling error refers to the error that arises from the difference between the characteristics of the sample and the population from which it is drawn. In the video, the speaker criticizes the use of small sample sizes to make broad claims about trends, such as the decline of violence, emphasizing the importance of avoiding sampling errors to prevent being misled by randomness.

πŸ’‘Statistical Significance

Statistical significance is a measure of how likely it is that an observed effect or difference occurred by chance. The speaker argues that to make a statistically significant claim about the decline of wars, one must wait a sufficient amount of time, which is three times the average inter-arrival time of such events, to ensure that the claim is not a result of random variation.

πŸ’‘Bootstrap Method

The bootstrap method is a resampling technique used in statistics to estimate the accuracy of sample estimates. It involves resampling the data multiple times and calculating the statistic of interest for each resample. In the video, the speaker mentions using the bootstrap method to analyze data on pandemics, suggesting that the power law distribution observed holds even when different subsets of the data are considered.

πŸ’‘Existential Risk

Existential risk refers to a risk that threatens the entire future of humanity, potentially leading to human extinction or a permanent and drastic reduction in the potential for future development. The speaker identifies wars and pandemics as existential risks due to their potential to cause extreme harm and disruption on a global scale.

πŸ’‘Infinite

In the context of the video, 'infinite' is used to describe a situation where the mean or sum of a dataset does not converge to a specific value but instead varies widely, indicating a lack of an upper bound. The speaker explains that in power law distributions, the concept of 'infinite' is used to illustrate the potential for extreme values that can significantly impact the overall distribution.

πŸ’‘Matthew Effect

The Matthew Effect, or 'rich get richer' phenomenon, is a principle from sociology that suggests that those who have a certain advantage are more likely to accumulate additional advantages. In the video, the speaker uses this concept to explain how power laws can emerge, as those with more resources or success are more likely to gain even more, leading to a distribution where a few entities dominate.

πŸ’‘Constraints

Constraints refer to limitations or restrictions that can affect the behavior of a system or the distribution of certain variables. The speaker discusses how constraints, such as energy limitations or biological constraints on human height, can prevent certain variables from following a power law distribution. In contrast, variables like GDP or market prices, which are less constrained, can exhibit power law characteristics.

Highlights

Stephen Pinker's book argues that violence has declined historically, but the speaker challenges this view with a power law perspective.

Wars and pandemics follow a power law with an alpha close to one half, indicating a thick tail of extreme events.

The critique of deriving general properties from a small sample size due to sampling errors and the importance of avoiding being fooled by randomness.

The average inter-arrival time for wars causing more than 50 million deaths is approximately 80 years, questioning the premature conclusion of a safer world.

The importance of waiting three times the average inter-arrival time for statistically significant claims about the absence of large-scale wars.

Historians' lack of rigor in data reporting and the issue of inflated numbers over time, affecting statistical inference.

The method of building over 500 alternate histories to account for the variability in historical conflict data.

The concept of transforming a finite number of deaths into an equivalent to infinity for power law analysis.

Pandemics also follow a power law distribution, reinforcing the existential risks they pose to humanity.

The clarification of the concept of infinity in data analysis, relating to undefined means and the unpredictability of extreme values.

The explanation of power laws as one-tailed distributions, typically skewed to the right, with implications for variables like wars and pandemics.

The role of constraints in preventing the formation of power laws, such as biological constraints on human height.

The Matthew effect and preferential attachment as mechanisms leading to power law distributions in social and economic phenomena.

The use of maximum entropy distribution to understand the formation of power laws and the constraints that lead to Gaussian distributions.

The speaker's personal view that the world is naturally power law distributed except when constrained, contrasting with the idea of social contagion effects.

The impact of constraints on growth, energy, and limitations in shaping distributions that deviate from power laws.

A commitment to answering more questions in the future, indicating an ongoing dialogue on the topic.

Transcripts

play00:00

friends hello again uh

play00:04

i'm gonna uh answer some questions

play00:07

uh a lot of questions online about the

play00:10

lecture on power laws

play00:12

and thank you for these interesting

play00:13

questions so let me start but i'm going

play00:15

to structure it

play00:17

to make it flow with the previous

play00:19

[Music]

play00:21

the main power law

play00:24

lesson okay so

play00:27

first question was

play00:30

what about wars okay stephen pinker

play00:33

wrote a book

play00:35

the journalist stephen pinker wrote a

play00:37

book on

play00:39

a saying that violence has declined

play00:41

naively he took 100 observations

play00:44

uh over the past history and uh

play00:48

claimed that you know we haven't had

play00:50

wars since

play00:51

big wars since the second world war so

play00:53

the world is a better place

play00:56

there were like the number of laws

play00:59

in his book is enormous but i'll focus

play01:02

on

play01:02

one thing wars you remember i was

play01:06

talking about alphas

play01:07

wars have an alpha close to one half

play01:12

the lower the alpha the

play01:16

fatter the tail pandemics

play01:21

also about one half and when i wrote the

play01:25

black swan

play01:26

that's what shocked me wars and

play01:28

pandemics

play01:29

they had the thickest tails and it was

play01:32

you know

play01:33

uh not until quite a bit later that we

play01:36

did work on words and pandemics

play01:38

in a more formal way so to give you the

play01:42

intuition of why

play01:43

uh what what error my picture i'm going

play01:45

to focus on which is the one the most

play01:46

offensive i find scientifically

play01:49

which is to derive general properties of

play01:52

a small sample

play01:54

because you have sampling errors so the

play01:56

whole idea about science is avoiding

play01:58

noise

play01:59

sampling error and and not being fooled

play02:02

by randomness

play02:03

so i will um

play02:06

and i'll show you what what what in fact

play02:09

had happened here okay when we read

play02:12

the data we know that we did the

play02:14

following

play02:16

and actually the paper is in my book

play02:18

okay or a version of the papers in my

play02:20

book

play02:22

you have you take inter-arrival time you

play02:25

take an event

play02:26

say an event

play02:29

x higher than okay

play02:36

you take an event say

play02:41

x higher than

play02:48

50 million in today's population the

play02:52

equivalent 50 more than 50 million

play02:54

have died in that event we call

play02:58

a conflict single conflict not series of

play03:02

conflicts

play03:03

cumulatively killing more than 50

play03:05

million and

play03:08

mean time for it to happen

play03:12

throughout history effectively for x

play03:15

higher than 50 million

play03:16

the mean time is something like uh

play03:21

80 years

play03:24

okay so if it takes 80 years on average

play03:27

for such an event to happen

play03:29

how can you make claims seven years

play03:32

later

play03:34

that you know the world is a better

play03:36

place

play03:37

we haven't had such a war you need to

play03:39

wait

play03:41

let's assume in standard deviation

play03:44

actually this is gaussian the arrival

play03:46

time

play03:47

between events is gaussian this is time

play03:50

you have wars

play03:52

you put a bench 50 million how many

play03:55

times you exceeded

play03:56

the arrival time is gaussian and

play03:58

memoryless and not gaussian sort of like

play04:00

thin tailed but memoryless anyway so you

play04:02

have to wait three times

play04:04

that to start making a statistically

play04:07

significant claim

play04:08

this is not science so another problem

play04:12

with historians with data coming from

play04:15

history particularly when you go back

play04:17

two thousand more than two thousand

play04:18

years

play04:19

that historians are not very rigorous

play04:23

scientists

play04:24

they just cite other people and give you

play04:26

a number

play04:28

so how can we have a

play04:32

statistical inference that takes into

play04:34

account the fact that historians are

play04:36

literally bullshitters as follows

play04:39

we take every conflict you have

play04:43

this is a time you have a conflict you

play04:46

take two numbers

play04:47

low and high because visibly

play04:50

let's say take the recent recent events

play04:53

in the algerian war

play04:55

you have low number what the french

play04:56

think

play04:58

and high number what algerians claim

play05:01

and of course you have inflation over

play05:03

time the french things three hundred

play05:04

thousand say

play05:05

were killed by the event and the

play05:07

algerians saw that a million were killed

play05:09

and of course you have inflation like

play05:11

the hama massacre in

play05:14

syria had inflation started at 2000

play05:17

and reached 40 000 with no information

play05:19

just like you relay

play05:20

the event the number gets bigger so this

play05:23

phenomenon so how can we know

play05:25

if something was reliable in the past so

play05:28

with this

play05:29

what we do it did is built

play05:32

hundreds of over 500 events a little

play05:35

more than 500 events

play05:37

you can build 100k histories

play05:41

you build 100 000 histories where you

play05:44

take

play05:44

this high that low

play05:48

that low high low so you build your

play05:52

history

play05:52

and and from there you test

play05:56

your hypothesis and sure enough

play05:59

practically all sample path

play06:03

throughout history

play06:06

all sample path

play06:14

had a alpha below one

play06:18

all second problem

play06:21

is of course you know the variable the

play06:23

random variable for them to be power law

play06:25

they should reach infinity

play06:27

we did the trick because you have a

play06:29

finite number of people on the planet

play06:32

to transform a hundred percent of the

play06:34

population being killed into something

play06:36

equivalent to infinity and that

play06:39

transformation

play06:40

doesn't change the numbers for what we

play06:42

saw but just allows us to do power law

play06:44

because it changes the numbers say if

play06:46

you have more than eight billion

play06:48

people dying or more than seven billion

play06:51

yeah you see a difference in numbers

play06:53

so it is uh what we call a log

play06:55

transformation

play06:57

and and that allows us to use power laws

play07:00

quite effectively

play07:02

in doing the inference so

play07:05

now i answered the question about wars

play07:07

let me talk about pandemics same methods

play07:09

we use from pandemics

play07:11

we catalog practically all pandemics

play07:13

recorded

play07:15

and figure that there's a power law and

play07:17

then of course you bootstrap

play07:19

you jack now if you remove some

play07:20

observation add uh you know

play07:22

whatever you do get the same result

play07:26

uh both have an alpha below one

play07:29

so both represent existential

play07:32

risks for us and they're both very

play07:36

dangerous

play07:37

so i've answered

play07:40

the first question about war and

play07:42

pandemics

play07:44

second point someone was saying well

play07:46

what do you mean by infinite

play07:47

i never observe infinite mean you never

play07:50

have finite

play07:51

infinite observation it's always finite

play07:55

let me clean this

play07:59

infinite mean means that you run

play08:02

your uh your data

play08:07

and every run will provide you with a

play08:09

different mean

play08:10

and they're all going to be high you

play08:11

don't know what the upper bound is

play08:15

so that's one one way to view it another

play08:18

one

play08:19

is by building sums i have numbers going

play08:22

from x1

play08:23

to whatever x and and m you know you

play08:26

make that large as you want and i take s

play08:28

n sigma x i one over n so

play08:31

the average and you start here

play08:34

if the thing converges it'll be volatile

play08:37

and then it reaches a line if your mean

play08:40

is

play08:41

undefined it'll be all over the map

play08:45

typically the terminology is as follows

play08:47

where you say

play08:48

undefined or infinite

play08:52

if the metric you're looking for say the

play08:55

mean

play08:57

is between zero and infinity you call it

play08:59

infinite if it's between minus infinity

play09:02

and plus infinity you're caused

play09:03

undefined because it could be plus

play09:04

infinity or minus infinity or anywhere

play09:06

in between

play09:08

so power laws and one-tailed

play09:11

have an alpha here but not here

play09:15

okay this is sort of like skewed left

play09:17

right distribution

play09:19

you can have distribution like that or

play09:21

again distribution with both tails

play09:23

this is the tails detail for most

play09:25

variables we're concerned with

play09:28

we have one tail like wars one tail

play09:31

pandemics one tail

play09:34

you don't worry about negative things

play09:37

and finance when we talk about returns

play09:40

log returns can be negative infinity

play09:42

plus infinity

play09:43

something going to zero is negative

play09:44

infinity in long returns

play09:47

sort of the same game we played with uh

play09:50

with

play09:50

pandemics so you have a left tail

play09:54

and you have a right tail

play09:58

where where do power laws come from

play10:03

when we do entropy we're going to look

play10:04

at a minimum

play10:06

uh sorry a maximum entropy distribution

play10:12

and then we'll we'll have a more formal

play10:14

understanding of it

play10:16

and that's the constraints that make

play10:17

things a gaussian

play10:20

not uh it's like we start with the

play10:22

gaussian things become power law no

play10:24

if you put constraints of energy on

play10:25

anything you have a finite variance and

play10:27

that sort of like

play10:29

allows you to use the central limit

play10:33

because you're bounded in the variance

play10:35

but but let me

play10:36

give you a story of how things become

play10:38

paralleled and

play10:40

also why they don't stay there the

play10:42

literature has

play10:43

what's called the matthew effects with

play10:45

rich get richer

play10:47

or something called preferential

play10:48

attachment and the third experiment is

play10:51

as follows

play10:55

you have n people

play10:59

with one over and a location of store

play11:02

storefront

play11:03

okay we have a storefront each one there

play11:06

they all have equal storefront

play11:11

that's day zero day one you have

play11:14

customers

play11:15

visiting these p randomly making

play11:18

uniform so and let's name them x1

play11:22

so x3 xyz so x3

play11:25

has more visitors than the others it's

play11:27

going to have a larger storefront

play11:30

so this is day zero if on day one

play11:34

you now reallocate storefronts because

play11:37

x3 has

play11:41

made some money so she or he you will

play11:43

you know expand

play11:44

so when you expand your probability of

play11:46

getting visitors is no longer one over n

play11:48

it's gonna be whenever you're

play11:52

you're higher than one over n and the

play11:53

other would have lower probability one

play11:55

over n

play11:57

and and and after over time the big will

play12:00

get bigger

play12:00

because you have a higher probability of

play12:03

making money until the thing collapses

play12:05

that's sort of the story another way to

play12:08

view it

play12:09

in saying social

play12:13

contagion effects is that's all i go to

play12:15

a store

play12:17

and i see a friend

play12:21

buying a book okay

play12:24

that person may have randomly bought

play12:26

this book

play12:28

so i go by the book all right

play12:31

now a third person will see two people

play12:34

walking around the same book

play12:35

what's going on i'll buy this book and

play12:37

then sure enough it causes a

play12:39

snowball effect and you'll have a power

play12:42

law

play12:43

i personally don't like these

play12:44

representation i think that the world

play12:47

is naturally power law distributed

play12:51

except when you have constraints

play12:52

constraints of growth constraints of

play12:58

energy constraints of limitations that

play13:01

you have

play13:01

for example a price and the market

play13:04

doesn't have constraints well the gdp

play13:06

is very far away so therefore

play13:09

it can follow a power law however height

play13:12

there are energy constraints or the

play13:14

height constraints are biological

play13:15

constraints that makes human

play13:18

not follow power law and the

play13:20

distribution of height

play13:22

although between species we tend to have

play13:24

parallel like think of a mouse

play13:26

versus an elephant so

play13:29

thank you i'll answer more questions

play13:31

later i just wanted to keep this short

play13:33

have a great day bye now

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
Power LawsHistorical AnalysisStephen PinkerViolence DeclineStatistical InferenceExistential RisksPandemicsWarsTail EventsInfinite VarianceSocial Contagion