How we can protect truth in the age of misinformation | Sinan Aral
Summary
TLDRThe video script discusses the alarming spread of fake news and its impact on society, exemplified by the 2013 Associated Press Twitter hack and its market consequences. It delves into the role of social media in disseminating misinformation, the influence of bots, and the potential dangers of synthetic media. The speaker suggests possible solutions, including labeling, incentives, regulation, transparency, and the use of algorithms, while emphasizing the importance of ethical considerations in combating misinformation.
Takeaways
- 📰 The Associated Press was hacked by Syrian hackers who spread false news of an explosion at the White House, causing a stock market crash and highlighting the impact of misinformation on financial markets.
- 🛑 Automated trading algorithms reacted to the false tweet, demonstrating how quickly misinformation can affect global economies.
- 📈 The fake news incident wiped out $140 billion in equity value, showing the tangible financial consequences of misinformation.
- 🔍 Robert Mueller indicted Russian entities for meddling in the 2016 U.S. election, revealing the extent of foreign influence campaigns through social media.
- 🌐 The Internet Research Agency's activities reached millions on social media, underlining the scale at which misinformation can be disseminated.
- 📊 A study by Oxford University found that a significant portion of information spread on social media during the Swedish elections was fake, indicating a widespread issue.
- 📚 The largest-ever longitudinal study of fake news spread on Twitter showed that false news spreads more rapidly and widely than true news.
- 🤔 The study found that false news is more likely to be retweeted, raising questions about why people are more inclined to share unverified information.
- 🧐 The 'novelty hypothesis' suggests that people share novel information to gain status, which may explain the rapid spread of false news.
- 🤖 The role of bots in spreading misinformation was confirmed, but they affect both true and false news equally, indicating that human behavior is a key factor in the spread of false information.
- 🎭 The rise of synthetic media, powered by technologies like generative adversarial networks, poses a new challenge in distinguishing reality from fiction.
- 🛠️ Addressing the spread of misinformation requires a multifaceted approach, including labeling, incentives, regulation, transparency, and the use of algorithms and machine learning.
Q & A
What was the impact of the fake tweet by Syrian hackers on the stock market in 2013?
-The fake tweet about explosions at the White House and injury to President Barack Obama led to the stock market crashing, wiping out 140 billion dollars in equity value in a single day.
What was the role of the Internet Research Agency in the 2016 U.S. presidential election?
-The Internet Research Agency, a shadowy arm of the Kremlin, spread misinformation on social media during the election, reaching 126 million people on Facebook, issuing three million tweets, and creating 43 hours of YouTube content to sow discord.
What did the study by Oxford University reveal about the spread of misinformation in the recent Swedish elections?
-The study showed that one third of all information spreading on social media about the Swedish elections was fake or misinformation.
What was the main finding of the largest-ever longitudinal study of the spread of fake news online?
-The study found that false news diffused further, faster, deeper, and more broadly than true news in every category, with false political news being the most viral.
Why was the hypothesis about people spreading false news having more followers or being more active on Twitter disproved?
-The study revealed that false-news spreaders actually had fewer followers, were less active, and had been on Twitter for a shorter period of time compared to those spreading true news.
What is the 'novelty hypothesis' proposed to explain the spread of false news?
-The 'novelty hypothesis' suggests that human attention is drawn to novelty, and people like to share novel information because it makes them seem like they have access to inside information, gaining status in the process.
How did sentiment analysis of replies to true and false tweets support the novelty hypothesis?
-Sentiment analysis showed that false news exhibited significantly more surprise and disgust in replies, while true news had more anticipation, joy, and trust, corroborating the idea that new and surprising information is more likely to be shared.
What role did bots play in the spread of false news according to the study?
-Bots were found to accelerate the spread of both false and true news online, but they were not responsible for the differential diffusion of truth and falsity, as humans are primarily responsible for that spread.
What are the two specific technologies mentioned that could make the spread of misinformation even worse?
-The two technologies are 'generative adversarial networks' for creating convincing fake videos and audio, and the democratization of artificial intelligence, allowing anyone to deploy these algorithms to generate synthetic media.
What are some of the potential paths to address the problem of misinformation mentioned in the script?
-The potential paths include labeling information, adjusting incentives to reduce the economic benefits of spreading false news, regulation of social media platforms, increasing transparency about how algorithms work, and developing algorithms and machine learning to identify and mitigate the spread of fake news.
What is the 'transparency paradox' that social media platforms like Facebook are facing?
-The 'transparency paradox' refers to the simultaneous demand for social media platforms to be open and transparent about their algorithms and data while also ensuring the security and privacy of that data.
How does the speaker suggest we should approach the ethical and philosophical questions underlying technological solutions to misinformation?
-The speaker suggests that while technology can help identify and mitigate the spread of misinformation, the ethical and philosophical questions about defining truth and falsity, and determining legitimate opinions and speech, require human involvement and cannot be solved by technology alone.
Outlines
📰 Impact of Fake News on Society and Economy
The paragraph discusses the rapid spread of a false tweet about an explosion at the White House in 2013, which was retweeted extensively and led to a significant stock market crash, resulting in a loss of 140 billion dollars in equity value. It highlights the role of automated trading algorithms responding to the sentiment of the tweet. The paragraph also touches on the indictments issued by Robert Mueller against Russian entities for their involvement in the 2016 U.S. presidential election through the Internet Research Agency's misinformation campaigns on social media. The study by Oxford University on the prevalence of fake news in the Swedish elections and the potential for social-media misinformation to incite violence is also mentioned. The speaker introduces their research on the spread of fake news online, comparing the diffusion of true and false news stories verified by fact-checking organizations, and noting that false news spreads more widely and rapidly than true news.
🧐 The Novelty Hypothesis and Emotional Response to Fake News
This paragraph delves into the reasons behind the viral nature of false news, proposing a 'novelty hypothesis' which suggests that humans are drawn to and share novel information due to its perceived value and the status it confers. The speaker's research measured the novelty of tweets and analyzed the sentiment in replies to both true and false news, finding that false news elicited more surprise and disgust, while true news received more anticipation, joy, and trust. The paragraph also addresses the role of bots in spreading misinformation, concluding that while bots do accelerate the spread of news, they affect both true and false news equally, indicating that the differential spread of truth and falsity is a human-driven phenomenon. The speaker foreshadows the worsening of the issue with the advent of synthetic media powered by technologies like generative adversarial networks, which can create convincing fake audio and video.
🛠 Possible Solutions to Combat Fake News
The final paragraph outlines potential strategies to address the challenges posed by fake news. It begins by discussing the idea of labeling information, similar to food labeling in grocery stores, to provide consumers with the source and credibility of the information. The paragraph then explores the concept of incentives, suggesting that reducing the spread of false news could diminish the economic motivation for its creation. Regulation is presented as another avenue, with the current exploration in the U.S. of how to regulate social media platforms like Facebook. The paragraph also discusses the importance of transparency in understanding how algorithms work and the challenges of balancing openness with data security. Finally, the paragraph considers the role of algorithms and machine learning in identifying and controlling the spread of fake news, emphasizing the need for human involvement in addressing ethical and philosophical questions about truth and legitimacy in the era of synthetic media.
Mindmap
Keywords
💡Fake News
💡Viral
💡Automated Trading Algorithms
💡Indictment
💡Internet Research Agency
💡Misinformation
💡Longitudinal Study
💡Novelty Hypothesis
💡Sentiment Analysis
💡Bots
💡Synthetic Media
💡Generative Adversarial Networks (GANs)
💡Transparency
💡Ethics and Philosophy
Highlights
On April 23, 2013, a false tweet by Syrian hackers claiming explosions at the White House was retweeted 4,000 times in less than five minutes, causing a stock market crash and wiping out $140 billion in equity value.
Automated trading algorithms reacted to the fake tweet's sentiment, demonstrating the impact of misinformation on financial markets.
The indictment by Robert Mueller against Russian entities for election interference highlights the role of the Internet Research Agency in spreading misinformation during the 2016 US presidential election.
The Internet Research Agency's efforts reached 126 million people on Facebook, issued 3 million tweets, and created 43 hours of YouTube content with fake news designed to sow discord.
A study by Oxford University revealed that one third of information spread on social media about the recent Swedish elections was fake or misinformation.
Social-media misinformation campaigns can lead to 'genocidal propaganda,' as seen in the case of the Rohingya in Burma, triggering mob killings in India.
The largest-ever longitudinal study of fake news spread on Twitter from 2006 to 2017 was published in 'Science', analyzing the diffusion of verified true and false news stories.
False news was found to diffuse further, faster, deeper, and more broadly than true news in every category studied, with false political news being the most viral.
Contrary to initial hypotheses, false-news spreaders tended to have fewer followers, were less active, and had been on Twitter for a shorter time compared to those spreading true news.
False news was 70 percent more likely to be retweeted than true news, even when controlling for various factors.
The 'novelty hypothesis' suggests that human attention is drawn to novelty, and sharing novel information can increase social status, which may explain the spread of false news.
Sentiment analysis of replies to tweets showed that false news elicited more surprise and disgust, while true news received more anticipation, joy, and trust.
Bots were found to accelerate the spread of both false and true news online, but they did not account for the differential diffusion of truth and falsity.
The rise of synthetic media, powered by technologies like generative adversarial networks, poses a significant threat to the辨别ability of real and fake information.
The White House's use of a doctored video to justify revoking a reporter's press pass highlights the real-world implications of synthetic media.
Potential solutions to combat misinformation include labeling, incentives, regulation, transparency, and the use of algorithms and machine learning, each with its own challenges.
The ethical and philosophical questions underlying any technological solution involve defining truth and falsity and determining the legitimacy of opinions and speech.
The speaker calls for vigilance in defending the truth against misinformation, emphasizing the role of technology, policy, and individual responsibility.
Transcripts
Transcriber: Ivana Korom Reviewer: Krystian Aparta
So, on April 23 of 2013,
the Associated Press put out the following tweet on Twitter.
It said, "Breaking news:
Two explosions at the White House
and Barack Obama has been injured."
This tweet was retweeted 4,000 times in less than five minutes,
and it went viral thereafter.
Now, this tweet wasn't real news put out by the Associated Press.
In fact it was false news, or fake news,
that was propagated by Syrian hackers
that had infiltrated the Associated Press Twitter handle.
Their purpose was to disrupt society, but they disrupted much more.
Because automated trading algorithms
immediately seized on the sentiment on this tweet,
and began trading based on the potential
that the president of the United States had been injured or killed
in this explosion.
And as they started tweeting,
they immediately sent the stock market crashing,
wiping out 140 billion dollars in equity value in a single day.
Robert Mueller, special counsel prosecutor in the United States,
issued indictments against three Russian companies
and 13 Russian individuals
on a conspiracy to defraud the United States
by meddling in the 2016 presidential election.
And what this indictment tells as a story
is the story of the Internet Research Agency,
the shadowy arm of the Kremlin on social media.
During the presidential election alone,
the Internet Agency's efforts
reached 126 million people on Facebook in the United States,
issued three million individual tweets
and 43 hours' worth of YouTube content.
All of which was fake --
misinformation designed to sow discord in the US presidential election.
A recent study by Oxford University
showed that in the recent Swedish elections,
one third of all of the information spreading on social media
about the election
was fake or misinformation.
In addition, these types of social-media misinformation campaigns
can spread what has been called "genocidal propaganda,"
for instance against the Rohingya in Burma,
triggering mob killings in India.
We studied fake news
and began studying it before it was a popular term.
And we recently published the largest-ever longitudinal study
of the spread of fake news online
on the cover of "Science" in March of this year.
We studied all of the verified true and false news stories
that ever spread on Twitter,
from its inception in 2006 to 2017.
And when we studied this information,
we studied verified news stories
that were verified by six independent fact-checking organizations.
So we knew which stories were true
and which stories were false.
We can measure their diffusion,
the speed of their diffusion,
the depth and breadth of their diffusion,
how many people become entangled in this information cascade and so on.
And what we did in this paper
was we compared the spread of true news to the spread of false news.
And here's what we found.
We found that false news diffused further, faster, deeper
and more broadly than the truth
in every category of information that we studied,
sometimes by an order of magnitude.
And in fact, false political news was the most viral.
It diffused further, faster, deeper and more broadly
than any other type of false news.
When we saw this,
we were at once worried but also curious.
Why?
Why does false news travel so much further, faster, deeper
and more broadly than the truth?
The first hypothesis that we came up with was,
"Well, maybe people who spread false news have more followers or follow more people,
or tweet more often,
or maybe they're more often 'verified' users of Twitter, with more credibility,
or maybe they've been on Twitter longer."
So we checked each one of these in turn.
And what we found was exactly the opposite.
False-news spreaders had fewer followers,
followed fewer people, were less active,
less often "verified"
and had been on Twitter for a shorter period of time.
And yet,
false news was 70 percent more likely to be retweeted than the truth,
controlling for all of these and many other factors.
So we had to come up with other explanations.
And we devised what we called a "novelty hypothesis."
So if you read the literature,
it is well known that human attention is drawn to novelty,
things that are new in the environment.
And if you read the sociology literature,
you know that we like to share novel information.
It makes us seem like we have access to inside information,
and we gain in status by spreading this kind of information.
So what we did was we measured the novelty of an incoming true or false tweet,
compared to the corpus of what that individual had seen
in the 60 days prior on Twitter.
But that wasn't enough, because we thought to ourselves,
"Well, maybe false news is more novel in an information-theoretic sense,
but maybe people don't perceive it as more novel."
So to understand people's perceptions of false news,
we looked at the information and the sentiment
contained in the replies to true and false tweets.
And what we found
was that across a bunch of different measures of sentiment --
surprise, disgust, fear, sadness,
anticipation, joy and trust --
false news exhibited significantly more surprise and disgust
in the replies to false tweets.
And true news exhibited significantly more anticipation,
joy and trust
in reply to true tweets.
The surprise corroborates our novelty hypothesis.
This is new and surprising, and so we're more likely to share it.
At the same time, there was congressional testimony
in front of both houses of Congress in the United States,
looking at the role of bots in the spread of misinformation.
So we looked at this too --
we used multiple sophisticated bot-detection algorithms
to find the bots in our data and to pull them out.
So we pulled them out, we put them back in
and we compared what happens to our measurement.
And what we found was that, yes indeed,
bots were accelerating the spread of false news online,
but they were accelerating the spread of true news
at approximately the same rate.
Which means bots are not responsible
for the differential diffusion of truth and falsity online.
We can't abdicate that responsibility,
because we, humans, are responsible for that spread.
Now, everything that I have told you so far,
unfortunately for all of us,
is the good news.
The reason is because it's about to get a whole lot worse.
And two specific technologies are going to make it worse.
We are going to see the rise of a tremendous wave of synthetic media.
Fake video, fake audio that is very convincing to the human eye.
And this will powered by two technologies.
The first of these is known as "generative adversarial networks."
This is a machine-learning model with two networks:
a discriminator,
whose job it is to determine whether something is true or false,
and a generator,
whose job it is to generate synthetic media.
So the synthetic generator generates synthetic video or audio,
and the discriminator tries to tell, "Is this real or is this fake?"
And in fact, it is the job of the generator
to maximize the likelihood that it will fool the discriminator
into thinking the synthetic video and audio that it is creating
is actually true.
Imagine a machine in a hyperloop,
trying to get better and better at fooling us.
This, combined with the second technology,
which is essentially the democratization of artificial intelligence to the people,
the ability for anyone,
without any background in artificial intelligence
or machine learning,
to deploy these kinds of algorithms to generate synthetic media
makes it ultimately so much easier to create videos.
The White House issued a false, doctored video
of a journalist interacting with an intern who was trying to take his microphone.
They removed frames from this video
in order to make his actions seem more punchy.
And when videographers and stuntmen and women
were interviewed about this type of technique,
they said, "Yes, we use this in the movies all the time
to make our punches and kicks look more choppy and more aggressive."
They then put out this video
and partly used it as justification
to revoke Jim Acosta, the reporter's, press pass
from the White House.
And CNN had to sue to have that press pass reinstated.
There are about five different paths that I can think of that we can follow
to try and address some of these very difficult problems today.
Each one of them has promise,
but each one of them has its own challenges.
The first one is labeling.
Think about it this way:
when you go to the grocery store to buy food to consume,
it's extensively labeled.
You know how many calories it has,
how much fat it contains --
and yet when we consume information, we have no labels whatsoever.
What is contained in this information?
Is the source credible?
Where is this information gathered from?
We have none of that information
when we are consuming information.
That is a potential avenue, but it comes with its challenges.
For instance, who gets to decide, in society, what's true and what's false?
Is it the governments?
Is it Facebook?
Is it an independent consortium of fact-checkers?
And who's checking the fact-checkers?
Another potential avenue is incentives.
We know that during the US presidential election
there was a wave of misinformation that came from Macedonia
that didn't have any political motive
but instead had an economic motive.
And this economic motive existed,
because false news travels so much farther, faster
and more deeply than the truth,
and you can earn advertising dollars as you garner eyeballs and attention
with this type of information.
But if we can depress the spread of this information,
perhaps it would reduce the economic incentive
to produce it at all in the first place.
Third, we can think about regulation,
and certainly, we should think about this option.
In the United States, currently,
we are exploring what might happen if Facebook and others are regulated.
While we should consider things like regulating political speech,
labeling the fact that it's political speech,
making sure foreign actors can't fund political speech,
it also has its own dangers.
For instance, Malaysia just instituted a six-year prison sentence
for anyone found spreading misinformation.
And in authoritarian regimes,
these kinds of policies can be used to suppress minority opinions
and to continue to extend repression.
The fourth possible option is transparency.
We want to know how do Facebook's algorithms work.
How does the data combine with the algorithms
to produce the outcomes that we see?
We want them to open the kimono
and show us exactly the inner workings of how Facebook is working.
And if we want to know social media's effect on society,
we need scientists, researchers
and others to have access to this kind of information.
But at the same time,
we are asking Facebook to lock everything down,
to keep all of the data secure.
So, Facebook and the other social media platforms
are facing what I call a transparency paradox.
We are asking them, at the same time,
to be open and transparent and, simultaneously secure.
This is a very difficult needle to thread,
but they will need to thread this needle
if we are to achieve the promise of social technologies
while avoiding their peril.
The final thing that we could think about is algorithms and machine learning.
Technology devised to root out and understand fake news, how it spreads,
and to try and dampen its flow.
Humans have to be in the loop of this technology,
because we can never escape
that underlying any technological solution or approach
is a fundamental ethical and philosophical question
about how do we define truth and falsity,
to whom do we give the power to define truth and falsity
and which opinions are legitimate,
which type of speech should be allowed and so on.
Technology is not a solution for that.
Ethics and philosophy is a solution for that.
Nearly every theory of human decision making,
human cooperation and human coordination
has some sense of the truth at its core.
But with the rise of fake news,
the rise of fake video,
the rise of fake audio,
we are teetering on the brink of the end of reality,
where we cannot tell what is real from what is fake.
And that's potentially incredibly dangerous.
We have to be vigilant in defending the truth
against misinformation.
With our technologies, with our policies
and, perhaps most importantly,
with our own individual responsibilities,
decisions, behaviors and actions.
Thank you very much.
(Applause)
5.0 / 5 (0 votes)