How AIs, like ChatGPT, Learn

CGP Grey
18 Dec 201708:54

Summary

TLDRThis video script delves into the pervasive yet mysterious world of algorithms that govern our digital experiences. It explains how these complex bots, initially built through a process of trial and error by simpler bots, evolve to perform tasks beyond human comprehension. Despite their creators' inability to fully understand their inner workings, these algorithms shape our online interactions, from video recommendations to financial transactions, often driven by the relentless pursuit of data to improve their accuracy. The script concludes with a humorous nod to the irony of being governed by bots while also asking viewers to engage with the content to influence algorithmic decisions.

Takeaways

  • 🌐 Algorithms are pervasive on the internet, influencing what we see, buy, and how we interact with various platforms.
  • πŸ‘€ Companies are reticent to disclose how their algorithms work due to the competitive advantage and trade secret nature of their technology.
  • πŸ€– Modern algorithms are often too complex for humans to fully understand, even the developers who created them.
  • 🧠 The process of creating algorithms involves 'builder bots' that construct other bots and 'teacher bots' that test them, using a trial-and-error methodology.
  • πŸ”„ The iterative process of testing, building, and refining bots leads to increasingly effective algorithms over time.
  • πŸ“ˆ The effectiveness of algorithms is measured by how well they perform in specific tasks, such as recognizing images or recommending content.
  • πŸ“Š More data leads to longer and more comprehensive tests, which in turn results in the development of better algorithms.
  • πŸ” Algorithms are not just tools; they are also used to test and manipulate human behavior, as seen in the way content is recommended to keep users engaged.
  • πŸ”‘ The success of an algorithm is often determined by its performance on tests designed by humans, highlighting the ongoing interplay between human intention and machine learning.
  • 🌟 Despite the complexity and sometimes mysterious inner workings of algorithms, they are an integral part of our digital lives and are here to stay.

Q & A

  • How do algorithms influence our online experiences?

    -Algorithms shape our online experiences by determining what content we see, set prices, monitor transactions for fraud, and even decide which videos to recommend on platforms like NetMeTube.

  • Why is it difficult for humans to provide explicit instructions for complex algorithmic tasks?

    -Many problems are too large and complex for humans to write simple instructions for. For instance, identifying fraudulent transactions among billions of financial transactions per second or selecting videos from an almost infinite library for recommendation.

  • How do companies protect the secrets of their algorithmic bots?

    -Companies treat their algorithmic bots as valuable employees and guard the details of their functioning as fiercely protected trade secrets.

  • What is the process of creating an algorithmic bot that can recognize images?

    -A builder bot creates student bots with random connections, which are then tested by a teacher bot using a set of labeled images. The best-performing bots are selected, replicated with variations, and the process repeats until a bot can recognize images effectively.

  • Why do algorithms sometimes fail to understand complex or nuanced tasks?

    -Algorithmic bots are often good at specific tasks they have been trained for but may fail at others, such as understanding videos, recognizing upside-down images, or distinguishing between unrelated objects.

  • How does the process of building and testing algorithmic bots relate to the concept of evolution?

    -The process resembles evolution, where the builder bot selects the 'fittest' bots (those that perform best in tests), replicates them with variations, and discards the rest, leading to gradual improvement over many iterations.

  • What role does data play in improving the performance of algorithmic bots?

    -Data is crucial for creating longer and more comprehensive tests, which in turn lead to better-performing bots. More data allows for more extensive testing and fine-tuning of the bots' abilities.

  • How do algorithmic bots determine what content to show users on platforms like NetMeTube?

    -Bots are tested on their ability to keep users engaged on the platform. They analyze user behavior and select content that maximizes the time users spend on the site.

  • Why do companies collect vast amounts of user data?

    -Companies collect data to create more extensive tests for their algorithmic bots, which helps improve the bots' performance and accuracy in tasks such as content recommendation or user engagement.

  • What is the significance of the 'Are you human?' tests on websites?

    -These tests serve a dual purpose: they verify that the respondent is a human, and they also contribute to the training data used to improve bots' ability to recognize and respond to human-like inputs.

  • How do algorithmic bots impact the way content is selected and presented on social media platforms?

    -Bots are used to analyze user preferences and interactions to select and prioritize content that is most likely to engage users, such as posts, articles, or videos, based on their past behavior and preferences.

Outlines

00:00

πŸ€– The Ubiquity and Complexity of Algorithms

This paragraph delves into the omnipresence of algorithms in our digital lives, highlighting how they influence our online experiences. Algorithms are responsible for showing us videos, curating content, setting prices, and even monitoring financial transactions for fraud. The paragraph discusses the evolution of these bots from simple 'If this, then that' instructions to complex systems that can handle vast amounts of data beyond human comprehension. It introduces the concept of machine learning, where bots are not programmed with explicit instructions but are instead trained through a process of trial and error, using large datasets to improve their performance. The limitations of understanding how these advanced algorithms work are also touched upon, emphasizing the black-box nature of modern AI systems.

05:01

πŸ“Š The Evolution of Learning Algorithms

The second paragraph explores the process of training algorithms through a cycle of building, testing, and refining. It describes a system where a 'builder bot' creates student bots with random connections, which are then tested by a 'teacher bot' that can only evaluate their performance. The best-performing bots are selected, modified, and the process repeats, gradually improving the bots' abilities. This iterative process is likened to natural selection, where only the most effective bots survive. The paragraph also touches on the limitations of these bots, which are often highly specialized and may not perform well outside their trained domain. It concludes with a commentary on the role of data in training these bots and the implications of using algorithms that are not fully understood, even by their creators.

Mindmap

Keywords

πŸ’‘Algorithm

An algorithm is a set of rules or steps used to solve a problem or perform a computation. In the context of the video, algorithms are pervasive in the digital world, influencing what content users see, how data is processed, and even setting prices. The video discusses how algorithms are becoming increasingly complex and opaque, to the point where even their creators may not fully understand their inner workings, as exemplified by the statement, 'But how these bots work exactly, more and more, no one knows.'

πŸ’‘Algorithmic bots

Algorithmic bots refer to automated software agents that perform tasks using algorithms. The video uses the term to describe the systems that curate content, detect fraud, and make decisions at scale. These bots are portrayed as both incredibly powerful and somewhat mysterious, as they are built and evolve through a process that even their creators may not fully comprehend, as indicated by the narrative about building bots that build bots.

πŸ’‘Linear algebra

Linear algebra is a branch of mathematics that deals with linear equations, linear transformations, and vector spaces. The video humorously suggests that the cutting edge of algorithm development might require a liking for linear algebra, implying that advanced mathematical concepts are foundational to the complex algorithms that drive modern technology.

πŸ’‘Machine learning

Machine learning is a subset of artificial intelligence that allows computers to learn from data and improve from experience without being explicitly programmed. The video touches on machine learning through the process of 'building a bot that builds bots,' which is a metaphor for how machine learning models are trained and refined through iterative processes to improve their performance.

πŸ’‘Neural networks

Neural networks are a set of algorithms modeled loosely after the human brain that are designed to recognize patterns. They are a key component in machine learning and are mentioned in the video as a complex system that is difficult to fully understand, even by the humans who design them, which relates to the broader theme of the video about the enigmatic nature of advanced algorithms.

πŸ’‘Data

Data refers to the raw facts and statistics collected through observation and used for reference or analysis. The video emphasizes the importance of data in training algorithms, suggesting that 'more data equals longer tests equals better bots.' It highlights how companies are obsessed with collecting data to improve their algorithms' performance and decision-making capabilities.

πŸ’‘Fraud detection

Fraud detection involves identifying potentially deceitful transactions or activities. In the video, it is mentioned as one of the tasks that algorithms perform, watching transactions for fraud at banks. This illustrates the practical application of algorithms in safeguarding financial systems.

πŸ’‘Recommendation systems

Recommendation systems are algorithms that suggest items or content to users based on their preferences. The video discusses how these systems decide what content users see on platforms like NetMeTube, highlighting the influence of algorithms on user experience and content consumption.

πŸ’‘Trade secret

A trade secret refers to confidential business information that gives an enterprise a competitive edge. The video mentions that companies guard the inner workings of their algorithms as trade secrets, emphasizing the commercial value and proprietary nature of these complex systems.

πŸ’‘User interaction

User interaction refers to the ways in which users engage with a system or service. The video discusses how algorithms are designed to increase user interaction, such as keeping users watching videos on platforms, which underscores the role of algorithms in shaping user behavior and experience.

πŸ’‘Watch time

Watch time is a metric used to measure the total amount of time users spend watching content. In the video, it is used as an example of how algorithms are trained to maximize user engagement, indicating the importance of this metric in the performance evaluation of algorithmic bots.

Highlights

Algorithms are pervasive on the internet, shaping user experiences.

User actions on platforms are influenced and monitored by algorithms.

Algorithms decide content visibility on social media platforms like TweetBook.

Algorithms assist in photo organization and creation of automated content.

Financial transactions are influenced by algorithms that detect fraudulent activity.

The stock market heavily relies on algorithmic trading systems.

Modern algorithmic bots are built to solve problems too complex for human-crafted instructions.

Algorithmic solutions to large-scale problems like fraud detection are not fully understood, even by their creators.

Companies consider the inner workings of their algorithms as highly valuable trade secrets.

The current state of algorithm development is driven by linear algebra and other complex mathematical concepts.

A method for building algorithms involves creating 'builder bots' that construct other bots.

Teacher bots test the effectiveness of student bots without truly understanding the tasks themselves.

Through a process of trial and error, algorithms evolve to perform tasks with increasing accuracy.

The complexity of modern algorithms makes their decision-making processes incomprehensible.

Algorithms are trained on vast amounts of data to improve their performance.

User interactions, such as completing CAPTCHA tests, contribute to the training of algorithms.

Algorithms can be designed to maximize user engagement on platforms like NetMeTube.

The process of algorithm development involves iterative testing and refinement.

Algorithms are selected based on their performance in tests designed by humans.

The understanding of how algorithms operate is limited, even by their creators.

The reliance on algorithms is increasing, and we must learn to guide them through the tests we design.

The video humorously suggests that viewer engagement through likes and subscriptions influences algorithmic visibility.

Transcripts

play00:00

On the internet, the algorithms are all around you.

play00:03

You are watching this video because an algorithm brought it to you (among others) to click,

play00:07

which you did, and the algorithm took note.

play00:10

When you open the TweetBook, A the algorithm decides what you see.

play00:13

When you search through your photos, A the algorithm does the finding.

play00:16

Maybe even makes a little movie for you.

play00:18

When you buy something, A the algorithm sets the price

play00:21

and A the algorithm is at your bank watching transactions for fraud.

play00:26

The stock market is full of algorithms

play00:28

trading with algorithms.

play00:29

Given this, you might want to know how these little algorithmic bots shaping your world work,

play00:34

especially when they don't.

play00:36

In Ye Olden Days,

play00:37

humans built algorithmic bots by giving them instructions the humans could explain.

play00:41

"If this, then that."

play00:43

But many problems are just too big and hard for a human to write simple instructions for.

play00:49

There's a gazillion financial transactions a second, which ones are fraudulent?

play00:53

There's octillion videos on NetMeTube.

play00:55

Which eight should the user see as recommendations? Which shouldn't be allowed on the site at all?

play01:01

For this airline seat, what is the maximum price this user will pay right now?

play01:06

Algorithmic bots give answers to these questions.

play01:08

Not perfect answers, but much better than a human could do.

play01:11

But how these bots work exactly, more and more, no one knows.

play01:15

Not even the humans who built them,

play01:17

or "built them",

play01:18

as we will see...

play01:19

Now companies that use these bots don't want to talk about how they work

play01:23

because the bots are valuable employees.

play01:25

Very, VERY valuable.

play01:27

And how their brains are built is a fiercely guarded trade secret.

play01:30

Right now the cutting edge is most likely very

play01:33

'I hope you like linear algebra',

play01:34

but what the current hotness is on any particular site

play01:37

and how the bots work, is a bit "I dunno", and always will be.

play01:41

So let's talk about one of the more quaint but understandable ways bots CAN be "built"

play01:45

without understanding how their brains work.

play01:48

Say you want a bot that can recognize what is in a picture.

play01:51

Is it a bee, or is it a three?

play01:53

It's easy for humans (even little humans),

play01:55

but it's impossible to just tell a bot in bot language how to do it,

play01:59

because really we just know that's a bee and that's a three.

play02:03

We can say in words what makes them different, but bots don't understand words.

play02:07

And it's the wiring in our brains that makes it happen anyway.

play02:10

While an individual neuron may be understood, and clusters of neurons' general purpose vaguely grasped,

play02:16

the whole is beyond.

play02:18

Nonetheless, it works.

play02:20

So to get a bot that can do this sorting,

play02:22

you don't build it yourself.

play02:23

You build a bot that builds bots, and a bot that teaches bots.

play02:27

These bots' brains are simpler, something a smart human programmer can make.

play02:31

The builder bot builds bots, though it's not very good at it.

play02:35

At first it connects the wires and modules in the bot brains almost at random.

play02:39

This leads to some very...

play02:41

"special" student bots sent to teacher bot to teach.

play02:44

Of course, teacher bot can't tell a bee from a three either;

play02:47

if the human could build teacher bot to do that, well, then, problem solved.

play02:51

Instead the human gives teacher bot a bunch of "bee" photos, and "three" photos,

play02:54

and an answer key to which is what.

play02:56

Teacher bot can't teach,

play02:58

but teacher bot can TEST.

play03:00

The adorkable student bots stick out their tongues, try very hard,

play03:03

but they are bad at what they do.

play03:05

Very, VERY, bad.

play03:07

And it's not their fault, really, they were built that way.

play03:10

Grades in hand, the student bots take a march of shame back to builder bot.

play03:13

those that did best are put to one side,

play03:15

the others recycled.

play03:17

Builder bot still isn't good at building bots,

play03:19

but now it takes those left and makes copies with changes in new combinations.

play03:23

Back to school they go.

play03:25

Teacher bot teaches - er, tests again, and builder bot builds again.

play03:28

And again, and again.

play03:30

Now a builder that builds at random, and a teacher that doesn't teach, just tests,

play03:34

and students who can't learn, they just are what they are, in theory shouldn't work,

play03:38

but in practice, it does.

play03:39

Partly because in every iteration, builder bot's slaughterhouse keeps the best and discards the rest,

play03:45

and partly because teacher bot isn't overseeing an old-timey, one-room schoolhouse with a dozen students,

play03:50

but an infinite warehouse with thousands of students.

play03:54

The test isn't ten questions, but a million questions.

play03:57

And how many times does the test, build, test loop repeat?

play04:01

As many as necessary.

play04:03

At first students that survive are just lucky,

play04:06

but by combining enough lucky bots, and keeping only what works,

play04:10

and randomly messing around with new copies of that

play04:13

eventually a student bot emerges that isn't lucky,

play04:16

that can perhaps barely tell bees from threes.

play04:19

As this bot is copied and changed, slowly the average test score rises,

play04:23

and thus the grade needed to survive the next round gets higher and higher.

play04:27

Keep this up and eventually from the infinite warehouse

play04:30

(slaughterhouse)

play04:31

a student bot will emerge, who can tell a bee from a three in a photo it's never seen before pretty well.

play04:36

But how the student bot does this, neither the teacher bot nor the builder bot,

play04:40

nor the human overseer, can understand.

play04:43

Nor the student bot itself.

play04:45

After keeping so many useful random changes, the wiring in its head is incredibly complicated,

play04:51

and while an individual line of code may be understood, and clusters of code's general purpose vaguely grasped,

play04:57

the whole is beyond,

play04:58

nonetheless, it works.

play05:00

But this is frustrating, especially as the student bot is very good at exactly

play05:05

only the kinds of questions it's been taught to.

play05:08

It's great with photos, but useless with videos or baffled if the photos are upside down,

play05:13

or things that are obviously not bees, it's confident are.

play05:17

Since teacher bot can't teach,

play05:18

all the human overseer can do is give it more questions, to make the test even longer,

play05:23

to include the kinds of questions the best bots get wrong.

play05:26

This is important to understand.

play05:28

It's a reason why companies are obsessed with collecting data.

play05:32

More data equals longer tests equals better bots.

play05:35

So when you get the "Are you human?" test on a website,

play05:38

you are not only proving that you are human, (hopefully),

play05:41

but you are also helping to build the test to make bots that can read, or count,

play05:45

or tell lakes from mountains, or horses from humans.

play05:47

Seeing lots of questions about driving lately?

play05:50

Hmm...! What could that be building a test for?

play05:52

Now figuring out what's in a photo, or on a sign, or filtering videos,

play05:56

requires humans to make correct enough tests.

play05:59

But there is another kind of test that makes itself.

play06:02

Tests ON the humans.

play06:04

For example, say entirely hypothetical NetMeTube wanted users to keep watching as long as possible?

play06:11

Well, how long a user stays on the site is easy to measure.

play06:14

So, teacher bot gives each student bot a bunch of NetMeTube users to oversee,

play06:18

the student bots watch what their user watches, looks at their files,

play06:21

and do their best to pick the videos that keep the user on the site.

play06:24

The longer the average, the higher their test score.

play06:27

Build, test, repeat.

play06:29

A million cycles later, there's a student bot who's pretty good at keeping the users watching,

play06:34

at least compared to what a human could build.

play06:36

But when people ask: "How does the NetMeTube algorithm select videos?"

play06:40

Once again, there isn't a great answer other than pointing to the bot,

play06:44

and the user data it had access to,

play06:46

and most vitally, how the human overseers direct teacher bot to score the test.

play06:51

That's what the bot is trying to be good at to survive.

play06:54

But what the bot is thinking, or how it thinks it, is not really knowable.

play06:59

All that's knowable is this student bot gets to be the algorithm,

play07:03

because it's point one percent better than the previous bot at the test the humans designed.

play07:09

So everywhere on the internet, behind the scenes, there are tests to increase user interaction,

play07:13

or set prices just right to maximize revenue,

play07:17

or pick the posts from all your friends you'll like the most, or articles people will share the most, or whatever.

play07:22

If it's testable, it's teachable. Well, "teachable",

play07:24

and a student bot will graduate from the warehouse to be the algorithm of its domain.

play07:29

At least, for a little while.

play07:31

We're used to the idea that the tools we use, even if we don't understand them, someone does,

play07:36

but with our machines that learn we are increasingly in a position where we use tools,

play07:40

or are used by tools,

play07:42

that no one, not even their creators, understand.

play07:45

We can only hope to guide them with the tests we make,

play07:49

and we need to get comfortable with that,

play07:50

as our algorithmic bot buddies are all around, and not going anywhere.

play07:58

OK. The bots are watching.

play08:00

You know what's coming.

play08:02

This is where I need to ask you...

play08:04

To like...

play08:06

comment...

play08:07

...and subscribe.

play08:09

And bell me.

play08:11

And share on the TweetBook.

play08:13

The algorithm is watching.

play08:15

It won't show people the video...

play08:18

unless you do this.

play08:21

Look what you've reduced me to, bots.

play08:24

What do you want? Do you want watch time?

play08:26

Is that what you want?

play08:28

Fine.

play08:30

(sigh...) Hey guys, did you know I also have podcasts you can listen to?

play08:34

Maybe even just in the background while you're tidying up your all room for hours? Or whatever?

play08:40

There's hours of audio entertainment for you, and watch time for the bots overseeing your actions.

play08:47

Go ahead and - and take a click. Entertain yourself.

play08:50

Help me.

play08:51

Help the bots.

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
AlgorithmsArtificial IntelligenceMachine LearningData CollectionDigital InfluenceInternet BotsUser EngagementTech MysteryAI EthicsDigital Marketing