Why it's harder for AI to open doors than play chess | Pulkit Agrawal | TEDxMIT

TEDx Talks
9 Apr 202318:58

Summary

TLDRIn this talk, the presenter explores the paradox of intelligence, highlighting that tasks humans find easy, like opening doors, are challenging for robots, while complex cognitive tasks like chess are relatively easy for AI. The speaker discusses the Model X paradox, where physical intelligence is undervalued compared to cognitive intelligence. They argue for the importance of developing physical intelligence in AI, showcasing simulations and real-world applications of robots learning to walk on various terrains and manipulate objects. The talk emphasizes the need for a holistic approach, including hardware and perception, to achieve true artificial intelligence.

Takeaways

  • 🤖 The speaker discusses the paradox that tasks requiring physical intelligence, like opening a door, are more challenging for robots than intellectual tasks like playing chess, which AI has mastered.
  • 🏆 AI systems have been able to surpass human capabilities in complex games like chess and Go, but physical tasks remain a significant challenge.
  • 🧠 The 'Moravec's paradox' highlights the contrast between what humans perceive as hard (intellectual tasks) and what robots find hard (sensory-motor skills).
  • 🕵️‍♂️ The speaker cites experts like Hans Moravec and Stephen Pinker, who argue that high-level reasoning requires less computation than sensory-motor skills.
  • 🧬 The evolution of intelligence is discussed, with a focus on how evolution has spent much more time developing sensory-motor skills than language or reasoning.
  • 🌐 The speaker emphasizes the importance of physical intelligence in AI, suggesting that it is a prerequisite for true artificial intelligence.
  • 🤝 The use of simulation is highlighted as a key technique for generating data and training AI systems to perform complex physical tasks.
  • 🤸‍♂️ Examples of robots learning to walk and manipulate objects in simulation and then transferring these skills to the real world are provided.
  • 🤲 The importance of hardware design, such as hands with touch sensing, is discussed in the context of developing physical intelligence.
  • 🔧 The speaker argues for a 'full stack' approach to physical intelligence, combining control algorithms, perception, and hardware design.

Q & A

  • What is the main paradox discussed in the script regarding AI capabilities?

    -The main paradox discussed is the Moravec's paradox, which states that tasks requiring high-level reasoning that are considered difficult for humans are easy for AI, while tasks considered simple for humans, such as sensory motor skills, are actually difficult for AI.

  • Why do AI systems struggle with tasks like opening doors, despite their ability to play complex games like chess?

    -AI systems struggle with tasks like opening doors because these tasks require a level of physical intelligence and sensory motor skills that are not as developed as their reasoning capabilities. These tasks involve understanding and interacting with the physical world in a complex way, which is still challenging for AI.

  • What is the 'Model X Paradox' mentioned in the script?

    -The 'Model X Paradox' is a concept that emerged from observations that AI systems can solve complex cognitive tasks like chess relatively easily, but struggle with tasks involving physical intelligence, such as walking or opening a door, which humans find easy.

  • How does the script explain the difference in difficulty between chess and sensory motor skills for AI?

    -The script explains that while chess, which requires reasoning, is considered a complex task for humans, it is relatively easy for AI systems. In contrast, sensory motor skills, which are second nature to humans, require enormous computational power and are much harder for AI to replicate accurately.

  • What role does human intuition play in the development of AI, according to the script?

    -Human intuition plays a significant role in the development of AI by influencing what tasks are prioritized and how they are approached. The script suggests that human intuition often misjudges the difficulty of tasks for AI, leading to a focus on cognitive tasks at the expense of physical intelligence.

  • What is the significance of the timeline provided in the script regarding the evolution of intelligence?

    -The timeline in the script is significant because it illustrates the vast amount of time evolution has spent developing sensory motor skills compared to language and reasoning. This highlights the complexity of physical intelligence and suggests that AI development should also prioritize physical capabilities.

  • How does the script suggest AI systems can improve their physical intelligence?

    -The script suggests that AI systems can improve their physical intelligence through the use of simulation, where large amounts of data can be generated to train AI in various physical tasks. This data can then be transferred to real-world applications.

  • What is the importance of hardware in the development of AI physical intelligence as discussed in the script?

    -The importance of hardware in the development of AI physical intelligence is highlighted by the need for hands with touch sensing and the development of tools to design better hands for specific tasks. The script emphasizes that hardware, perception, and control algorithms are all crucial for achieving physical intelligence.

  • How does the script relate the evolution of life on Earth to the development of AI?

    -The script relates the evolution of life on Earth to the development of AI by drawing a parallel between the long evolutionary development of sensory motor skills and the relatively recent development of language and reasoning. It suggests that AI should also prioritize the development of physical intelligence before focusing on higher cognitive functions.

  • What is the speaker's stance on the future of AI and physical intelligence?

    -The speaker believes that physical intelligence is a critical foundation for true artificial intelligence. They argue against the idea that AI can be fully realized without embodied intelligence and advocate for a 'full stack' approach that includes control, perception, and hardware in the development of AI.

Outlines

00:00

🤖 The Paradox of AI: Chess vs. Door Opening

The speaker begins by questioning the intelligence required for complex tasks like playing chess versus simple tasks like opening a door. They highlight the irony that while AI has surpassed human capabilities in chess two decades ago, AI's ability to perform seemingly simple tasks like opening doors remains challenging. The speaker introduces the concept of the 'Model X Paradox,' which suggests that tasks humans find easy, such as sensory-motor skills, are difficult for robots, while tasks involving reasoning, like chess, are easier for AI. The paradox is illustrated with examples of AI's success in complex games and the struggles of robots in the DARPA Grand Challenge finals to perform basic physical tasks.

05:02

🌱 Evolution's Emphasis on Sensory-Motor Skills

The speaker delves into the evolutionary timeline, noting that life began with single-cell organisms and evolved to apes capable of simple sensory-motor tasks over billions of years. The evolution of humans and the development of language occurred much more recently, suggesting that nature has prioritized sensory-motor skills over language and reasoning. The speaker uses this timeline to emphasize the disparity in AI development, where systems have advanced rapidly in language understanding but struggle with physical intelligence. They also discuss AI's current capabilities in language and image generation, showcasing AI's prowess in areas that are a tiny fraction of evolutionary time.

10:04

🤸‍♂️ Overcoming the Model X Paradox with Simulation

The speaker addresses the ongoing challenge of the Model X Paradox, where AI excels in language but falls short in physical tasks. They argue that physical intelligence is crucial for true AI and cannot be bypassed. To tackle this, the speaker's lab uses simulation to generate extensive data and train AI in sensory-motor skills. They demonstrate how simulated training can transfer to real-world scenarios, such as robots navigating various terrains and manipulating objects. The speaker also touches on the importance of hardware design, showing experiments with hands that can sense touch and are optimized for specific tasks.

15:04

🍰 The Full Stack Approach to Physical Intelligence

In the final paragraph, the speaker summarizes the dichotomy between artificial and natural intelligence, emphasizing the need for a 'full stack' approach to physical intelligence. They argue that while AI has made significant strides in language understanding, true intelligence requires embodiment and physical capabilities. The speaker advocates for the development of physical intelligence as a foundation for achieving artificial general intelligence. They conclude by encouraging more focus on physical intelligence amidst the current hype around AI, suggesting that without building this foundation, AI's potential remains unfulfilled.

Mindmap

Keywords

💡Artificial Intelligence (AI)

Artificial Intelligence refers to the simulation of human intelligence in machines that are programmed to think like humans and mimic their actions. In the context of the video, AI is discussed in relation to its capabilities and limitations, especially when compared to human intelligence. The video highlights how AI systems have surpassed human performance in complex games like chess and Go, yet struggle with simple physical tasks such as opening doors.

💡Model X Paradox

The Model X Paradox is a concept that emerged from observations that tasks which humans find easy, such as walking or opening a door, are difficult for robots, while tasks that seem complex to humans, like playing chess, are relatively easier for AI. The video discusses this paradox to illustrate the dichotomy between what humans perceive as hard and what robots actually find challenging, emphasizing the need for a better understanding of physical intelligence in AI development.

💡Sensory Motor Skills

Sensory motor skills involve the integration of sensory information with motor output to execute tasks. The video explains that sensory motor skills require 'enormous compute,' suggesting that they are computationally intensive and complex for AI to replicate. Examples from the script include robots struggling with tasks like opening doors or walking, which are simple for humans but represent a significant challenge for AI.

💡Language Models

Language models are AI systems designed to understand and generate human language. The video mentions that AI has made significant progress in language understanding by training on large amounts of internet data. These models can predict words and generate text, as illustrated by the script's examples of AI systems making predictions about word sequences.

💡Physical Intelligence

Physical intelligence refers to the ability of an organism or machine to interact with its environment through physical actions. The video argues that physical intelligence is a critical yet underdeveloped aspect of AI. It contrasts the advanced state of language models with the more primitive state of AI's physical capabilities, as seen in the challenges AI faces with tasks like manipulating objects or navigating terrains.

💡Evolution

Evolution in the video is used to describe the process by which species develop and change over time. It is used to draw a comparison between the time scales of natural evolution and the rapid development of AI. The speaker points out that while natural evolution has spent billions of years perfecting sensory motor skills, it has only had a relatively short time to develop complex language and reasoning, suggesting that AI should also prioritize physical intelligence.

💡Simulation

Simulation in the context of the video refers to the use of virtual environments to train AI systems. The speaker discusses how simulation allows for the generation of large amounts of data and the training of AI in various scenarios. This is exemplified by the video's mention of robots learning to walk in simulation before transferring those skills to the real world.

💡Generalization

Generalization in AI refers to the ability of a system to apply learned behaviors to new, unseen situations. The video emphasizes the importance of generalization in physical intelligence, as robots must be able to perform tasks in a variety of environments. The script provides examples of robots trained in simulation that can generalize their walking behaviors to different real-world terrains.

💡Manipulation

Manipulation in the video pertains to the physical act of controlling and moving objects. It is highlighted as a complex aspect of physical intelligence that AI needs to master. The video gives examples of AI systems learning to reorient objects in simulation, which is a form of manipulation that requires understanding of object properties and fine motor control.

💡Hardware

Hardware in the context of the video refers to the physical components of robots, such as hands or limbs. The speaker argues that in addition to software and algorithms, the design of hardware is crucial for achieving physical intelligence. The video discusses the importance of touch sensing and the development of hands optimized for specific tasks, illustrating the interplay between hardware design and AI capabilities.

Highlights

Artificial intelligence has surpassed human capabilities in complex games like chess and Go, yet struggles with simple tasks such as opening doors.

The dichotomy between tasks perceived as hard by humans and those challenging for robots is known as the Moravec's paradox.

Sensory motor skills, which are effortless for humans, require enormous computational power for robots.

The evolution of intelligence has prioritized sensory motor skills over language and reasoning, which are relatively recent developments.

Current AI systems excel at language understanding but lack physical intelligence, which is crucial for everyday tasks.

Simulation is a key technique used to generate data and train robots in various environments and tasks.

AI systems can be trained in simulation to perform complex physical tasks and then transferred to real-world applications.

Physical intelligence involves not just control algorithms but also perception and hardware design.

The development of hands with touch sensing capabilities is crucial for robots to interact with their environment effectively.

Hardware design, such as the creation of hands optimized for specific tasks, is a significant aspect of physical intelligence.

The speaker argues that physical intelligence must be developed before achieving true artificial intelligence.

The talk emphasizes the importance of building physical intelligence, which is often overlooked in favor of language and reasoning capabilities.

The Model X paradox, observed in 1988, is still relevant today, highlighting the persistent challenges in physical intelligence for AI.

The speaker's lab is working on techniques to enhance physical intelligence, including simulation and hardware optimization.

The future of AI may depend on our ability to develop physical intelligence, which is essential for robots to perform everyday tasks.

The talk concludes with a call to action for the AI community to focus on building physical intelligence as a foundation for advanced AI capabilities.

Transcripts

play00:04

[Music]

play00:11

thank you

play00:13

so let us get started with a question

play00:17

what do you think requires more

play00:19

intelligence

play00:21

playing the game of chess

play00:23

or opening

play00:25

a door

play00:29

right I mean many times we think that

play00:33

chess is a matter of Genius

play00:36

so if chest was actually hard to do

play00:39

then building machines which can play

play00:42

chess should also be way harder than

play00:45

building machines which can open doors

play00:48

but let's see what we have managed to do

play00:51

in artificial intelligence

play00:54

you know we probably heard about AI

play00:57

systems surpassing humans at the game of

play01:00

chess

play01:01

this is not today but 20 years ago

play01:04

right since then we have had AI systems

play01:07

which can play complex multiplayer games

play01:10

and surpass humans even at this complex

play01:14

game of Go

play01:17

but now let's you know take a look at

play01:20

opening doors

play01:26

now you might think I am showing you bad

play01:28

videos

play01:29

but let me assure you these are the best

play01:32

teams competing in the DARPA Grand

play01:36

finals

play01:38

just

play01:39

seven years ago

play01:41

you know doing simple things like

play01:43

opening doors climbing stairs is

play01:46

actually very hard

play01:47

right

play01:49

okay so you know let's try to understand

play01:52

why is this the case

play01:54

so there seems to be this dichotomy

play01:56

between what we think is hard and what

play02:00

our robots find to be hard

play02:03

and in this talk you know what I'm going

play02:06

to communicate to you

play02:07

is that human intuition of what we think

play02:10

is hard really gets in our way

play02:13

and you know kind of stops us from

play02:15

really building intelligent systems

play02:18

now many scientists have wondered about

play02:21

this you know I'll start with a quote

play02:23

from Hans marawick you know one of the

play02:25

people who thought about AI quite a lot

play02:27

and here is what he has to say

play02:31

you know reasoning

play02:34

requires very little computation you

play02:37

know reasoning like the kind of

play02:38

reasoning you have in chess

play02:40

right but sensory motor skill requires

play02:43

enormous compute

play02:45

it will quote another scientist you know

play02:47

Stephen Pinker from Harvard

play02:50

the main lesson of 35 years of AI

play02:53

research

play02:57

is that hard problems like chess are

play02:59

actually easy

play03:00

and easy problems like walking and

play03:02

opening a door are actually hard

play03:06

these observations came to be known as

play03:09

the model X paradox

play03:11

you know some people have gone ahead to

play03:14

speculate

play03:15

right that machines or AI systems are

play03:20

going to do jobs which we think are

play03:22

cognitively challenging quite soon

play03:24

for example being a board member being a

play03:27

data analyst or being creative and

play03:30

making paintings

play03:31

but jobs which require physical

play03:34

intelligence

play03:35

are going to be not being able to done

play03:38

by machines for a long time to come

play03:42

now why is this

play03:44

and the reason is

play03:46

that we are

play03:48

least aware of things that we do very

play03:52

well for example our heart is beating we

play03:55

are breathing but are we aware of it

play03:57

when you walk are you aware of it you

play04:00

know these systems are working all the

play04:01

time they're Flawless you don't even

play04:04

think about them right this is what was

play04:06

quoted by Marvin Minsky you know one of

play04:09

the co-founders of the MIT is AI lab and

play04:12

a turing Award winner

play04:14

right and then he goes on to say that we

play04:16

are more aware of simple processes that

play04:21

do not work well

play04:24

and when he says simple again think of

play04:26

chess

play04:28

and this is not an abstract concept I

play04:31

think all of us have experienced this

play04:34

model X Paradox in our lives

play04:36

you know imagining riding a bicycle

play04:39

you know when you've learned how to ride

play04:41

a bicycle very early on you probably

play04:44

paid attention to every you know every

play04:46

movement of your foot where is the

play04:48

handle going

play04:50

but after some time it becomes natural

play04:52

it becomes intuitive you don't even

play04:54

think about it

play04:57

so let's take this idea and you know

play04:59

apply it to and use it to understand the

play05:02

evolution of intelligence

play05:04

so life started you know some 3.7

play05:06

billion years ago with single cell

play05:09

organisms

play05:11

then it took around you know 3.7 billion

play05:14

years to come to Apes now what can apes

play05:18

do simple sensory motor stuff you know

play05:21

like uh hanging from branches you know

play05:24

picking up a fruit throwing it so on and

play05:27

so forth

play05:28

then it took you know a few million

play05:30

years or 20 million years for humans to

play05:32

evolve

play05:33

then a few million years for language to

play05:36

come in

play05:37

and then you know we are just 50 000 to

play05:41

150 000 years from when language started

play05:45

so maybe there's a lesson over here also

play05:47

that Evolution spend a lot of time

play05:50

evolving sensory motor skills and

play05:54

relatively very very little time

play05:56

developing language or reasoning that we

play05:59

think is complex today

play06:00

no just to give a sense of these numbers

play06:03

you know imagine that the origin of

play06:05

Earth we are describing it in one day

play06:08

right so we have start of the you know

play06:10

Earth is born at midnight and they're

play06:14

going to look at you know our 24 hours

play06:17

period

play06:18

so language is just 10 seconds old

play06:24

right

play06:25

humans are just one minute 26 or one

play06:28

minute 20 seconds old

play06:30

apes or maybe six minutes old

play06:33

but life started 20 hours ago

play06:36

so that gives you the sense of how much

play06:39

time it took to get to these simple

play06:41

skills

play06:42

now what is this implication for

play06:44

building a robot

play06:46

so I for one you know want to have a

play06:48

robot which can do the mundane thing

play06:50

that I do at my house today

play06:52

right now if I say to the robot you know

play06:55

make me dinner

play06:57

the first thing the robot needs to

play06:58

understand is you know what is sinner

play07:00

what do I eat what are the recipes and

play07:03

how to make it right now this is what

play07:05

language might give us right

play07:09

now there's another part which is

play07:11

physical intelligence which is how do I

play07:13

actually make dinner which I have to

play07:14

chop vegetables so on and so forth

play07:17

so now let's look back at our timeline

play07:21

so how much time it took for language

play07:23

understanding 10 seconds how much time

play07:26

for physical intelligence maybe 20 hours

play07:30

right so what what have we done in AI

play07:32

today is you know we have taken Lots

play07:36

amounts of data from the internet and

play07:38

developed very capable systems which can

play07:41

understand language

play07:42

you know just to give you a very quick

play07:44

summary of how they work

play07:45

you know so these systems are called as

play07:47

language models they consume a lot of

play07:50

data and then given a few words they try

play07:53

to predict what words are going to come

play07:55

next right for example you know the

play07:58

question is Coke is in and then the AI

play08:02

system can you know make a prediction of

play08:05

what the next words are going to be

play08:06

right here is you know one prediction

play08:08

made by the system

play08:10

let us ask it a different question and

play08:12

see what the prediction is

play08:15

you know sounds very reasonable right

play08:18

maybe let's ask another question

play08:21

and let's see what the answer is

play08:24

you know maybe maybe a bit nonsensical

play08:27

but but the point is you know yes you

play08:29

know there are a few aberrations but

play08:32

these systems are becoming really really

play08:34

good

play08:35

and we can also hook up images with

play08:37

these systems it's just not about

play08:38

language right for example you know we

play08:41

can ask an AI system to generate an

play08:43

image of a 2 2 on a stroll with a dog

play08:45

right something which probably is you

play08:48

know we never imagined right or

play08:50

something like draw images

play08:52

of an avocado chair

play08:55

right and these systems can do it

play08:58

so

play08:59

what does this mean that we have such

play09:01

good language understanding in context

play09:04

of building robots that we could you

play09:06

know be in my house

play09:08

for the model X Paradox was made in you

play09:11

know 1988 after observing 35 years of AI

play09:15

research now we are almost in 2023 which

play09:19

means it is almost 35 years since then

play09:22

right and let's look at an attempt you

play09:25

know to build a robotic system to

play09:27

replicate some household phase so this

play09:30

is a very impressive system you know put

play09:32

out by Google you know some time back

play09:35

and the question is

play09:36

you know someone spilled the Coke and

play09:39

they want the robot to clean the mess

play09:41

so let's look at you know what the robot

play09:43

ends up doing

play09:45

right it realizes it needs to find a

play09:47

Coke it goes it grasps this Coke can

play09:50

and then it tries to throw it in the

play09:53

trash can

play09:53

[Music]

play09:58

but cannot throw it

play10:01

right then it moves ahead and says hey

play10:03

you know I need something to wipe off

play10:05

the table so I'm going to pick up this

play10:08

pad and take it

play10:10

to go wipe the table

play10:13

foreign

play10:29

okay so what is the lesson the Moto X

play10:31

Paradox still triumphs

play10:33

right 35 years have passed

play10:36

you know and still the same problem

play10:38

exists

play10:39

right now I don't want to be here 35

play10:41

years from now and tell you the same

play10:43

thing

play10:45

right

play10:46

so we we need to fix this

play10:49

so now what what actually is the problem

play10:51

right the problem is you know to get to

play10:53

language understanding which is

play10:55

equivalent of 10 seconds of evolution we

play10:57

are pretty much consumed all of the

play10:59

internet

play11:00

right now how are we going to go to

play11:03

sensory motor skills

play11:06

so you know some people think that hey

play11:08

you know maybe we can get to artificial

play11:11

intelligence without doing physical

play11:14

intelligence

play11:15

now I can talk a lot about this right

play11:18

but in interest of time I was going to

play11:20

tell you my bet my bet is no

play11:23

right and these paradoxes that we have

play11:26

you know seen so far also happen in

play11:29

physical intelligence and this is what

play11:31

it you know makes physical intelligence

play11:33

challenging right to give you an example

play11:35

you know consider a robot doing a

play11:38

backflip

play11:39

impressive right

play11:42

but you know what about

play11:44

the behavior of walking

play11:46

seems very simple

play11:49

but when you do a backflip you know

play11:51

maybe what you're doing is a specialized

play11:53

motion that you only have to reason

play11:55

about your own motor system

play11:58

but when I'm walking then I have to walk

play12:00

on many different terrains so I need to

play12:02

reason about the environment for these

play12:05

systems have to generalize through a

play12:07

large variety of terrains and this is

play12:10

what makes them challenging

play12:13

like

play12:15

so

play12:16

you know so the question which me and my

play12:18

lab are trying to look at is you know

play12:21

how do we get to physical intelligence

play12:24

and I'm going to briefly now tell you

play12:26

you know some of the ideas and

play12:28

techniques that we have been using right

play12:31

for the one thing we heavily make use of

play12:34

is simulation

play12:36

you know because in simulation we can

play12:38

generate lots of data right in three

play12:40

hours we can generate you know 100 days

play12:43

worth of data right so this is you know

play12:46

an example where you can simulate Many

play12:48

Many Robots in parallel

play12:50

then in simulation you know they can

play12:52

learn how to walk

play12:55

right so these are some gates that we

play12:57

have learned

play12:58

now once we learn these walking

play13:00

behaviors in simulation then we take

play13:02

them and transfer them into the real

play13:04

world and by real world I mean different

play13:07

kinds of terrains so it might be stairs

play13:09

going on certain obstacle or walking on

play13:11

sand

play13:13

so here are you know some results of

play13:15

these systems which were trained in

play13:16

simulation but then deployed

play13:19

you know in the real world right over

play13:21

here this data is running fast but it's

play13:24

not just fast it can go on these

play13:27

challenging terrains and still be stable

play13:30

or for example over here it tries to go

play13:34

under an obstacle

play13:38

right so it has to crouch before it can

play13:40

go beneath

play13:41

or you know for example you know going

play13:44

up this Gravelly Hill

play13:50

you know sometimes when the robot is

play13:53

doing you know these behaviors the

play13:55

environment is not you know forgiving

play13:58

you know for example once when you're

play14:00

running this robot outside in this

play14:02

building one of the screws underneath

play14:05

came off

play14:06

so what you now see is you know this

play14:08

robot is limping in a way but still

play14:11

walking and this is the kind of

play14:13

robustness and generalization that we

play14:15

hope to achieve

play14:18

and this is not just in context of

play14:20

locomotion you know we can also think

play14:23

about in context of manipulation right

play14:26

for example you know things that we do

play14:28

every day right we pick up tools we use

play14:30

them right and we keep doing it all the

play14:33

time sometimes for a purpose and

play14:35

sometimes you know just for fun

play14:38

right and sometimes you know we do it

play14:40

because we have to do it

play14:43

so what we can do is you know we can

play14:45

also run simulations where we have you

play14:47

know lots of hands you know stimulating

play14:50

you know this task of reorienting

play14:52

objects because it is needed to perform

play14:55

a downstream manipulation task

play14:57

and then we can take

play14:59

you know this learned system in

play15:00

simulation and transfer it into the real

play15:03

world

play15:03

right and the way this system is going

play15:06

to work is it's going to have a camera

play15:07

and going to give some commands to the

play15:10

fingers to move right this is just a

play15:12

side view so you have a sense of what

play15:14

I'm going to show you

play15:16

and we're going to evaluate on new

play15:18

objects so the system has never seen

play15:19

before

play15:21

so you know for example over here on

play15:24

your top right is a goal orientations

play15:26

and you know let's look at what the

play15:28

system ends up doing

play15:39

so it tries to reorient this object to

play15:41

the Target which is shown on the top

play15:43

right

play15:45

so again you know this is some examples

play15:47

showing that in simulation we can

play15:50

leverage large amounts of data and then

play15:53

use it to perform things that seem

play15:55

simple to humans but are actually quite

play15:58

complex

play16:01

now you know building these you know

play16:03

physical intelligence or these systems

play16:05

is not just about the control algorithm

play16:08

it's also a lot about the hardware you

play16:11

know for example the hand that I showed

play16:12

you had no touch sensing right so we

play16:16

need to you know activate some more

play16:19

modalities so it can start you know

play16:21

sensing where it makes contact

play16:24

right

play16:25

and you know we have been running you

play16:27

know some experiments with you know

play16:28

doing problems like hey if I give you an

play16:30

object you know can you feel you know

play16:34

what the object is

play16:35

right and you know then if I close the

play16:39

lights so it's just dark can you go and

play16:41

find that object again

play16:43

right so for example now we put more

play16:46

objects shut off the light and the Hand

play16:48

still has to find these objects just

play16:50

based on touch sensing

play16:52

the other question you can ask is well

play16:54

is the design of this hand good

play16:58

maybe maybe not right so we're

play17:00

developing tools which can help us

play17:02

design you know better hands so I'm

play17:04

showing you four examples of four

play17:06

different hands that we're able to

play17:08

design which were optimized for the

play17:09

particular task

play17:11

right so for example over here you know

play17:13

here's a hand which can cut things

play17:15

and you know it can you know use

play17:18

scissors to decide it can cut paper but

play17:20

not cut acrylic

play17:22

right so in summary

play17:25

right it is not just about control but

play17:28

also thinking about perception and also

play17:31

thinking about Hardware right and we

play17:33

need to think in a full stack way to

play17:36

approach physical intelligence

play17:39

so to end what I discussed was this

play17:42

dichotomy between artificial and natural

play17:45

intelligence

play17:46

and what we think we have is the Cherry

play17:49

right which is 10 seconds worth of

play17:52

evolution you know models that we have

play17:54

trained on the internet

play17:56

but the question is where is my cake

play18:01

right and you know while there are some

play18:04

people who believe

play18:05

that you know we can just go on the

play18:07

Internet Train bigger models and not be

play18:10

embodied and get to artificial

play18:12

intelligence

play18:13

you know me and some other people I may

play18:16

be on the other camp we think we need we

play18:18

need to build the cake first to build

play18:20

physical intelligence before we can go

play18:24

to True artificial intelligence right

play18:27

and I hope now that more and more people

play18:30

think about physical intelligence

play18:33

especially that there is a lot of you

play18:35

know hype and a lot of excitement about

play18:38

this embodied intelligence going on

play18:40

right I think that hype is very good but

play18:44

we cannot you know have the cake without

play18:47

building the cake with that thank you

play18:51

[Applause]

Rate This

5.0 / 5 (0 votes)

関連タグ
Artificial IntelligenceMachine LearningModel X ParadoxSensory Motor SkillsCognitive ChallengesAI EvolutionRoboticsSimulationPhysical IntelligenceEmbodied AI
英語で要約が必要ですか?