Swarms of AI Agents STUN the Entire Industry | 100 Million Jobs GONE? | Devin, Maisa, Groq & more

AI Unleashed - The Coming Artificial Intelligence Revolution and Race to AGI
15 Mar 202426:39

Summary

TLDR该视频脚本讨论了人工智能软件工程师Devon的兴起,以及其他类似的AI代理的发展。视频中展示了Devon如何通过聊天界面和浏览器来执行复杂的编程任务,如设置项目和安装依赖。同时,探讨了AI代理在未来工作中的角色,包括提高工作效率和降低成本。此外,还提到了大型语言模型如GPT 4.5和Claude 3 Opus的进步,以及它们对各行各业的潜在影响。

Takeaways

  • 🤖 人工智能软件工程师Devon引起了广泛关注,许多人对其取代或增强软件工程师的能力持有不同看法。
  • 🚀 Devon是众多类似代理中的第一款,预示着AI代理时代的到来。
  • 📹 视频中展示了Devon如何通过聊天、命令行界面、身份验证和浏览器来设置项目。
  • 🛠️ Devon能够根据GitHub上的项目README文件自动规划和执行复杂的技术任务。
  • 🔍 专家Ethan Mik强调了与AI编码器交互的新方式,它可以在后台持续执行和调试想法。
  • 📈 AI代理的未来可能不会采用传统的问答模式,而是允许用户随时异步输入想法。
  • 💡 尽管AI系统还不能完全自主地无误工作,但它们是一个引人入胜的开始。
  • 🌐 介绍了其他AI产品,如Grock的LPU(语言处理单元)和MSA AI的KPU(知识处理单元)。
  • 📊 展示了AI代理在客户服务中的应用,如处理订单问题和提供即时反馈。
  • 🔧 讨论了AI代理如何改变业务运作,降低成本并提高效率。
  • 🌟 强调了AI技术发展迅速,新模型如GPT 4.5和zGPT即将推出,预示着AI能力的进步。
  • 📈 呼吁人们关注AI技术的发展,通过调查问卷分享他们对AI的看法和需求,共同探索未来的发展方向。

Q & A

  • Devon 是什么?

    -Devon 是由 Cognition Labs 开发的第一个 AI 软件工程师代理,它能够执行编程任务,并与用户通过聊天界面进行交互。

  • 人们对于 AI 软件工程师的未来发展有什么看法?

    -人们对于 AI 软件工程师的未来发展持有不同的观点,一些人认为它们可能会取代软件工程师,而另一些人则认为它们将增强软件工程师的能力。

  • McKay Wrigley 对 Devon 的使用体验如何?

    -McKay Wrigley 对 Devon 的使用体验感到非常震撼,他在视频中表示,使用 Devon 27分钟的体验让他确信 AI 代理的时代已经开始。

  • Devon 如何执行编程任务?

    -Devon 通过访问聊天界面、浏览器和命令行界面(CLI)来执行编程任务。它可以读取 GitHub 上的项目文档,根据 README 文件中的指示进行项目设置,并执行如克隆仓库、安装依赖等任务。

  • Ethan Mik 认为 Devon 的哪些特点最吸引人?

    -Ethan Mik 认为 Devon 最吸引人的特点是其新的交互方式,用户可以随时像与真人交谈一样与其对话,而它则在后台执行和调试用户的想法。

  • AI 系统目前是否能够完全自主地完成编程工作?

    -尽管 AI 系统如 Devon 展示了其在编程方面的潜力,但目前它们还不能完全自主地、无误地完成编程工作,仍需要人类的监督和帮助。

  • AI 代理在客户服务方面的应用有哪些?

    -AI 代理可以用于客户服务,例如通过聊天界面与客户沟通、解答问题、处理订单和支付细节等。它们可以减少人工客服的工作量,并提高处理速度。

  • Grock 是什么?

    -Grock 是一个专注于提高 AI 推理速度的公司,它们开发了一种名为语言处理单元(LPU)的芯片,旨在支持实时 AI 应用的快速响应。

  • MSA AI Mesa Ai 是一种什么样的 AI 代理?

    -MSA AI Mesa Ai 是一种由知识处理单元(KPU)驱动的 AI 代理,它可以处理多种任务,如理解电子邮件内容、加载订单文件、识别特定订单并回复客户等。

  • 人们如何看待即将到来的更先进的 AI 模型?

    -人们对即将到来的更先进的 AI 模型持期待态度,认为它们将能够自动化更多的工作,并可能在全球范围内影响大量的职位。同时,人们也在关注这些模型的发布和性能。

  • 未来人们需要什么样的技能来与 AI 合作?

    -未来,人们需要强化领域专业知识,并学会如何有效地利用 AI 技术来执行任务。编程可能不再是必需的技能,而是更注重于如何使用自然语言来指导 AI 完成复杂的工作。

Outlines

00:00

🤖 人工智能软件工程师的兴起

本段介绍了名为Devon的首个AI软件工程师,它正在引起广泛关注。人们对于AI是否会取代或辅助软件工程师持有不同看法。Devon正在逐步向公众推出,并且已经开始有人尝试使用。视频中,McKay Wrigley分享了他使用Devon 27分钟的体验,表示AI代理的时代已经开始。他通过Devon设置了GitHub上的chatbot UI项目,展示了Devon如何读取项目说明、克隆仓库、安装依赖,并执行复杂技术步骤的能力。这标志着代理产品的未来,其中AI可以在后台持续工作和调试想法。

05:02

🚀 AI代理的商业潜力和未来

这段内容讨论了AI代理在商业上的潜力,尤其是在客户服务方面的应用。虽然AI系统还不能完全自主地无误地完成工作,但它们是一个有趣的开始。视频提到了AI在电子商务和在线业务中的应用,以及AI如何帮助建立网站和营销活动。尽管AI不能替代人类进行销售和客户沟通,但它可以简化许多重复性和单调性的任务。此外,还提到了AI Jason的YouTube频道,他探讨了Gro技术,这是一种专注于提高AI推理速度的新技术,能够实时生成文本并转换成语音。

10:03

📈 AI技术的进步和商业应用

本段强调了AI技术在多个领域的快速进步,包括语言处理单元(LPU)和知识处理单元(KPU)的发展,这些技术使得AI能够更快地生成响应并执行复杂任务。视频举例说明了AI如何在客户服务中应用,例如处理订单问题和提供优惠券。此外,还提到了大型AI模型如GPT 4.5和Claude 3 Opus的性能和潜力,以及它们如何影响未来的工作和商业运作。

15:05

🌐 AI对社会和经济的影响

这一段讨论了AI如何改变社会和经济,特别是通过降低智能成本,使得每个人都能负担得起智能服务。视频提到了人工智能的普及可能带来的变化,例如在医疗和教育领域的应用,以及如何使专业知识更容易地与技术结合。此外,还强调了未来可能不再需要每个人都学习编程,因为AI将使得编程变得简单,人们可以专注于自己的领域专业知识。

20:05

📚 社区参与和未来规划

最后一段强调了社区参与的重要性,以及如何利用AI技术来满足社区成员的需求。视频制作者邀请观众参与调查,以了解他们对AI的看法和需求。此外,提到了未来的计划,包括提供帮助和资源,以及如何通过AI技术使每个人都能提升自己的技能。视频结束时,制作者表达了对社区的感激,并预告了即将到来的激动人心的计划。

Mindmap

Keywords

💡AI软件工程师

AI软件工程师指的是使用人工智能技术来辅助软件开发的工程师。在视频中,Devon作为第一个AI软件工程师,能够通过聊天界面接收任务并执行复杂的软件设置和开发工作,这标志着AI在软件开发领域的应用进入了一个新的阶段。

💡认知实验室

认知实验室(Cognition Labs)是开发Devon AI软件工程师的机构。在视频中,它代表了AI技术研究和开发的前沿阵地,推动了AI在软件工程领域的实际应用。

💡AI代理

AI代理是指能够执行特定任务或服务的人工智能系统。在视频中,AI代理不仅能够进行编程和软件开发,还能够处理客户服务和市场营销等业务流程,显示出AI技术在多个领域的应用潜力。

💡自然语言处理

自然语言处理(NLP)是人工智能领域的一个重要分支,它使计算机能够理解和生成人类语言。在视频中,AI代理通过NLP技术与用户进行交流,理解任务需求并作出响应。

💡实时交互

实时交互指的是AI代理能够即时响应用户的输入并进行交流。在视频中,AI代理不仅可以快速执行任务,还能够与用户进行实时对话,提供反馈和建议。

💡自动化

自动化是指使用技术来执行原本需要人工完成的任务。视频中的AI代理通过自动化流程,减少了人工介入的需求,提高了工作效率。

💡知识处理单元

知识处理单元(KPU)是一种专门为处理和生成知识而设计的计算单元。在视频中,KPU被用来提供高效的AI服务,如自动回答客户问题和处理订单。

💡Grock

Grock是一种基于语言处理单元(LPU)的AI技术,专注于提高AI推理速度,使实时AI应用得以实现。在视频中,Grock被用来构建能够进行实时语音转文本和文本转语音的应用程序。

💡多智能体代码生成

多智能体代码生成是指利用多个AI代理共同工作,通过迭代测试和优化来生成和改进代码的过程。在视频中,这种方法被用来开发能够处理复杂任务的AI系统。

💡GPT 4.5

GPT 4.5是OpenAI预计发布的一个先进的语言预测模型,它将具有更丰富的知识库和更高的处理速度。在视频中,GPT 4.5的发布被认为可能会自动化大量的工作岗位。

💡AI伦理

AI伦理关注的是人工智能技术的开发和应用中涉及的道德和社会责任问题。在视频中,尽管没有直接提到AI伦理,但随着AI技术的发展,确保其负责任和道德的使用变得越来越重要。

Highlights

Devon作为第一个AI软件工程师代理开始引起关注,人们对其取代或增强软件工程师的能力持有不同看法。

Devon正在逐步向公众推出,一些人开始首次使用它进行项目设置。

McKay Wrigley对Devon的使用体验感到震撼,认为AI代理的时代已经开始。

Devon拥有访问聊天、浏览器和命令行界面的能力,能够实时执行和调试任务。

Devon通过阅读项目README文件,自主规划并执行了设置项目所需的步骤。

Ethan Mik强调了与AI编码者交互的新方式,可以随时随地与其交流,让它在后台执行和调试想法。

AI系统目前还不能完全自主地无误工作,仍需人类帮助以确保项目质量。

AI代理可以24小时不间断地工作,类似于雇佣的员工,根据提供的任务和反馈持续推进项目。

AI代理的发展可能会改变传统的工作模式,使得许多任务能够通过简单的指令完成。

AI Jason通过视频展示了Grock技术,这是一种专注于提升AI推理速度的新型处理器。

Grock的LPU(语言处理单元)旨在支持实时AI应用,提升响应速度。

AI代理能够处理客户服务,如接听电话、回答问题、收集支付信息等。

AI代理的成本效益非常高,例如回答客户邮件的成本仅为21美分。

许多公司正在开发各种AI代理,旨在为不同行业提供定制化的智能解决方案。

大型科技公司的AI研究者们正在全球范围内合作,推动AI技术的发展。

未来,AI的发展可能会使得每个人都能以低成本获得高质量的智能服务。

Jensen Huang提出,未来人们不需要编程,而是通过自然语言与AI合作,这将是一个令人惊喜的上技能过程。

Transcripts

play00:00

so at this point you probably heard

play00:01

about Devon the first AI software

play00:03

engineer this AI agent has been making

play00:06

some news a lot of people have their own

play00:08

takes on how far this will go to either

play00:10

replace software Engineers or augment

play00:13

them it's slowly being rolled out to the

play00:14

public and we're beginning to see it

play00:17

being used by some people for the first

play00:19

time but here's the thing this one is

play00:22

the first of many because on the heels

play00:25

of Devon's announcement from cognition

play00:27

Labs a number of other similar agents

play00:30

are dropping buckle up so here's McKay

play00:33

Wrigley he's saying I'm Blown Away by

play00:35

Devon watch me use it for 27 minutes

play00:38

it's insane the era of AI agents has

play00:41

begun I'll link this below if you want

play00:42

to see it but let me just highlight a

play00:44

few portions of this video okay guys

play00:46

we're going to just jump in right away

play00:47

so I'm going to send this off to Devon

play00:49

right now so yes we are in Devon we are

play00:51

in cognitions New AI software engineer

play00:54

basically what I'm doing is I'm in the

play00:56

chat here and has access to a browser

play00:58

here you can see following is on again

play01:01

this indicator shows us live um just to

play01:03

draw your attention to a few other

play01:05

things here this pulsing blue dot with

play01:07

this grade text is basically indicating

play01:09

what Devon is up to right now is I'm in

play01:12

the chat here and I'm asking Devon to

play01:15

set up this project and what I did was I

play01:17

gave it a GitHub URL which links to my

play01:20

chatbot UI project and I basically want

play01:22

to see can it set it up like this is the

play01:25

read me for this project it's a fairly

play01:28

complex setup which is is something I'd

play01:30

expect a software engineer to be able to

play01:32

do basically you can see over here is

play01:34

our chat and then over here is Devon's

play01:36

workspace okay and so if we go into quad

play01:38

view here Devon has four things has

play01:40

access to our chat has access to a shell

play01:42

so it can run a CLI it has an ID which

play01:46

will be right here once that begins if

play01:48

it needs that step and it has access to

play01:50

a browser here and this just follows it

play01:52

live okay we can actually see like a

play01:54

history of what it's done right here

play01:55

which is pretty cool this pulsing blue

play01:57

dot with this gr text is basically indic

play02:00

what Devon is up to right now you know

play02:02

we're just sitting here waiting for it

play02:03

to complete tasks okay every once in a

play02:05

while Devon will actually this will turn

play02:07

orange and it will need information cre

play02:10

look at a few things so you can see over

play02:11

here Devon is in its planner so you can

play02:14

see that Devon has a planning step here

play02:17

here are all the steps I need to

play02:18

accomplish based on the task I was given

play02:21

right and it just came up with that list

play02:22

by itself so step one was cloning the

play02:25

repository from get a blink we gave it

play02:27

it ran get clone with the URL

play02:30

so it cloned that it finished and then

play02:32

it marked it as done in the planner then

play02:34

you can see this next step which it was

play02:36

doing while I was just talking was

play02:38

identify the package manager and

play02:39

installed the dependencies so what it

play02:41

actually did was it ran npm install here

play02:45

basically that's just installing a bunch

play02:46

of like prepackaged software and now

play02:48

it's just doing its thing so you can see

play02:50

we got a little message from Devon said

play02:52

I've clone the repo I'm reading the read

play02:54

me it's basically just constructing a

play02:56

plan based on reading the readme and

play02:58

sure enough like these are all the steps

play02:59

I would expect a regular software

play03:01

engineer to follow right CL on the repo

play03:03

read the instructions install the

play03:05

packages and then in this particular

play03:07

project we have to set up a super Bas CI

play03:09

so that's basically the back end

play03:11

basically we we want it to basically get

play03:13

to this step and just get to the app

play03:15

Devon's now trying to install super base

play03:18

which is pretty impressive this is a

play03:20

fairly complex technical step so

play03:23

basically using the command line here

play03:26

Devon's now downloading more or less our

play03:29

our database that we use for the app and

play03:32

the reason I'm recording this and the

play03:34

reason I'm so excited about this is

play03:36

because this really feels like the chat

play03:38

GPT moment for agent products so

play03:42

definitely sounds like pretty good

play03:43

feedback here's Ethan mik we've covered

play03:46

him and a few of his posts before on

play03:47

this channel so it seems like he's

play03:49

working a lot at kind of the

play03:51

intersection of working productivity and

play03:54

AI that whole blog post about the jagged

play03:56

Frontier and cyborgs and centaurs was

play04:00

his work he was one of the people in

play04:01

that study so looks like he's got a a

play04:03

book coming out co-intelligence which I

play04:05

would guess would be pretty good and

play04:08

he's saying this the thing that makes

play04:09

Devon the AI coder really interesting is

play04:12

the new modality of interaction you can

play04:14

talk to it at any time like you would a

play04:16

person and it just keeps chugging away

play04:18

in the background executing and

play04:20

debugging your ideas and so here's just

play04:22

kind of a time laps of the project that

play04:25

he was doing and this is true with a lot

play04:27

of previous ones even something like

play04:28

Chad Dev and autogen where you can have

play04:30

multiple ages kind of strung together

play04:33

working together even autogen I mean you

play04:35

can take turn speaking but it is kind of

play04:37

a turn based approach with Chad Dev you

play04:41

kind of give them the higher level task

play04:43

and they get to work they talk back and

play04:45

forth but this kind of interface is it

play04:48

seems like it's the first view of what

play04:52

the future of Agents will look like it's

play04:54

not going to be a back and forth you

play04:57

will kind of asynchronously whenever you

play04:59

have time or ideas just kind of dump

play05:01

them into the chat right the agents

play05:03

might even have multiple projects

play05:06

running potentially you kind of add

play05:07

information as needed and the agent is

play05:10

always working as long as it kind of has

play05:12

some more subtasks to complete it keeps

play05:14

going and and building them 24 hours a

play05:17

day I mean this is a lot more like you

play05:18

would with somebody that that works for

play05:20

you for example you know if you hired

play05:21

somebody to help you with a project you

play05:23

give them the top of task they start

play05:24

working you add feedback and any other

play05:27

information they might know but they're

play05:28

they're kind of working that whole time

play05:30

he's saying to be clear AI systems

play05:32

aren't quite there yet to do this work

play05:33

autonomously and airror free without

play05:35

help even afterward there's a waste to

play05:37

go before you would want to trust a

play05:39

major project to AI but it is a

play05:41

fascinating start nevertheless he's

play05:43

saying I can't code in Python it is

play05:45

doing everything autonomously based on

play05:47

my request for a website that explains

play05:49

how dilution Works in startup funding I

play05:52

can't help but feel excited I can't wait

play05:54

to see this done so I've shared a little

play05:56

bit of my background on this channel a

play05:58

couple times but I've been kind of in

play05:59

the e-commerce space online business ads

play06:02

whatever you want to call it for the

play06:03

last decade plus so I've built a lot of

play06:06

websites and Shopify stores and a lot of

play06:09

campaigns and and setup servers and

play06:11

tracking and analytics and just

play06:14

everything everything everything and

play06:15

I've done it so many many many times to

play06:19

where I mean it's it's it's fairly easy

play06:21

for me now but still if you want to be

play06:23

effective at something like this you

play06:24

have to be able to get through a lot of

play06:27

manually doing stuff right to set up an

play06:29

online store you know you can see that

play06:31

as 2 hours of your focused attention you

play06:34

want to set up a marketing campaign with

play06:36

landing pages you know that's 3 to 5

play06:38

hours depending on the complexity you

play06:40

know if upsells or whatever of your

play06:42

focused undivided attention we're

play06:44

getting closer to the time when a lot of

play06:46

that is going to be done with a simple

play06:48

sentence maybe over time you'll build

play06:49

kind of a copy and paste little cheat

play06:51

sheet of command prompts that of prompts

play06:54

that that work really well to create the

play06:56

right thing for you some sort of custom

play06:58

instructions and then you just say it

play07:00

and this thing goes and it builds it

play07:02

builds stores blogs landing pages so

play07:06

what though so what happens so if I set

play07:08

up a gym website like I still have to

play07:10

take the phone calls right when the

play07:12

customers come in and I still have to

play07:13

sell them on it right I have to convince

play07:15

them to buy whatever product that I'm

play07:18

selling that still work AI can't do that

play07:20

work well let's see so this is AI Jason

play07:23

AI Json uh is a YouTube channel and he

play07:26

kind of comes out of nowhere every once

play07:28

in a while and just drops some fire and

play07:30

so I do want to highlight this one his

play07:32

latest is talking about Gro he does a

play07:35

deep dive into Gro with a Q so that's we

play07:39

we've covered that a little bit I've did

play07:41

a little bit more research into grock

play07:42

since then and yeah it's the real deal

play07:45

you're you're on uh Cloud n it

play07:50

seems I don't know how people hate that

play07:52

guy but in this one Jason basically goes

play07:54

through talks about grock what is the

play07:56

lpu the language processing unit how

play07:58

it's different which that's what grock

play08:00

is basically and then later he like live

play08:02

just you know kind of uh on video builds

play08:05

this app that uses grock and text to

play08:08

voice and builds this little app and

play08:11

take a look at what this app does so

play08:13

this is like a Skype chat where he's um

play08:15

you know he visits the website whatever

play08:17

and then submits his information um you

play08:20

know so this is like a chat app which we

play08:22

know chat Bots can do chat apps okay but

play08:25

what happens next next he receives a

play08:27

call on his phone let's listen in my

play08:30

and is start calling this function

play08:31

called you can see it called a function

play08:33

called cor pers our phone and if I

play08:36

switch to my phone I actually do get a

play08:38

phone call from the AI agent so I can

play08:41

click accept to receive

play08:43

it hi Jason I'm reaching out from crazy

play08:46

Fitness regarding your interest in our

play08:48

gym memberships I'm here to provide you

play08:50

with all the information you need and

play08:52

answer any questions you might have

play08:54

thanks for calling no problem so what

play08:56

specific fitness goals are you looking

play08:58

to achieve uh I just want to lose some

play08:59

weight I haven't do exercise at all for

play09:01

the past three or six months since Co I

play09:05

totally get that our personalized

play09:07

training and nutrition programs can help

play09:09

you reach your weight loss goals let's

play09:11

discuss the membership options that are

play09:13

best for

play09:14

you okay sounds good I think I'm pretty

play09:17

happy to just go ahead and get the

play09:19

membership start this Saturday if

play09:21

possible great to hear that to complete

play09:24

the process I just need your credit card

play09:26

number to secure your

play09:28

membership

play09:30

okay so my credit account number is

play09:32

12341234 4231

play09:35

4231 thanks Jason you're all set see you

play09:38

on Saturday all right thanks byebye once

play09:41

I finish the phone call it will actually

play09:42

send the transcript back to the agent

play09:45

session so you can see that it has

play09:47

message just call the pur back below is

play09:49

a transcript and based on this

play09:50

transcript then the agent will be able

play09:52

to decide next step after set up this

play09:55

hook we will first add it then we will

play09:57

something like this this and calling the

play09:59

function called you can see it called a

play10:01

function called call Press about phone

play10:03

and if I switch to my phone I actually

play10:05

do get a phone call from the AIA as you

play10:07

can see here that entire thing has been

play10:09

handled by the agent the the

play10:11

conversation has been saved as a

play10:13

transcript now you can you know extract

play10:15

the credit card number out of that you

play10:18

can probably automate that although you

play10:19

know maybe initially just do it yourself

play10:21

so it doesn't mess up but the point is I

play10:23

mean that was a live call as far as you

play10:26

can tell it was kind of going back and

play10:28

forth pretty seamlessly he wasn't

play10:31

waiting on the AI to respond and that is

play10:34

new that hasn't really been around for

play10:36

that long where that could happen that

play10:38

quickly that fast so we're at a time

play10:40

when this AI can generate text and near

play10:44

instantly transcribed into voice and so

play10:48

the input the like as you say things

play10:50

that voice to text happens then the chat

play10:53

bot the llm generates a response then

play10:56

takes that response as text turns it

play10:58

into voice and outputs of back and all

play11:00

of that happens you know near instantly

play11:03

or at least at conversational speeds

play11:06

what is the thing that makes it possible

play11:09

well that's Gro so instead of a GPU a

play11:11

graphics Processing Unit you have an lpu

play11:13

a language processing unit and so

play11:15

they're saying grock is on a mission to

play11:16

set the standard for Gen AI inference

play11:19

speed so inference speed inference is

play11:21

the response right so how quickly does

play11:24

it generate its response helping

play11:26

realtime AI applications come to life

play11:29

today so are you picking up what I'm

play11:31

putting down so you have something like

play11:34

this building out your entire online

play11:36

ecosystem marketing campaigns websites

play11:40

capture forms whatever you have the

play11:42

customers call in your phone number or

play11:44

start a chat where the bot can talk to

play11:46

them talk back and forth to try to get

play11:47

them to either call in or just check out

play11:50

when they call in a lovely cheerful and

play11:53

excited voice pick up to answer all your

play11:56

questions handle your objections and

play11:58

collect your payment details now you

play12:00

might be thinking yeah but so what that

play12:02

still leaves the worst part you still

play12:04

have to deal with those customers like

play12:06

if those stupid customers mess up right

play12:08

they I don't know they put the wrong

play12:10

address somewhere or whatever like I

play12:12

still have to troubleshoot and deal with

play12:14

that don't I well let's see so here's

play12:16

MSA AI Mesa Ai and so they're powered by

play12:19

a kpu they have a paper on it I looked

play12:22

through it briefly so there's a little

play12:24

bit I think of of marketing speed going

play12:25

on here cuz the kpu the knowledge

play12:27

Processing Unit generally that's

play12:29

deserved for you know that's kind of how

play12:30

we name our chips right so this is

play12:33

basically just a what would you call it

play12:34

a software stack now some people are

play12:36

kind of calling them out like saying

play12:38

well unless we're missing something this

play12:39

is an architecture based on well-known

play12:41

patterns so like we we've seen this

play12:43

before they're well put together they're

play12:44

wrapped around an API and sold as a

play12:46

service the point is don't read too much

play12:48

into like the marketing thing but that

play12:51

doesn't mean it's not good and again

play12:54

this will keep happening and people will

play12:55

keep pointing it out and and yes

play12:58

Shenanigans can be annoying even the big

play13:00

tech companies are seemingly now

play13:02

engaging in various forms of shenanigans

play13:04

to make their product appear better than

play13:06

it is to Edge out the competition but at

play13:09

the end of the day is it good does it

play13:11

work so here's a demo of them helping a

play13:15

customer for question about an order

play13:16

that did not arrive this time the

play13:17

customer accidentally did not write the

play13:19

order ID correctly so basically this

play13:22

thing the kpu as they're referring into

play13:25

it I didn't dive too much deeper but it

play13:26

just sounds like you know they took a

play13:28

large l language model GPT 4 or whatever

play13:31

and they just they they added some

play13:33

things to it some some functionalities

play13:35

some code and whether it was brand new

play13:37

or not doesn't matter it just sounds

play13:39

like they they executed whatever they

play13:41

did pretty well and so now this thing is

play13:44

working kind of like deavon but whereas

play13:46

Davin is a little bit more for software

play13:48

development and stuff like that this is

play13:50

a little bit more multi-use so here the

play13:53

agent basically starts by understanding

play13:55

what's happening in the context of this

play13:57

email they're reading this email that

play13:58

they're ping from email. text and it

play14:00

really spells out like hey we got to

play14:02

really understand what this person wants

play14:03

next they're loading the orders CSV to

play14:07

see whether or not it's correct next

play14:09

upep they open up the orders file to see

play14:11

what orders are in there to try to find

play14:13

the customer file and then try to

play14:15

identify the specific order that the

play14:16

customer is talking about and then they

play14:18

find the order and they respond to the

play14:20

customer telling them that it's marked

play14:21

as being in transit in our system and

play14:24

saying because we are so sorry let me

play14:26

give you a coupon code for 20% discount

play14:29

on your next purchase with us and so

play14:31

this is just one small task that it can

play14:33

do you know you can expand and check its

play14:35

reasoning process to see if there's any

play14:38

mistakes that are that have been made or

play14:39

anything like that how much did it cost

play14:41

to answer that customer email well 21

play14:44

cents you know and 45 seconds if you

play14:47

want it run without explanations or 96

play14:49

seconds with explanations which I don't

play14:51

know if that even matters I mean when

play14:53

we're talking about answering customer

play14:54

emails I mean it could take hours and

play14:57

still be very much faster than the

play14:59

expectation there's a weight list for

play15:00

the Early Access and again so this is

play15:02

just one example of what's coming some

play15:05

people are dunking in the thing cuz they

play15:07

you know of their how they name things

play15:09

but regardless of what any one

play15:11

particular app or agent or company that

play15:13

creates them how well they do the point

play15:15

is there's a lot of them and they're all

play15:17

working to create an agent for every

play15:20

business the sheer amount of money

play15:22

somebody that can create an agent that

play15:24

does customer service or any other

play15:26

specific thing the amount of money that

play15:28

can be made is massive because companies

play15:30

are already spending massive amounts of

play15:31

money on that so if they're able to cut

play15:34

the cost you know 10% it's a no-brain

play15:37

for them with a lot of stuff like this

play15:38

they're cutting their cost 99 plus per.

play15:41

if you go from paying human customer

play15:43

service agents to something like this

play15:46

again what was it 21 cents to answer an

play15:48

email I mean that's massive and the

play15:50

people building all this stuff they're

play15:52

smart like really smart for example for

play15:55

recognition Labs they're saying our team

play15:56

is small and talent dense all founding

play15:59

team has 10 II gold medals II is of

play16:02

course the bad guys in Ready Player one

play16:06

so it's it's either that which which

play16:08

this would make them the bad guys or

play16:10

possibly it's the international Olympiad

play16:13

in informatics which is an annual

play16:15

competitive programming competition

play16:17

where the best and brightest from all

play16:19

over the world compete in various

play16:21

complicated tasks usually involving

play16:24

writing a computer program so that that

play16:26

probably makes more sense I would say

play16:28

that this makesense some really really

play16:30

smart and there are leaders and Builders

play16:31

who have worked at The Cutting Edge of

play16:33

Applied AI at companies like cursor

play16:34

scale AI lunch Club modal Google deep

play16:37

mine weo and neuro this was from January

play16:40

of this year agent coder multi-agent

play16:42

code generation with iterative testing

play16:43

and optimization and we have you know

play16:46

Hong Kong Shanghai London Sussex various

play16:49

Global researchers working on this stuff

play16:51

attempting various architectures to to

play16:54

build these agent coders now you might

play16:56

have heard the leaks about GPT 4.5 5

play16:59

turbo so it seems like potentially

play17:01

somehow there's an index paged in Google

play17:03

with open AI you know setting that

play17:05

announcement for releasing GPT 4.5 turbo

play17:09

in June or at least they're saying that

play17:11

it will have a knowledge cut off of June

play17:13

which could maybe sounds like that's

play17:14

when it's ready to be released by

play17:17

potentially secondhand rumor Sam Alman

play17:19

thinks GPT 4.5 will automate 100 million

play17:23

jobs globally also UCI University of

play17:26

California Irvine is launching zot GPT

play17:29

what is z GPT well it's built on top of

play17:32

open AI so it's using one of the GPT

play17:34

models from open AI it provides no cost

play17:36

access to various open AI models

play17:39

basically for faculty and staff this is

play17:41

the way back machine so this is kind of

play17:43

a snapshot of how this web page looked

play17:45

like when this was launched on the on

play17:47

March 12th they did list the model as

play17:50

GPT

play17:51

4.5 when they published this blog right

play17:54

in here they're kind of saying that the

play17:55

a AI model is gpc4 but they have more

play17:58

planned so so it sounds like they're you

play17:59

know talking to open AI open saying hey

play18:02

we have something coming and if you've

play18:05

been following some of my videos I don't

play18:06

remember if it was a few months back or

play18:08

a month ago late at night one night a

play18:11

lot of people found that including

play18:12

myself cuz I ran the test I actually

play18:14

recorded it but Chad GPT was convinced

play18:17

that it was running on GPT 4.5 turbo but

play18:21

whatever the case is the point is the

play18:23

bigger better models are coming very

play18:25

soon the next GPT the next Gemini even

play18:29

if Google and openi cease to exist well

play18:32

here's chatbot Arena the rankings for

play18:35

all of the different LM models by at

play18:37

this point 408,000 people that have

play18:40

tested them blindly so they they don't

play18:42

know which ones they're using and they

play18:43

just vote based on how good the results

play18:45

are so kind of this is a representation

play18:48

of what models people prefer so GPT 4 is

play18:51

on top but uhoh what's this we have a

play18:54

newcomer that I mean I'm going to go

play18:56

ahead and say this is number two right

play18:57

cuz the first models GPT 4 right so this

play19:00

is you know after GPT this is the next

play19:03

best model tra trailing by just a few

play19:06

points here that's not that big of a

play19:08

difference all right and it's Claude 3

play19:11

Opus that's the model that everybody was

play19:13

freaking out cuz it was so self-aware

play19:15

they one to describe itself as some sort

play19:17

of a hyper intelligent octopus with

play19:20

infinite tendrils lashing out in all

play19:22

directions which you got to give it

play19:23

credit for style what's crazy is that

play19:26

CLA 3 Sonet the smaller model also Opus

play19:29

is the the top this one's the next down

play19:31

and then the ha coup I believe is the

play19:33

next one but Opus beats out Bard Gemini

play19:37

Pro that's connected to the internet

play19:38

that's online and clae 3 Sonet beats out

play19:41

the older versions of GPT 4 and also the

play19:44

mistal models and this is important to

play19:47

understand so we're seeing progress

play19:48

across everything we're seeing progress

play19:51

across chips and how fast they respond

play19:53

we see progress across the agent

play19:56

architecture kind of how we're building

play19:58

them a long time ago the smartest people

play20:00

in the world were working on various

play20:02

physics projects nuclear energy Etc

play20:05

somewhat more recently the smartest

play20:07

people in the world were working on how

play20:09

to get you to click on ads how to get

play20:11

you engaged in various content online

play20:15

and now all those smartest people and

play20:16

there's a lot more of them now when they

play20:18

can all talk to each other globally all

play20:20

of them are working on this AI agents

play20:24

chips anything in this infrastructure

play20:26

where they can be useful this is what

play20:27

they're building one things that a lot

play20:29

of you in the comments have asked

play20:31

repeatedly this is kind of a theme that

play20:33

I see is like what do we do what do we

play20:35

build how do we sort of you know what do

play20:38

we do we don't want to sit on the

play20:39

sidelines we want to go out there we

play20:40

want to create stuff and I know a lot of

play20:42

you are building stuff and it's very

play20:44

impressive so I posted this very

play20:46

recently just an hour ago as I'm

play20:48

recording this and it's basically asking

play20:49

people to answer a few questions for me

play20:51

to understand what they're looking to do

play20:54

with AI what they're struggling with so

play20:57

many of you responded that it blew past

play20:59

my free tier limmit at type form so I

play21:02

just updated it so if you've tried to

play21:04

get in and it said it's broken or

play21:06

whatever try again we should have plenty

play21:09

of room now so I'll post this link below

play21:12

if you can take two minutes share some

play21:14

of your thoughts with me I would greatly

play21:16

appreciate it it's not a weird thing to

play21:18

harvest some data from you your email or

play21:20

anything like that I just literally just

play21:21

want to know what you think because I'm

play21:23

getting a lot of people reaching out to

play21:25

me that are Engineers Builders various

play21:28

teams they have some interesting ideas

play21:30

about what they're building or what they

play21:31

could build so if some of that's useful

play21:33

for us as a community I'll bring them in

play21:35

put them in front of you I'm not sure if

play21:37

everybody noticed but we haven't had any

play21:39

sponsorships on this channel that's not

play21:41

cuz people aren't asking to sponsor this

play21:43

Channel That's cuz I'm saying no and

play21:45

that's largely because we have an

play21:47

amazing community and I have a few ideas

play21:49

of how to involve all of you in deciding

play21:53

where we all go next and to all the

play21:56

people that feel a little bit confused

play21:57

that a little bit ious about taking

play21:59

advantage of the stuff that's happening

play22:01

but not quite sure which direction to go

play22:03

I have some ideas this survey will help

play22:06

me refine some of my thoughts and we'll

play22:08

we're going to be announcing some really

play22:09

cool things in the upcoming days or next

play22:13

week perhaps so really fast just a few

play22:15

quick quotes here's Andrew a talking

play22:17

about intelligence the power to apply

play22:19

skills and knowledge to make good

play22:24

decisions we invest years of our lives

play22:26

and trillions of dollars on education

play22:29

all to develop our ability to make

play22:31

better decisions human intelligence is

play22:34

very expensive this is why only the

play22:36

wealthiest Among Us can afford to hire

play22:38

huge amounts of intelligence like that

play22:41

specialist doctor to carefully examine

play22:43

think about and advise you on a medical

play22:45

condition or a tutor that can truly take

play22:47

the time to understand your child and

play22:49

gently coach them where they need help

play22:51

but unlike human intelligence artificial

play22:55

intelligence can be made cheap so a

play22:59

opens up the potential for every

play23:01

individual to hire intelligence

play23:05

inexpensively so that you no longer have

play23:07

to worry about that huge bill from going

play23:10

to see a doctor for falling sick or for

play23:13

getting an

play23:14

education and you'll be able to hire an

play23:18

army of smart well-intentioned

play23:20

well-informed staff to help you think

play23:23

things through now think about what we

play23:25

were talking about earlier about you

play23:26

know for example building an online St

play23:28

or like some sort of a gym website where

play23:30

people input their data their phone

play23:32

numbers then the sales agent calls them

play23:35

in the past you needed money you need

play23:37

the capital to buy intelligence right

play23:40

cuz intelligence was only really

play23:41

available in humans and we're lazy and

play23:43

cranky and we sleep 8 hours a day we're

play23:46

expensive intelligence is expensive

play23:48

because it's scarce what happens when it

play23:50

gets cheaper or specifically what

play23:52

happens where it's more accessible what

play23:54

happens when you and I we can have

play23:56

Intelligence on demand intelligence to

play23:58

complete various tasks that either we

play24:01

can't do or maybe you know we could do

play24:03

but we don't have enough time or it's

play24:04

too monotonous or whatever if we could

play24:07

just spin up these agents and they would

play24:09

go and take care of it for us how much

play24:11

easier would life be would it be easier

play24:13

to do research and run a business and

play24:16

make sure all your bills are paid have

play24:18

something that keeps track of your

play24:20

calendar that used to be only available

play24:22

to the wealthiest of people hiring

play24:24

people hiring assistants was not

play24:26

available to everybody and now

play24:28

potentially in the future it would

play24:30

become more and more accessible here's

play24:32

Jensen hang so he is the CEO the founder

play24:36

of Nvidia and here he's talking about

play24:38

whether or not you will need to learn

play24:40

how to program now we'll do a video on

play24:43

this in a little bit because I kind of

play24:44

want to dive deeper into this but notice

play24:47

what he says will be important when we

play24:49

don't need to program what will be the

play24:51

skill that is needed if the text stuff

play24:54

becomes infinitely easier it's it's

play24:56

going to sound completely opposite of

play24:58

what people feel over the course of the

play24:59

last 10 years 15 years um almost

play25:01

everybody who sits on the stage like

play25:03

this would tell you it is vital that

play25:06

your children learn computer science um

play25:08

everybody should learn how to program

play25:10

and in fact it's almost exactly the

play25:11

opposite it is our job to create

play25:14

Computing technology such that nobody

play25:17

has to program and that the programming

play25:19

language is human everybody in the world

play25:22

is now a programmer this is the miracle

play25:25

of artificial intelligence the countries

play25:27

the people that understand and how to

play25:29

solve a domain problem in digital

play25:32

biology or in education of young people

play25:35

or in manufacturing or in farming those

play25:37

people who understand domain expertise

play25:40

now can utilize technology that is

play25:43

readily available to you you now have a

play25:45

computer that will do what you tell it

play25:46

to do it is vital that we upskill

play25:48

everyone and the upskilling process I I

play25:51

believe will be delightful surprising so

play25:54

it's that idea of domain expertise you

play25:56

need to be good at whatever it is that

play25:58

you chose to be good at ni AI provides

play26:01

all the other things that you might need

play26:03

it provides the intelligence to carry

play26:04

out those tasks provides the tech

play26:06

abilities to build some of the tech that

play26:08

you need so as he says coding is just

play26:11

done in natural language but I think the

play26:13

really big Point here is that we all

play26:15

need to upskill and potentially not in

play26:17

the same skills that we're used to I

play26:20

will on this channel do my best to try

play26:22

to provide whatever help I can again

play26:24

fill out that survey down below so I

play26:25

know what you need and thank you for for

play26:28

being part of this community I seriously

play26:30

mean that my goal is to hopefully get to

play26:32

know a lot of you better we have some

play26:34

big things planned I'll leave it on that

play26:36

my name is Wes rth and thank you for

play26:38

watching

Rate This

5.0 / 5 (0 votes)

相关标签
AI编程Devon代理技术进步软件工程师智能助手行业变革未来趋势编程教育智能交互企业效率