Should you still learn to code? (ft. Devin)

Luke Barousse
21 Mar 202409:31

Summary

TLDRThe video delves into the future of jobs amidst advancements in AI, spotlighting 'Devon,' a cutting-edge autonomous bot designed to tackle programming tasks and data analysis with ease. Despite its impressive capabilities, the video explores whether learning to code remains essential. It contrasts Devon's performance with other AI models, highlighting its proficiency yet emphasizing the continuous need for human guidance and specification in complex tasks. The video also touches on Coursera's educational offerings, other AI advancements, and concludes with an exploration of 'Claude,' another AI model, showcasing its potential in economic analysis. Ultimately, it suggests a future where coding skills, coupled with AI tools, could unlock new problem-solving paradigms.

Takeaways

  • 💻 The CEO of a major company suggests that the future of jobs, particularly in programming and computer science, may not align with the common belief that everyone should learn to code.
  • 🤖 Devon, introduced as a fully autonomous AI bot with capabilities in coding and browsing, represents a significant step towards automation in software engineering and data analysis.
  • 📊 The AI, Devon, impresses with its ability to troubleshoot code, find and fix bugs iteratively, and even perform data analytics, showcasing a broad range of potential impacts on professional tasks.
  • 📖 Education platforms like Coursera remain relevant for learning essential skills in data analytics, with special deals and comprehensive courses recommended for beginners.
  • 🔎 Comparative tests reveal Devon's proficiency in resolving real-world GitHub issues, significantly outperforming other models, though still far from perfect, indicating a trajectory towards improvement.
  • 🛠 Despite Devon's advanced capabilities, complex tasks require detailed human prompts, suggesting that human input remains crucial in guiding AI to solve specific problems effectively.
  • 🧐 The introduction of tools by GitHub to automatically fix code without developer intervention hints at diminishing the need for human involvement in certain coding tasks.
  • 📈 Anthropics' Claude, positioned as a competitor to OpenAI's GPT-4, demonstrates advanced economic analysis capabilities, including GDP prediction and transcribing accuracy, suggesting diverse applications for AI beyond software engineering.
  • 📱 The script highlights the importance of integrating coding skills, particularly Python, with AI technologies to tackle more complex analytical problems, indicating a future where coding remains a valuable skill.
  • ✨ The ongoing development and hype around AI tools like Devon and Claude reflect the dynamic nature of AI advancements, driven by funding and market interest, while stressing the critical role of human-AI collaboration.

Q & A

  • What is the CEO of the third most valuable company's claim about the future of jobs and learning computer science?

    -The CEO claimed that contrary to the popular belief that learning computer science is vital for future jobs, the reality might be almost exactly the opposite.

  • What is Devon, and what capabilities does it have?

    -Devon is described as a fully autonomous bot equipped with a coding environment and browser, capable of taking over tasks with just a simple prompt, conducting internet research, coding, and providing analyses that mimic human job functions.

  • How did Devon approach fixing a bug in a code according to the demonstration?

    -Devon used an iterative approach to debug, adding print statements to analyze the inputs and outputs of the failing test, identifying the incorrect case, and then updating the code to fix the identified bug.

  • Can Devon perform data analytics tasks?

    -Yes, besides troubleshooting code bases, Devon demonstrated the ability to perform data analysis, showcasing its versatility beyond just software engineering tasks.

  • What did the demonstration reveal about Devon's ability to train AI models?

    -The demonstration highlighted Devon's capability to download code and fine-tune a model, although it did not conclude the effectiveness of the training process.

  • What unique task did Devon accomplish related to making money?

    -Devon was tasked with identifying potholes on a road using computer vision, updating necessary packages, fixing a bug, running the model, and providing a detailed report along with screenshot examples of its work.

  • How does Devon's problem-solving process work according to the examples?

    -Devon's process involves receiving a problem from a human, finding solutions through iterative approaches, pulling necessary GitHub repositories, and reporting back, highlighting the ongoing need for human interaction in defining problems.

  • What new tool did GitHub introduce, and how does it challenge the need for human intervention?

    -GitHub introduced a new tool that automatically fixes code, potentially remediating more than two-thirds of vulnerabilities found, often without developers needing to edit any code, suggesting less need for human intervention.

  • How does Devon compare to other common models in resolving real-world GitHub issues?

    -Devon achieved a 14% success rate in resolving real-world GitHub issues, which is significantly higher than the best market model at around 2%, indicating a positive trajectory for AI in coding tasks.

  • What is Claude, and how does it demonstrate its capabilities in economic analysis?

    -Claude is a model introduced by the team at Anthropic, demonstrating capabilities in economic analysis by transcribing GDP graph data, evaluating accuracy, predicting GDP using Monte Carlo simulations, and performing international economic analysis, showcasing advanced use of coding and large language models in analytics.

Outlines

00:00

🤖 The Future of Jobs and AI's Role

This paragraph discusses the changing landscape of jobs due to advancements in AI technology. It highlights the capabilities of Devon, an autonomous bot that can perform tasks like coding and data analysis, potentially impacting job security. The speaker explores the implications of AI in software engineering and data science, emphasizing the importance of human involvement in guiding AI tools. The paragraph also touches on the limitations of AI and the need for continuous learning to stay relevant in the evolving job market.

05:01

📈 Comparative Analysis of AI Models

The second paragraph focuses on the performance of AI models, particularly Devon, in solving real-world problems. It compares Devon's success rate in resolving GitHub issues with the best models available in the market, highlighting the progress and potential of AI. The speaker also addresses the overhyped nature of AI technologies and the role of funding in their development. The paragraph concludes with a discussion about another AI model, Claude, which demonstrates the potential of combining AI with data analysis tools like Python for more complex problem-solving in the future.

Mindmap

The video is abnormal, and we are working hard to fix it.
Please replace the link and try again.

Keywords

💡AI software engineer

The term 'AI software engineer' refers to an artificial intelligence system designed to perform tasks typically associated with software engineering. In the context of the video, Devon is advertised as the first AI software engineer, capable of coding, debugging, and problem-solving in software development. This concept challenges the traditional notion of software engineering by suggesting that AI can autonomously handle complex tasks, raising questions about the future of jobs in the field.

💡Computer science education

Computer science education refers to the academic and professional study of computer technology, software, and programming. The video script suggests that it is widely believed that children should learn computer science to prepare for the future job market. However, the speaker challenges this notion by presenting the emergence of AI technologies like Devon, which could potentially reduce the demand for traditional programming skills.

💡Devon

Devon is a fully autonomous AI bot that has been showcased with capabilities such as coding, debugging, and data analysis. It is presented as a tool that can potentially revolutionize software engineering by taking over tasks that would normally require human expertise. The video discusses Devon's ability to understand prompts, browse the internet for validation, and perform coding tasks, which raises concerns about the future of human jobs in software engineering.

💡GitHub repository

A GitHub repository is a storage location for a project's code, where developers can collaborate, track changes, and manage the project's versions. In the context of the video, Devon is shown to pull from a GitHub repository to find solutions to coding problems, highlighting the importance of open-source collaboration in AI development and problem-solving.

💡Data analytics

Data analytics involves the process of examining data sets to draw conclusions about the information they contain. In the video, the AI's application in data analytics is discussed, with examples such as Devon performing data analysis and Claude working as an economic analyst. This highlights the expanding role of AI in not just software engineering, but also in analyzing and interpreting complex data sets.

💡迭代方法 (Iterative approach)

迭代方法是一种解决问题或开发产品的方法,它涉及重复的周期,每个周期都包括对工作的评估和改进。在视频中,Devon使用迭代方法来调试代码和修复bug,这表明AI可以采用逐步细化的方式来完善其解决方案,类似于人类软件工程师的工作方式。

💡自动化

自动化是指使用技术来控制过程和系统,以减少或消除人类干预的需求。在视频的上下文中,自动化特指Devon这样的AI工具能够自动执行编程和数据分析任务,这可能会改变未来工作的性质,尤其是在软件工程和数据科学领域。

💡编程

编程是创建计算机程序的过程,涉及编写代码来指示计算机执行特定任务。在视频中,编程是讨论的主要话题之一,特别是在探讨AI如Devon和Auto GPT等工具是否将取代人类程序员的背景下。视频提出了关于学习编程是否仍然有价值的问题,考虑到AI的发展可能会改变编程职业的未来。

💡经济分析

经济分析是研究经济数据和趋势以提供对经济状况的见解和预测的过程。在视频中,通过展示AI模型Claude作为经济分析师的工作,它能够查找GDP趋势并进行预测,这突出了AI在处理和解释复杂经济数据方面的潜力。

💡Python

Python是一种广泛使用的高级编程语言,以其清晰的语法和代码可读性而闻名。在视频中,Python被提及作为分析数据和与大型语言模型结合使用的工具,这表明Python在数据科学和AI领域中的重要性和应用潜力。

💡Coursera

Coursera是一个在线学习平台,提供来自世界各地大学和机构的课程。在视频中,Coursera被提及作为一个资源,推荐给有抱负的数据分析师,强调了其提供的Google数据分析证书课程的价值,这表明了在线教育在专业发展和技能提升中的作用。

Highlights

The introduction of Devon, a fully autonomous bot capable of coding and browsing the internet.

Devon's ability to break down tasks and solve problems by iterating and debugging code.

The demonstration of Devon's data analysis capabilities beyond just software troubleshooting.

Devon's use of an iterative approach to identify and fix bugs in code.

The showcasing of Devon's potential in data science through AI training.

Devon's capability to make money by solving real-world problems, such as identifying potholes using computer vision.

The importance of human involvement in guiding AI like Devon to solve the right problems.

GitHub's introduction of a tool to automatically fix code vulnerabilities without developer intervention.

The sponsor mention of Coursera and its Google Data Analytics certificate for aspiring data analysts.

Comparative testing of Devon against other models in resolving real-world GitHub issues.

The current trajectory of AI technology suggests a potential future where it could solve all issues, but not in the immediate future.

The necessity of detailed prompting for complex tasks, indicating the current limitations of AI in autonomy.

The discussion on the overhyping of AI technologies and the influence of funding on their promotion.

The introduction of Claude, an AI model working as an economic analyst.

Claude's use of machine learning to predict GDP trends and its accuracy in transcribing data from graphs.

The impressive parallel processing capabilities of Claude in analyzing global economies.

The potential future of analytics combining coding with large language models to solve complex problems.

The tutorial recommendation for learning SQL as a starting point in data analytics.

Transcripts

play00:00

that nerds our jobs in the future are

play00:02

going to be a lot different the CEO of

play00:04

the third most valuable company made

play00:06

this bold claim um almost everybody who

play00:09

sits on a stage like this would tell you

play00:11

it is vital that your children learn

play00:14

computer

play00:16

science um everybody should learn how to

play00:18

program and in fact it's almost exactly

play00:21

the opposite so is it even worth your

play00:23

time learning how to program well in

play00:25

this I'm going to be exploring a few new

play00:27

technologies that just came out in order

play00:30

to answer that question last week the

play00:32

world was introduced to Devon a fully

play00:34

autonomous bot equipped with tools like

play00:36

a coding environment and browser with

play00:38

this it can basically take over the

play00:39

world providing it with only a simple

play00:41

prompt it gets to work putting together

play00:43

a plan of action breaking up what it

play00:45

needs to do into simple tasks it starts

play00:47

by browsing the internet to make you

play00:49

feel more confident that its answers

play00:50

aren't going to be hallucinations then

play00:52

jumps into some light coding to make you

play00:54

even feel less secure about your

play00:56

long-term job security and then after it

play00:58

fixes a quick bug it provides this

play01:00

groundbreaking analysis that you can

play01:02

provide to your boss as your own work so

play01:04

let's break Devon down it's advertised

play01:06

as the first AI software engineer

play01:09

there's a lot of problems with that

play01:10

we'll get to that but let's actually

play01:12

look at some of the use cases they've

play01:13

used it for now all these demonstrations

play01:15

were done by employees of cognition so

play01:18

to be fair there haven't been a lot of

play01:19

outside tests of this tool anyway in

play01:21

this case the engineer wants to fix a

play01:23

bug in the code he provides some pretty

play01:25

detailed instructions and Devon gets to

play01:27

work now one of the impressive things

play01:29

from this demo was it used as an

play01:31

iterative approach so Deon here actually

play01:34

wrote uh actually added a print

play01:36

statement to debug the outputs uh and

play01:39

the uh inputs to the failing test reran

play01:42

the tests and actually found which case

play01:44

was wrong which is actually a second bug

play01:47

that Devon found and it then went and

play01:49

updated the code to fix this second bug

play01:53

and with that demo you may be like Luke

play01:54

I'm a data nerd not a software engineer

play01:57

this thing's just troubleshooting code

play01:59

bases and not actually performing data

play02:00

analytics so I have nothing to worry

play02:02

about with my job well if you recall

play02:03

from that first example that I showed

play02:05

Devon did do data analysis additionally

play02:08

they had a demo showcasing well as they

play02:11

stated today I'm going to show you an AI

play02:13

training in AI which is not only meta

play02:16

but also shows that this is not just

play02:18

geared towards software Engineers but

play02:20

also has the potential defect Us in data

play02:22

science now this followed a similar

play02:24

approach as we've seen before of

play02:25

downloading the code and in this case

play02:27

going through and fine-tuning a model

play02:29

which after about an hour it's only

play02:30

about 4% done with training and

play02:32

conveniently there's no conclusion on

play02:34

what happened with the training now one

play02:36

of the most impressive exercises that

play02:37

demon demonstrated was the ability to

play02:40

actually make money it was provided with

play02:41

a problem from upward which side note

play02:44

how are you going to calculate hourly

play02:45

rates whenever AIS work almost

play02:47

instantaneous anyway this thing was

play02:49

looking at making inferences with a

play02:51

computer vision model that's fancy talk

play02:53

in this case as all it really wanted to

play02:54

do was label potholes on a Road Devon

play02:57

got to work and the first thing I noted

play02:58

was some of the packages were out of day

play03:00

so it updated it which then it found a

play03:01

bug in the code which wasn't supposed to

play03:03

be there and once again it used that

play03:04

print statement approach in order to

play03:06

find it I'll be honest I don't know why

play03:08

it's not using a debugger so finally

play03:09

after this it gets into running the

play03:11

model and providing a detailed report on

play03:13

it and even provides some screenshot

play03:15

examples of it working in action along

play03:17

with this final write up in a text file

play03:20

that overviews the work and also the

play03:22

conclusions that it came to not going to

play03:24

lie if I received this on upwork I'd be

play03:26

pretty impressed so there's a Core theme

play03:27

that Devon is following that I found in

play03:29

all these examples some human is unhappy

play03:31

because it doesn't know how to solve a

play03:33

problem so it all flows that to Devon

play03:35

who gets to work Devon then in all these

play03:37

cases went and pulled this GitHub

play03:39

repository after it found a solution

play03:41

working in an iterative approach and

play03:43

reported back to the human I love you

play03:44

Devon which should no longer be unhappy

play03:46

and this demonstrates an important point

play03:48

you still need a human in the mix in

play03:51

order to guide Devon onto what problems

play03:53

it needs to be solving oh wait what's

play03:57

this GitHub introduces a new tool in

play04:00

order to automatically fix code this new

play04:02

feature promises that this new system

play04:04

can remediate more than 2third of the

play04:06

vulnerabilities that it finds often

play04:08

without the developers having to edit

play04:10

any code

play04:11

themselves okay scratch that on human

play04:14

intervention all right before we go

play04:15

further we need to pay some bills and

play04:17

give a shout out to the sponsor of this

play04:18

video corsera which is having a special

play04:21

deal right now now the number one course

play04:23

that I recommend for aspiring data

play04:25

analyst is the Google data analytics

play04:27

certificate this covers not only what

play04:28

it's like to be a data analyst but also

play04:30

goes into all the core Technologies you

play04:33

need to know including SQL programming

play04:35

languages Vis tools and spreadsheets

play04:38

it's where I recommend anyone new to

play04:39

that analytics start and I've made a

play04:41

number of videos interviewing those that

play04:43

have taken this to better understand the

play04:44

value of this certificate now right now

play04:46

corsera is offering a heck of a deal

play04:48

where you can receive $100 off your

play04:50

yearly subscription to corsera plus

play04:53

which works out to being less than a

play04:55

dollar a day with this it not only gives

play04:56

you access to the Google certificate but

play04:58

also 7,000 other learning programs

play05:01

including a ton of resources on my

play05:03

favorite programming language now I'm

play05:04

not just recommending corsera because it

play05:06

was a sponsor of this video I've

play05:08

actually personally paid for a corsera

play05:10

plus and used it for my learnings as

play05:12

shown by this receipt more recently I've

play05:14

been using this to improve my knowledge

play05:16

on applying AI in data analytics

play05:18

specifically I've been working through a

play05:20

lot of different courses and I just

play05:21

completed this project based course on

play05:23

using Python's Lang chain for analyzing

play05:26

your own data which we're going to go

play05:28

into more detail in a bit of what

play05:29

Technologies I'm going to be covering

play05:31

over the next year all right thanks

play05:33

again to corsera for sponsoring this

play05:35

video and let's get back to it so how

play05:37

does Devon actually perform in a

play05:39

comparative test to other common models

play05:42

and these results were testing whether

play05:44

it can resolve real world GitHub issues

play05:47

Devon got a whopping 14% which you're

play05:49

probably like Luke that's nowhere near

play05:51

100% that's like 3 and 20 how the heck

play05:54

is it going to actually do my job but if

play05:56

you look at the best model in the market

play05:57

today it's only at around 2 % so it's a

play06:01

little bit better personally I think

play06:02

this graph answers on whether you should

play06:04

learn coding or not it's not solving all

play06:06

the issues today or tomorrow but we're

play06:08

on a positive trajectory to maybe one

play06:10

day be at 100% but not anytime soon now

play06:13

the other thing that reassures me from

play06:14

this is that I don't know if you noticed

play06:16

this from those videos that I showed

play06:17

earlier but for complex tasks Devon

play06:20

takes a fair bit of prompting and by

play06:22

fair I mean a lot and this case I feel

play06:24

the engineer had to go into an enormous

play06:26

amount of detail in order to specify how

play06:28

it wanted it to solve its problem which

play06:31

with this level of specifity I think

play06:33

even the free version of chat gbt could

play06:34

solve it and that's where I think we are

play06:36

with this technology today yeah although

play06:38

they're claiming that Devon is first AI

play06:40

software engineer Auto GPT which has

play06:42

been around for almost a year now has

play06:44

been doing a lot of the same things but

play06:46

doesn't get as nearly as much virality

play06:48

as Devon did which coincidentally is

play06:50

happening almost in tandem when these

play06:51

type of companies are raising funding

play06:53

like cognition did last week I want to

play06:55

be clear I'm not trying to on Devon

play06:57

and say it's a bad tool in fact I think

play06:59

quite opposite I think they've done

play07:00

incredible advancements and we're moving

play07:02

in the right direction but these type of

play07:05

Technologies can be overhyped and is

play07:08

driven by funding now there's another

play07:10

announcement last week that I feel is

play07:11

more relevant to us data nerds and it

play07:13

deals with this model which is only

play07:15

second to open ai's GPT 4 the team at

play07:18

anthropic released this video on Claude

play07:20

working as an economic analyst they

play07:22

prompted it to look up GDP trends for

play07:24

the US and write a markdown table of the

play07:27

estimates which you got to work

play07:28

transcribing this screenshot of a graph

play07:31

of the GDP from there they went to

play07:33

evaluate how accurate those

play07:35

transcriptions were so it had the model

play07:37

plot those transcribed values in this

play07:39

interactive plot and then after having

play07:42

provided the model the actual results it

play07:44

plotted them side by side so how

play07:46

accurate is Claude at using the vision

play07:49

model for transcription we tried it with

play07:51

a large sample of madeup GDP graphs and

play07:53

its transcription accuracy was within

play07:55

11% on average which not bad but

play07:58

probably can be improve so then they

play08:00

moved into having Claude use machine

play08:02

learning in order to predict GDP in this

play08:05

case using a Monte Carlo simulation and

play08:08

just like most people thinks the US

play08:09

economy is going to be just fine for the

play08:11

next few years but really none of this

play08:13

was impressive until I saw this where it

play08:15

asked Claude to perform in an analysis

play08:17

of the world's economy looking at more

play08:19

than just one country in this case

play08:21

although they didn't disclose it it

play08:22

looks like they were using some sort of

play08:24

large language model framework in order

play08:27

to implement agents which all of these

play08:30

agents were working in parallel

play08:32

collecting all the data they needed for

play08:34

these top countries and processing it

play08:37

pretty dang impressive for the final

play08:39

results it provided these pie charts

play08:42

comparing the two values side note I was

play08:44

a little disappointed with this because

play08:45

pie charts are actually really bad at

play08:46

comparing values but nonetheless it not

play08:48

only provided an analysis it also

play08:51

provided a final summary detailing how

play08:54

the major countries planed a fair over

play08:55

the next few years now I thought this

play08:57

was more impressive because it

play08:58

demonstrated how you can actually use

play09:01

coding such as python to perform an

play09:03

analysis with a large language model and

play09:06

frankly this is where I see analytics

play09:07

going into the future personally I'm

play09:09

going to be exploring more on this

play09:10

channel how to use things like python in

play09:13

conjunction with libraries that build

play09:15

out agents for large language models to

play09:17

solve more complex problems all right as

play09:20

always you got value out this video

play09:21

smash that like button and if you'd like

play09:22

to learn more about how to start coding

play09:24

in data analytics I just made this

play09:26

tutorial right here on how to learn SQL

play09:28

all right with that see you in the next

play09:30

one

Rate This

5.0 / 5 (0 votes)

Related Tags
AI FutureCoding EducationData AnalyticsDevon BotJob SecurityAI Software EngineerGitHub IntegrationEconomic AnalysisMachine LearningProgramming