Was GPT-5 Underwhelming? OpenAI Co-founder Leaves, Figure02 Arrives, Character.AI Gutted, GPT-5 2025

AI Explained
6 Aug 202412:05

Summary

TLDRThe transcript discusses the current state of AI development, highlighting delays in GPT-5's release and leadership changes at OpenAI, including Greg Brockman's leave and John Schulman's departure to Anthropic. The narrative mentions other AI advancements, like Meta's Llama models and Google's Gemini 1.5 Pro, suggesting OpenAI may be falling behind. It also covers upcoming events like OpenAI's Dev Day and the ongoing lawsuit from Elon Musk against Sam Altman. Additionally, the transcript touches on humanoid robots using OpenAI's technology and the broader AI landscape's shifting dynamics.

Takeaways

  • ๐Ÿš€ The GPT 5 release date is uncertain, with Open AI leadership experiencing changes and departures.
  • ๐Ÿง˜ Greg Brockman, co-founder of Open AI, is taking a leave of absence, highlighting the ongoing challenges in achieving safe AGI.
  • ๐Ÿข There is a trend of departures among Open AI co-founders, including John Scharff who left for Anthropic, signaling potential alignment issues.
  • ๐Ÿ“‰ Open AI may be lagging in raw intelligence compared to models like Llama with 45 billion parameters, suggesting a need for significant improvement in their next model.
  • ๐Ÿ”ฎ The anticipation for GPT 5 is high, but its release might be delayed beyond the expected timeframe, possibly due to strategic planning around developer events.
  • ๐Ÿค– The development of agile, autonomous humanoid robots like Fig2 indicates a potential shift towards practical applications of AI beyond just model development.
  • ๐Ÿ“‰ Elon Musk's lawsuit against Sam Altman and Open AI raises questions about the original intentions and current direction of Open AI.
  • ๐ŸŒ The debate over who will control the future of AI is intensifying, with concerns about the concentration of power in a few large companies.
  • ๐Ÿ“Š The LM CIS chatbot Arena leaderboard shows a competitive landscape with models like Gemini 1.5 Pro and Claude 3.5 Sonic outperforming others.
  • ๐Ÿ”‘ There is a growing trend of smaller AGI labs being overshadowed by larger entities with more resources, such as Google acquiring talent from Character AI.
  • ๐Ÿ”„ The importance of a data flywheel effect is emphasized, where more data leads to better AI, which in turn can generate more data.

Q & A

  • What is the significance of the departure of John Scharff from OpenAI to Anthropic?

    -John Scharff's departure to Anthropic is significant because it indicates a trend of departures among co-founders at OpenAI, and it suggests that he may not have been working with people who were deeply engaged with the alignment work he was focused on, which is an attempt to align machine values with human values.

  • What does the term 'alignment' refer to in the context of AI?

    -In the context of AI, 'alignment' refers to the effort to align machine values with human values, ensuring that AI systems behave in ways that are beneficial and safe for humans.

  • Why is the release of GPT-5 potentially delayed beyond expectations?

    -The release of GPT-5 could be delayed because OpenAI has announced events focusing on advancements in APIs and dev tools, which implies that they are not planning to release a new model immediately after these events, suggesting a release date more likely at the end of the year or later.

  • What is the advanced voice mode mentioned in the script, and how is it related to OpenAI's models?

    -The advanced voice mode is a feature that makes the voice output of AI models incredibly lifelike. It is related to OpenAI's models as it could potentially be a part of their future offerings, enhancing user interaction with their AI systems.

  • What is the controversy surrounding Elon Musk's lawsuit against Sam Altman and OpenAI?

    -Elon Musk is suing Sam Altman and OpenAI, accusing them of lying and deceit, claiming that Altman was motivated by greed and that Musk wanted OpenAI to be more open to compete with Google. The lawsuit describes the original OpenAI as a 'spurious venture' and uses strong language to emphasize the accusations.

  • How does the script suggest that smaller AI labs like Character AI are struggling to compete with larger labs?

    -The script suggests that smaller AI labs like Character AI are struggling because they are being overshadowed by the scale of data and compute resources that larger labs like Meta and Google have, which is making it difficult for them to develop models that are as intelligent as those of the larger labs.

  • What is the significance of the LM Cis Chatbot Arena leaderboard for AI models?

    -The LM Cis Chatbot Arena leaderboard is significant as it provides a ranking of AI models based on their performance in various tasks, offering a comparative measure of their capabilities and helping to identify trends and strengths among different models.

  • Why might the new Gemini 1.5 Pro version be considered a more justified leader on the LM Cis Chatbot Arena leaderboard compared to GPT-40 Mini?

    -The new Gemini 1.5 Pro version might be considered a more justified leader because it performs slightly worse than 3.5 Sonic but far better than other models, suggesting a more balanced and comprehensive performance across various tasks.

  • What is the potential impact of the advanced voice mode on the adoption of AI models by the public?

    -The advanced voice mode could significantly increase the adoption of AI models by the public because it makes interactions with AI systems more natural and lifelike, potentially leading to hundreds of millions of people using it once it becomes available.

  • What does the script suggest about the future of AI control and competition among different companies?

    -The script suggests that the future of AI control is becoming increasingly concentrated among a few large companies, with OpenAI, Meta, and Google leading the way. It raises questions about who will control the future of AI and the implications of these companies' dominance in the field.

Outlines

00:00

๐Ÿ” Open AI Leadership and Departures

The script discusses the shifting landscape of AI leadership, with Open AI facing internal changes as Greg Brockman takes a leave of absence and several key figures depart, including John Schan, the former head of alignment. The departures suggest a lack of alignment with the company's direction, especially concerning the critical issue of AI alignment with human values. There is speculation about the capabilities of Open AI's next frontier model, which may have already been trained, and the implications of these departures on the company's progress towards AGI. The script also touches on the lawsuit by Elon Musk against Sam Altman and Open AI, accusing them of deceit and greed, and the broader debate on who will control the future of AI.

05:03

๐Ÿ“Š AI Competition and Model Advancements

This paragraph delves into the competitive AI landscape, highlighting the strides made by companies like Meta with their Llama model and the challenges faced by Open AI to keep pace. It discusses the potential release of advanced models like Claude 3.5 and the significance of open-source competition encouraged by the White House. The script also points out the struggles of smaller AI labs like Character AI, which have been overshadowed by tech giants like Google. The discussion includes the performance of various AI models on benchmarks and leaderboards, emphasizing the difficulty in measuring AI intelligence and the nuances of model capabilities.

10:03

๐Ÿค– Integration of AI with Robotics and Future Prospects

The final paragraph focuses on the integration of AI with robotics, particularly the adoption of Open AI's models by Figure O2 humanoid robots. It describes the capabilities of these robots, including advanced voice interaction and physical strength equivalent to humans. The script speculates on the potential for a data flywheel effect as these robots perform tasks autonomously and self-correct. It also mentions the use of Weights & Biases for tracking machine learning experiments and the importance of iterating on LLM applications. The paragraph concludes by reflecting on the shifting dynamics in the AI field and expressing gratitude to the audience for their engagement.

Mindmap

Keywords

๐Ÿ’กAGI (Artificial General Intelligence)

AGI refers to a theoretical form of artificial intelligence that possesses the ability to understand, learn, and apply knowledge across a wide range of tasks at a level equal to or beyond that of a human. In the video's context, AGI is the ultimate goal that various AI labs are striving towards, with discussions about whether companies like OpenAI are making progress towards achieving it.

๐Ÿ’กOpenAI

OpenAI is a research laboratory that focuses on creating and developing friendly artificial general intelligence (AGI). In the script, OpenAI is highlighted as a key player in the AI field, with discussions around its leadership changes, model developments, and the departure of co-founders.

๐Ÿ’กAlignment

Alignment in the context of AI refers to the effort to ensure that the goals and values of an AI system are consistent with human values and interests. The script mentions several departures from OpenAI, particularly those from the alignment team, suggesting challenges in this area.

๐Ÿ’กGPT (Generative Pre-trained Transformer)

GPT is a type of AI language model developed by OpenAI that is capable of generating human-like text based on the input it receives. The script discusses the evolution of GPT models, with a focus on the anticipated release and capabilities of GPT 5.

๐Ÿ’กCo-founder Departures

The script notes a trend of co-founders leaving OpenAI, which may signal internal disagreements or a shift in direction. These departures are significant as they could impact the company's focus and progress towards AGI.

๐Ÿ’กAdvanced Voice Mode

Advanced Voice Mode is a feature mentioned in the script that is expected to make AI-generated speech more lifelike and interactive. It is an example of how AI is becoming more integrated into user experiences, potentially leading to widespread adoption.

๐Ÿ’กLM CIS Chatbot Arena

LM CIS Chatbot Arena is a leaderboard that ranks AI chatbots based on their performance. The script uses this leaderboard to discuss the relative capabilities of different AI models, such as Gemini 1.5 Pro and Claude 3.5 Sonic.

๐Ÿ’กWeights and Biases

Weights and Biases is a tool used for tracking machine learning experiments. In the script, it is mentioned as a sponsor and a resource for AI labs, including OpenAI, to improve their models and applications.

๐Ÿ’กData Flywheel

A data flywheel refers to a self-reinforcing cycle where the accumulation of data leads to improvements in AI models, which in turn generate more data. The script suggests that this concept is relevant to the development of AI, particularly in the context of robots like Fig2.

๐Ÿ’กFig2

Fig2 refers to a humanoid robot mentioned in the script that utilizes AI for tasks and interactions. The robot's capabilities, such as speech and physical tasks, are discussed as indicators of progress towards more advanced AGI.

๐Ÿ’กElon Musk

Elon Musk is mentioned in the script in relation to a lawsuit against Sam Altman and OpenAI, alleging deceit and a lack of openness. This highlights the contentious nature of the AI industry and the differing visions of its future.

Highlights

GPT 5 release date is uncertain, with Open AI leadership facing challenges.

Greg Brockman, co-founder of Open AI, is taking a leave of absence until the end of the year.

John Scharf, another co-founder, has left Open AI for Anthropic, indicating a lack of alignment with the company's direction.

There is a trend of departures among co-founders at Open AI, raising concerns about the company's future.

Open AI has begun training its next Frontier Model, but details on its capabilities remain unknown.

Open AI's Dev Day in October suggests that GPT 5 will not be released before the end of November 2024.

Elon Musk is suing Sam Altman and Open AI, alleging deceit and a lack of openness in the company's operations.

Meta and Google are leading in AI development, with smaller labs struggling to compete.

Character AI, a smaller AGI lab, has been overshadowed by Google's acquisition of key talent and IP.

The debate on AI intelligence suggests that scale in data and compute is more important than obscure tricks or arcane knowledge.

LM CIS chatbot Arena leaderboard shows new models like Gemini 1.5 Pro taking the lead in performance.

Open AI's advanced voice mode is incredibly lifelike, suggesting potential for widespread adoption.

Figure O2 humanoid robots are using Open AI models, indicating a practical application of AI technology.

AI Labs are increasingly relying on tools like Weights & Biases to track machine learning experiments.

The shift in AI development suggests that control over the future of AI is becoming more contested.

The release of raw data by LMIS provides insights into model performance and their refusal to answer certain questions.

The new Gemini 1.5 Pro is being tested for its reasoning capabilities, with results expected to be published online.

AI development is moving towards more practical applications, such as autonomous humanoid robots, rather than just theoretical advancements.

Transcripts

play00:00

as the GPT 5 release date sails further

play00:03

into the Horizon open AI leadership

play00:06

splinters meanwhile other AI Labs ship

play00:10

incrementally smarter models and smaller

play00:13

AGI efforts like character AI are

play00:16

swallowed by the Google whale leaving in

play00:19

my eyes just four companies remaining in

play00:22

contention for having the most capable

play00:24

models out there but if we get agile

play00:26

autonomous humanoid robots like fig2

play00:29

that could soon count to 50 in one

play00:31

breath with chat gbt advanced voice

play00:34

maybe many will take that as a win for

play00:36

2024 but let's start with the leadership

play00:39

of open Ai and Greg Brockman is going on

play00:42

a leave of absence through to the end of

play00:45

the year calling it his first time to

play00:47

relax since co-founding the company he

play00:50

goes on the mission is far from complete

play00:52

though we still have a safe AGI to build

play00:56

now of course any one person taking time

play00:58

away might well be for person reasons

play01:00

but we have the full-on departure to

play01:03

anthropic of another one of the

play01:05

co-founders John schan the reason that

play01:07

he as the former head of alignment at

play01:09

openai jump ship to anthropic I think is

play01:12

given away in this sentence he said he

play01:14

wants to do research alongside people

play01:17

deeply engaged with the topics he's most

play01:19

interested in now it is pretty hard to

play01:21

read that as saying anything other than

play01:24

he wasn't working with people who were

play01:27

deeply engaged with the kind of

play01:28

alignment work that he was working on

play01:30

Just for those who don't know alignment

play01:32

is this attempt to align machine values

play01:34

with human values but it follows the

play01:37

departure of the previous head of

play01:39

alignment Yan Leica and before that the

play01:41

previous co-head of alignment and

play01:43

co-founder Ilia satova I'm definitely

play01:46

starting to notice a trend of Departures

play01:48

among co-founders at openai so I've

play01:51

listed chat PT to count the former heads

play01:54

of alignment at open AI 1 2 3 4 5 6 7 8

play01:57

9 10 11 12 13 14 15 16 17 18 192 21 22

play02:00

23 24 25 26 27 28 29 30 31 32 33 34

play02:05

obviously I was somewhat joking there I

play02:06

think there's been less than 10 former

play02:09

heads of alignment but that advanced

play02:11

voice mode does seem cool I mean even if

play02:14

open AI models aren't actually getting

play02:16

smarter and arguably with GPC 40 are

play02:18

getting slightly Dumber that advanced

play02:20

voice mode is incredibly lifelike and I

play02:23

could see hundreds of millions of people

play02:25

using it at least when it comes out

play02:27

which seems to be on the Never Never and

play02:29

here's some even more important context

play02:31

open aai back in May said that they had

play02:35

recently begun training its next

play02:37

Frontier Model now here we are on August

play02:40

the 6th it would almost certainly have

play02:42

finished training by now so those key

play02:44

players at openai would have a rough

play02:47

sense for its capabilities all of these

play02:49

departures after they've trained their

play02:52

latest Frontier Model seems strange and

play02:55

then we got this about open ai's

play02:57

so-called Dev day which is starting in

play02:59

October October and running through to

play03:01

November I am sure it will be

play03:03

fascinating but they released this

play03:05

particular nugget while we know

play03:06

developers are waiting for our next big

play03:09

model which we shared has begun training

play03:11

earlier this year in May these events

play03:14

will focus on advancements in the API

play03:17

and are Dev tools taking that at face

play03:19

value would imply that quote GPT 5 will

play03:23

not come before November 21st more

play03:26

likely it means that gbt 5 wouldn't even

play03:28

come before the end of the year because

play03:29

why would you release a model just after

play03:31

you've invited a load of devs to play

play03:33

about with your tools now yes samman has

play03:36

recently claimed in the Washington Post

play03:38

that more advances will soon follow and

play03:41

will usher in a decisive period in the

play03:43

story of human society but in recent

play03:46

months it would be hard to say that open

play03:49

AI have produced much in the way of

play03:51

decisive progress and of course all of

play03:54

that comes as Elon Musk is suing yet

play03:57

again Sam mman and open AI for what he

play04:00

says is lying and pery the lawsuit calls

play04:04

the original open AI a spurious Venture

play04:07

the language used throughout this

play04:08

86-page document is hardly subtle musk

play04:11

claims that samman is doing a long con

play04:15

his pery and deceit are of Shakespearean

play04:18

proportions now it does go on and on but

play04:20

the basic accusation is that Sam ultman

play04:22

was motivated by greed and Elon Musk

play04:26

just wanted to have something more open

play04:28

to compete versus Google so I think musk

play04:31

and Others May raise an eyebrow when

play04:33

later in the article samman said making

play04:36

sure open-source models are readily

play04:38

available to developers in other nations

play04:40

will further bolster our advantage

play04:42

talking of the US and the question that

play04:45

the article fundamentally raises is who

play04:47

will control the future of AI well as of

play04:51

today it looks less and less likely to

play04:54

be Sam Morman open AI are certainly good

play04:57

at productizing Ai and the advanced

play04:59

voice mode as we saw is great search GPT

play05:03

could make them some money and S is

play05:04

coming out at some point presumably this

play05:07

year and at least at the moment the

play05:09

figuro 2 robot is using an open AI video

play05:12

language model we'll get back to that in

play05:14

just a moment but in terms of raw

play05:17

intelligence open AI feel like they're

play05:19

falling behind the Llama 3 45 billion

play05:23

parameter model is already smarter than

play05:26

GPT 40 and Zuckerberg has recently

play05:29

committed to 10 times more computing

play05:32

power to train llama 4 or to put it

play05:34

another way the next open AI model would

play05:36

have to be significantly better than GPT

play05:38

40 just to catch up to the current

play05:41

state-ofthe-art let alone the

play05:42

state-ofthe-art when llama 4 comes out

play05:44

and of course in the meantime even just

play05:46

this year we might be getting Claude 3.5

play05:48

Opus from anthropic or even Claude 4

play05:51

simply put a year is a long time in Ai

play05:54

and the debate has moved on even the

play05:56

White House are now encouraging

play05:57

open-source competition to the light of

play06:00

open AI Sam Alman meanwhile is still

play06:03

warning about people stealing key

play06:05

intellectual property such as model

play06:07

weights now do forgive me for pointing

play06:10

out that one of the modules on my

play06:11

corsera course is about that difference

play06:14

between open source and open weights

play06:16

super grateful of course for those 10

play06:18

reviewers who have kindly left reviews

play06:21

for this course but don't get me wrong

play06:23

it's not like meta is having it all its

play06:25

own way do you remember those Tom Brady

play06:27

Paris Hilton chatbots that all the

play06:29

youngsters were apparently going to be

play06:31

using I think each celebrity was paid

play06:33

something like $5 million for a few

play06:36

hours of recordings well apparently

play06:38

they're now being scrapped and none of

play06:40

those AI chat Bots amassed a

play06:42

particularly big following but nor are

play06:44

things going particularly well for these

play06:46

smaller AGI Labs like character AI their

play06:50

product was or is an array of chat Bots

play06:53

but they were also aiming at AGI and

play06:56

training Their Own Foundation models

play06:58

obviously the leaders of character AI

play07:00

must have been somewhat disappointed by

play07:02

those New Foundation models because

play07:04

essentially they've been bought out or

play07:05

hollowed out by Google not actually

play07:08

buying a rival company but taking its

play07:10

key talent and IP but at this point you

play07:13

might be starting to notice somewhat of

play07:15

a trend if the incrementally greater

play07:18

intelligence of new models were down to

play07:20

obscure tricks or Arcane knowledge then

play07:23

you'd expect smaller Labs like character

play07:25

AI to be doing as well as the biggest

play07:28

Labs but if it was all about sheer scale

play07:31

of data and compute you'd expect the

play07:33

leaders to be increasingly well meta and

play07:36

Google and that is more or less what

play07:38

we're seeing though of course measuring

play07:39

that intelligence is quite hard we do

play07:41

have the LM CIS chatbot Arena

play07:43

leaderboard in which the new version of

play07:46

Gemini 1.5 Pro takes the lead at almost

play07:49

1300 ELO but if you'll notice we have

play07:51

GPT 40 mini coming third ahead indeed of

play07:55

Claude 3.5 Sonic which in my own

play07:58

Benchmark is far and a way ahead if we

play08:01

just relied on these ELO rankings you'd

play08:03

think well if open AI can come up with a

play08:06

tiny model doing almost as good as the

play08:09

rest they must be doing amazingly but

play08:11

lmis recently did something great which

play08:13

was release a batch of raw data showing

play08:16

comparisons between the models and which

play08:18

one won and I looked through the dozens

play08:21

of examples and one Trend emerged claw

play08:24

3.5 Sonic essentially refused more

play08:27

requests than GPT 40 Mini even when both

play08:30

models couldn't perform a task like

play08:32

creating an image natively GPT 40 mini

play08:35

gave it I guess more of a go it at least

play08:37

describe the image that it would create

play08:39

or in this example when the models were

play08:41

asked a political question GPC 40 mini

play08:44

gave a response whereas Claude 3.5 Sonic

play08:48

just apologized and said it wouldn't

play08:49

provide analysis now I have noticed that

play08:51

myself that Claude 3.5 Sonic is more

play08:54

sensitive than any other model but

play08:56

that's not a sign of lacking

play08:58

intelligence so to the the extent that

play09:00

we're going to call language modeling

play09:01

intelligence Claude 3.5 Sonic is far

play09:04

more capable than GPC 40 mini but this

play09:07

Rance to answer certain questions could

play09:09

explain the leaderboard rankings now yes

play09:12

to anyone following the channel I have

play09:13

been testing the new version of Gemini

play09:15

1.5 Pro on my simple bench the final

play09:18

scores will be presented on a website

play09:21

that I'm hoping to release before the

play09:22

next video but in the early testing it

play09:25

performs slightly worse than 3.5 Sonic

play09:28

but far better than other models so in

play09:30

that sense this lead B position could be

play09:33

far more Justified at least than GPT 40

play09:35

mini for those who haven't heard of my

play09:37

new reasoning Benchmark humans score

play09:39

over 90% quite easily whereas models

play09:42

like the new Gemini 1.5 pro version

play09:44

score around 25% it would however be

play09:47

somewhat hypy to say that this is

play09:49

another step toward AGI as one of the

play09:52

co-founders of Google deepmind recently

play09:55

said but whether you think that new

play09:56

Gemini 1.5 Pro is the best or Claude 3.5

play10:00

Sonic certainly more and more people are

play10:03

now shifting their API spend away from

play10:05

openai openai letting other labs have

play10:08

the lead for a couple of weeks could be

play10:10

a timing issue but a couple of months

play10:12

just makes it seem like they don't have

play10:14

a reply but as I mentioned at the start

play10:16

at least open a eyes models are the ones

play10:19

being chosen by figure O2 these humanoid

play10:22

robots have an onboard mic and speaker

play10:24

so hopefully you could chat to them like

play10:26

you would the new openai advanced voice

play10:28

mode in other words seamlessly with very

play10:31

low latency Adcock the founder of figure

play10:34

said that the default user interface to

play10:36

our robot will be speech apparently the

play10:39

robot can work for around 20 hours

play10:42

straight which is even more than me

play10:44

reading the latest AI papers its hands

play10:46

have 16 degrees of freedom and

play10:48

apparently human equivalent strength so

play10:51

honestly even though it can speak back

play10:53

to you you might not want to speak back

play10:55

to it now apparently these fig2 robots

play10:58

can perform certain tasks autonomously

play11:01

and self-correct and that data flywheel

play11:04

will be in effect and one might well say

play11:06

that the argument that ubiquitous robot

play11:09

assistance will arrive before artificial

play11:12

general intelligence looks more

play11:14

plausible than ever and speaking of a

play11:16

data flywheel you may already know that

play11:19

AI Labs including open AI have used

play11:21

weights and biases this video sponsor to

play11:24

track Frontier machine learning

play11:26

experiments but what you might not know

play11:28

is that weit and bu now have a weave a

play11:30

lightweight tour kit to confidently

play11:32

iterate on llm applications they also

play11:35

produce free prompt and llm agent

play11:37

courses on their website and if you

play11:39

didn't know that you can let them know

play11:41

that you came from me by using my

play11:43

customized link and the link is in the

play11:46

description so in short the vibe is

play11:48

Shifting but what isn't changing is my

play11:51

gratitude for you watching all the way

play11:53

to the end if you're Keen to carry on

play11:55

the conversation with me personally I'd

play11:57

love to see you over on AI ins iders on

play12:00

patreon but to everyone watching have a

play12:03

wonderful day

Rate This
โ˜…
โ˜…
โ˜…
โ˜…
โ˜…

5.0 / 5 (0 votes)

Related Tags
AI FutureLeadershipAGI RaceOpenAIGreg BrockmanAnthropicAlignmentElon MuskSam AltmanLM BenchmarksRobotics