Project Orion (GPT-5 Strawberry) Imminent, Already Shown To FEDS!

Matthew Berman
27 Aug 202415:42

Summary

TLDRThe script discusses OpenAI's potential release of a new AI model, 'Strawberry,' aimed at improving complex reasoning and math capabilities. It highlights OpenAI's demonstration of the technology to federal officials and its use in developing 'Orion,' their next flagship model. The script also touches on the debate over open-source AI, the challenges of generating high-quality training data, and the competition in the AI industry. It speculates on how Strawberry might be integrated into products like Chat-GPT and the importance of reducing AI 'hallucinations' for enterprise adoption.

Takeaways

  • 🧠 OpenAI is reportedly on the verge of releasing a new AI model, possibly named 'Strawberry', which could be a significant technical breakthrough for complex task completion.
  • πŸ” 'Strawberry' is associated with slower thinking models that plan ahead and perform better in math and logic due to multi-step reasoning capabilities.
  • πŸ€” OpenAI has demonstrated this unreleased technology to American National Security officials, possibly setting a new standard for AI transparency with policymakers.
  • πŸ“ˆ There is speculation that 'Strawberry' could be used to generate high-quality training data for 'Orion', OpenAI's next flagship large language model.
  • πŸ’‘ The technology behind 'Strawberry' might be too slow for consumer use in a chat-based product, indicating a potential focus on less time-sensitive applications.
  • πŸ“Š OpenAI's business has seen substantial growth with LLM sales and Chat GPT subscriptions tripling, despite ongoing losses.
  • πŸš€ The launch of 'Orion' is crucial for OpenAI to stay ahead of competitors like Google's DeepMind and Anthropic, who are also advancing in AI capabilities.
  • πŸ›‘οΈ 'Strawberry' could help reduce 'hallucinations' or errors in AI models by improving the quality of training data, which is vital for enterprise adoption.
  • πŸ’‘ The potential applications of 'Strawberry' in solving complex math problems could be lucrative for fields like aerospace and structural engineering.
  • πŸ”‘ The origins of 'Strawberry' lie in research started years ago by Ilya Sutskever, OpenAI's former Chief Scientist, who has since left to start a competing AI lab.
  • πŸ“° The AI community is abuzz with speculation and excitement similar to tech industry hype around new product releases, indicating the fervor around anticipated AI advancements.

Q & A

  • What is the significance of the 'Strawberry' AI model mentioned in the transcript?

    -Strawberry is a mysterious technical breakthrough that could enhance Open AI models' ability to complete complex tasks such as math problems, which conversational AI traditionally struggles with. It represents a slower thinking model that can plan ahead, think through problems, and perform better at math and logic reasoning.

  • What is the connection between Strawberry and Orion in the context of Open AI's developments?

    -Strawberry is being used to generate high-quality training data for Orion, which is reportedly Open AI's next flagship large language model in development. This is important as much of the training data on the internet has already been used, and Strawberry could help overcome limitations on obtaining enough high-quality data.

  • Why did Open AI demonstrate Strawberry to American National Security officials?

    -Open AI demonstrated Strawberry to American National Security officials to set a new standard for AI developers, especially as advanced AI increasingly becomes a national security concern. This could be part of Open AI's push to be more transparent with policymakers who could cause the company problems if they feel threatened by its technology.

  • What is the potential impact of Open AI's decision to demonstrate unreleased technology to government officials?

    -Demonstrating unreleased technology to government officials could influence how AI is regulated and integrated into national security strategies. It might also set a precedent for other AI developers to engage with policymakers before releasing new technologies.

  • What is the role of 'Qar' in the context of Strawberry and Orion?

    -Qar is a model that, like Strawberry, focuses on slower thinking and multi-step reasoning. It is part of the same initiative to develop AI that can handle complex tasks better, and it could be incorporated into future Open AI products to improve their reasoning capabilities.

  • How does the transcript suggest that Strawberry and similar models could change the AI industry?

    -The transcript suggests that Strawberry and similar models could change the AI industry by setting new standards for complex reasoning and problem-solving in AI. It also implies that these models could lead to more transparent interactions between AI developers and policymakers.

  • What is the potential application of Strawberry in improving AI's reasoning capabilities for coding errors in GitHub?

    -Strawberry's potential application in GitHub could involve using its improved reasoning capabilities to fix non-critical coding errors. Given that Strawberry takes time to think through problems, it might be well-suited for tasks that do not require immediate responses.

  • How does the transcript discuss the challenges of AI-generated data, also known as synthetic data?

    -The transcript discusses the challenges of synthetic data by noting that it is derivative and may not create new knowledge. It raises concerns about whether training AI models with data generated by another model could lead to sustainable improvements in AI capabilities.

  • What is the significance of Open AI's potential move to simplify and shrink Strawberry through a process called distillation?

    -The significance of simplifying and shrinking Strawberry through distillation is to make the technology suitable for use in a chat-based product before Orion is released. This suggests that the original Strawberry model might be too slow for consumer settings where immediate responses are expected.

  • How does the transcript address the competition in the AI industry, especially with regards to reasoning capabilities?

    -The transcript addresses competition by mentioning other companies like Google's DeepMind and Anthropic, which are also developing AI models with improved reasoning capabilities. It highlights the competitive landscape and the need for Open AI to launch something incredible to stay ahead.

  • What is the potential business impact of Open AI's new models on fields that require complex problem-solving like math-heavy fields?

    -The potential business impact could be significant, as AI that can solve tough math problems could be lucrative in fields such as Aerospace and Structural Engineering, where existing AI isn't great at handling complex mathematical tasks.

Outlines

00:00

πŸš€ Open AI's New Model 'Strawberry' and Orion

The script discusses the imminent release of Open AI's new model, 'Strawberry,' and its potential demonstration to the federal government. It highlights the connection between 'Strawberry' and 'Orion,' Open AI's next frontier model. The article by Stephanie Palazolo suggests that 'Strawberry' could aid in complex task completion, such as math problems, which are traditionally challenging for conversational AI. The script also speculates on the implications of Open AI's transparency with policymakers and the national security concerns surrounding advanced AI. It touches on the debate over open-source AI and the difficulty of keeping AI advancements secret, especially from competitors like China.

05:00

πŸ€– Strawberry's Role in Enhancing AI Reasoning and Training Data

This paragraph delves into the specifics of 'Strawberry,' a model that enables long-term thinking and multi-step reasoning, which are significant improvements over traditional conversational AI. The script mentions the efforts to simplify 'Strawberry' through a process called distillation, possibly for integration into a chat-based product. It also discusses the potential applications of 'Strawberry,' such as improving the accuracy of AI responses in less time-sensitive scenarios, like coding error correction on GitHub or the use in AI agents. The paragraph further explores the financial aspects of Open AI, including its revenue from LLM sales and subscriptions, and the importance of launching a successful flagship model like 'Orion' to stay competitive in the market.

10:01

🧠 Synthetic Data and Reducing AI 'Hallucinations'

The script addresses the use of 'Strawberry' for generating synthetic data to train new models like 'Orion,' overcoming the limitations of obtaining high-quality real-world data. It discusses the potential of 'Strawberry' to reduce errors or 'hallucinations' in AI models, which is crucial for enterprise adoption and critical applications. The importance of having accurate training data to minimize ambiguity and improve the AI's problem-solving abilities is highlighted. The paragraph also touches on the competitive landscape with other companies like Google's DeepMind and Anthropic, and the challenges of creating new knowledge through AI models.

15:03

🌐 Speculations and Anticipations for Open AI's Future Models

The final paragraph wraps up with a discussion on the speculations surrounding Open AI's models, the role of 'Strawberry' in research, and its potential impact on future models. It mentions the origins of 'Strawberry' in research conducted by Ilya Sutskever and its development by Jacob and Simon Sedom. The script also references the departure of key researchers from Open AI and the competitive AI lab landscape. It concludes with a lighter note on the excitement and speculation in the AI community, akin to the buzz around tech product releases, and hints at the upcoming in-depth coverage of the new Open AI model.

Mindmap

Keywords

πŸ’‘Open AI

Open AI refers to a research laboratory that focuses on advancing digital intelligence. In the context of the video, Open AI is on the verge of releasing a new AI model, which is a significant development in the field of artificial intelligence. The script discusses how Open AI's innovations, such as the 'Strawberry' AI, are poised to impact the technology landscape.

πŸ’‘Strawberry AI

Strawberry AI is a mysterious technical breakthrough mentioned in the script, which is associated with Open AI's efforts to improve AI's capability to complete complex tasks, such as math problems. It is a key concept in the video, illustrating the potential for AI to engage in slower, more thoughtful processing akin to human reasoning.

πŸ’‘Orion

Orion is described as Open AI's next frontier model in AI development. It is suggested to be a significant leap from current models, indicating a new direction in AI technology. The script implies that Orion will utilize the advancements of Strawberry AI to enhance its capabilities.

πŸ’‘QAR

QAR, or 'Question Answering with Reasoning,' is a concept related to AI's ability to provide answers after engaging in a reasoning process. The script mentions QAR in relation to Strawberry AI, highlighting how these models can offer more accurate responses by thinking through problems rather than providing immediate, potentially incorrect, answers.

πŸ’‘AI-generated data

AI-generated data, as discussed in the script, refers to the creation of new training data by AI models themselves. This is a novel approach that Open AI might be using to train models like Orion, overcoming the limitations of relying solely on existing real-world data.

πŸ’‘Distillation

In the context of AI, distillation is a process used to simplify a complex model into a smaller, more efficient one without losing much of its functionality. The script mentions Open AI's efforts to distill Strawberry AI, making it suitable for consumer applications like chat-based products.

πŸ’‘Hallucinations

In AI, 'hallucinations' refer to the incorrect or fabricated information that a model might produce. The script discusses how Strawberry AI could help reduce these errors in models like Orion, improving the reliability of AI outputs.

πŸ’‘Large Language Models (LLMs)

LLMs are AI models designed to process and understand large amounts of human-generated text. The script talks about the competition in the LLM space, with Open AI's Orion aiming to outperform existing models like Google's DeepMind and Anthropic's models.

πŸ’‘Synthetic data

Synthetic data, as mentioned in the script, is artificially created data that mimics the characteristics of real data. Open AI is reportedly using synthetic data generated by Strawberry AI to train Orion, which could potentially lead to more accurate and less ambiguous AI models.

πŸ’‘Agents

In AI, 'agents' refer to autonomous systems that can perform tasks on behalf of users. The script suggests that Open AI is developing AI agents that could benefit from the reasoning capabilities of Strawberry AI, indicating a future where AI can assist in more complex and personalized ways.

πŸ’‘Reasoning capabilities

Reasoning capabilities in AI denote the ability of a model to think logically and draw conclusions. The script emphasizes the improved reasoning capabilities of Strawberry AI and its potential integration into products like Chat-GPT, which could lead to more accurate and thoughtful AI responses.

Highlights

Open AI is reportedly on the verge of releasing a new model named 'Strawberry', which may have been demonstrated to the federal government.

The 'Strawberry' model is associated with slower thinking models that can plan ahead and excel in math and logic.

Open AI's demonstration of unreleased technology to government officials could set a new standard for AI developers, particularly concerning national security.

The technology behind 'Strawberry' could be used to generate high-quality training data for Open AI's next flagship model, 'Orion'.

Open AI is considering simplifying 'Strawberry' through a process called 'distillation' for use in a chat-based product before 'Orion' is released.

The 'Strawberry' model might be too slow for consumer settings, indicating a potential trade-off between accuracy and response time.

Open AI's monthly revenue from LLMs and Chat GPT subscriptions has tripled to $283 million, though the company is likely still operating at a loss.

The 'Orion' model is expected to be a significant advancement over current models, leaving competitors behind.

Open AI faces competition from models like Google's DeepMind and Anthropic's latest LLM, which are also improving in reasoning capabilities.

AI's ability to solve complex math problems could be lucrative in fields like aerospace and structural engineering.

The 'Strawberry' model has its roots in research by Ilya Sutskever, Open AI's former Chief Scientist.

New York Magazine's article humorously discusses the speculation around 'Strawberry' and AI, likening it to the excitement around tech product releases.

The potential applications of 'Strawberry' include improving reasoning in chatbots and reducing errors in AI-generated responses.

Open AI is raising more capital to support the development of advanced AI products like 'Strawberry'.

The 'Strawberry' model's slower pace is likened to human thought processes, taking time to reason and iterate on results.

The transcript suggests that Open AI may soon launch AI agents that could benefit from 'Strawberry's' capabilities.

Transcripts

play00:00

apparently open AI is on the verge of

play00:02

releasing their brand new model and now

play00:04

we have a ton of additional information

play00:06

about it including the fact that open AI

play00:09

may actually have shown this new

play00:10

technology to the federal government so

play00:12

we're going to go over all of that right

play00:14

now all right so we have two main

play00:17

articles from the information that were

play00:18

posted over the last couple days first

play00:20

open AI shows strawberry so we've talked

play00:22

a lot about strawberry talked a lot

play00:24

about qar I'll link those videos in the

play00:26

description below and I will drop a link

play00:28

to the full article in the description

play00:31

below open AI shows strawberry AI to the

play00:33

feds and uses it to develop Orion and

play00:37

Orion is new to me and apparently it is

play00:40

their next Frontier Model now here's the

play00:43

thing if you were ever doubting Jimmy

play00:45

apples the infamous open AI leaker look

play00:48

at this tweet from all the way back at

play00:50

November 24th 2023 let's conquer the

play00:53

cosmos mood curious Jimmy so with that

play00:57

did he actually know and then as of

play00:59

today yeah been waiting since last year

play01:01

for this mood patients Jimmy and he's

play01:03

referring to the information article

play01:06

about Orion so maybe he knew about it so

play01:09

let's read a little bit about the

play01:10

article so this article by Stephanie

play01:12

palazolo starts with in case you were

play01:15

wondering why Sam Alman cryptically

play01:16

posted a picture of strawberries earlier

play01:18

this month the answer almost certainly

play01:19

has to do with strawberry a mysterious

play01:22

technical breakthrough that could help

play01:24

open AI models complete complex tasks

play01:27

such as math problems that

play01:28

conversational AI has traditionally

play01:30

struggled with so just a quick catchup

play01:32

qar strawberry it's all referencing the

play01:35

same thing slower thinking models models

play01:38

that can actually plan ahead think

play01:40

through problems multi-step reasoning

play01:43

and actually do much better at math and

play01:46

logic and reasoning these models don't

play01:48

just return the first token they predict

play01:50

they actually go off and think about it

play01:52

and actually do long-term thinking in

play01:55

mid July Reuters reported on the

play01:57

existence of strawberry and this morning

play01:58

we published this piece with even more

play02:00

details and we will go over that in a

play02:02

moment here is the most interesting bit

play02:04

this summer his team Sam alman's team

play02:07

demonstrated the technology to American

play02:10

National Security officials said a

play02:12

person with direct knowledge of those

play02:13

meetings which haven't been previously

play02:15

reported by demonstrating an unreleased

play02:17

technology to government officials open

play02:19

AI could be setting a new standard for

play02:21

AI developers especially as advanced AI

play02:23

increasingly becomes a national security

play02:25

concern the demonstration could be part

play02:27

of open ai's push to be more transparent

play02:29

with policy makers who could cause the

play02:31

company problems if they feel threatened

play02:34

by its technology and scanning down a

play02:36

bit and maybe also take a shot at meta

play02:38

platforms for releasing openweight AI

play02:40

that China and everyone else can access

play02:44

so you all know how I feel about this

play02:46

open source is the way to go to think

play02:48

that a single private company is going

play02:51

to be able to protect secrets to protect

play02:53

the weights against an adversary like

play02:56

China infiltrating and getting that IP

play02:59

it's just so unlikely that they will be

play03:01

able to protect it forever right here

play03:03

meta CEO Mark Zuckerberg says it's

play03:05

inevitable that China will get it one

play03:07

way or the other and I agree all it

play03:09

takes is one slip up in security which

play03:12

we're humans We Make Mistakes One slip

play03:14

up and then China will have the weights

play03:16

anyways and it's not just about the

play03:18

weights you also need the compute to

play03:21

power it you need a lot of other things

play03:22

as well I did a full review of Mark

play03:24

Zuckerberg's thoughts on open source and

play03:26

it overlaps with my own quite a bit so

play03:28

I'll drop that video in the description

play03:30

below let's keep reading about

play03:31

strawberry and why it matters to Orion

play03:34

and again strawberry qar it is different

play03:36

from Orion Oran is reportedly their next

play03:39

Frontier Model that is going to leave

play03:41

everybody else in the dust one of the

play03:43

most important applications of

play03:44

strawberry is to generate high quality

play03:47

training data for Orion open ai's next

play03:49

Flagship large language model that's in

play03:51

development now why is that important

play03:54

much of the training data on the

play03:55

internet has already been used there's

play03:57

almost none left that is public and

play04:00

easily accessible now it's all behind

play04:02

pay walls and authentication that's why

play04:05

open AI has been doing deals with

play04:07

different Publications and Reddit and

play04:09

all these other companies and that's why

play04:10

Twitter's data set is so important to x.

play04:13

a but Sam Alman a while ago hinted that

play04:16

maybe more and more training data isn't

play04:18

actually necessary there's actually two

play04:20

other approaches that are possible one

play04:22

that they can actually do a lot more

play04:24

with existing data and the second path

play04:26

which it looks like they're talking

play04:28

about here is actually generating in

play04:30

synthetic data using one model for

play04:32

another model now there's been a lot of

play04:34

doubt that that is a sustainable model

play04:36

because if you're creating data with one

play04:38

model to be used on another model it's

play04:40

basically derivative data a lot of

play04:42

people think that large language models

play04:44

aren't actually creating new knowledge

play04:46

it's simply outputting what it already

play04:49

knows so if it's only able to Output

play04:51

what it already knows how could it

play04:52

really train a model to be that much

play04:54

better using strawberry could help Orion

play04:56

reduce the number of hallucinations or

play04:58

errors it produces resarch ERS tell me

play05:00

that's because AI models learn from

play05:02

their training data so the more correct

play05:03

examples of complex reasoning they see

play05:06

the better so what could be going on

play05:08

here is they have a set of data and

play05:10

Orion and qstar are doing long-term

play05:12

thinking and multi-step planning and

play05:14

basically thinking through the data to

play05:16

make sure it's accurate and then

play05:17

producing new data that is highly

play05:20

accurate but there's also a push within

play05:22

open AI to simplify and Shrink

play05:23

strawberry through a process called

play05:25

distillation so it can be used in a

play05:27

chat-based product before Orion is

play05:29

released so that tells me that the

play05:30

technology behind strawberry might just

play05:32

be so slow it's unusable in a consumer

play05:35

setting and that would make sense and

play05:37

I'll actually touch more on this in a

play05:39

bit but the gist is strawberry and qar

play05:42

takes time that's a feature not a bug it

play05:44

actually takes a lot of time it thinks

play05:46

through it just like humans would when

play05:48

you ask someone something they don't

play05:50

immediately spit out the first thing

play05:51

that they think of or at least most

play05:53

people don't instead they take time to

play05:54

think through it they might take notes

play05:56

especially hard questions reasoning

play05:58

logic math these are things things that

play06:00

we take our time with we write down

play06:02

thoughts and we iterate on the result

play06:04

here the information admits we're not

play06:06

sure what a strawberry based product

play06:08

might look like but we can make an

play06:10

educated guess one obvious idea of what

play06:13

strawberry could be an actual product is

play06:15

incorporating strawberries improved

play06:16

reasoning capabilities into Chachi BT

play06:18

and that's the obvious one however

play06:20

though these answers would likely be

play06:22

more accurate they also might be slower

play06:25

that means that strawberry might be ill

play06:26

suited for applications where users

play06:28

expect immediate responses like open AI

play06:30

search GPT search engine but ideal for

play06:32

less time sensitive use cases like

play06:34

fixing non-critical coding errors in

play06:36

GitHub now another good use case for

play06:38

this is Agents if I have an agent

play06:40

working for me 24 hours a day I really

play06:42

don't need a response immediately I can

play06:44

give it these superpowers powered by

play06:46

qstar powered by strawberry allow it to

play06:48

go do its thing and then bring me back

play06:50

the best possible response you could

play06:52

imagine a Noto distant future where chat

play06:55

GPT users are able to toggle strawberry

play06:57

on and off depending on how sensitive

play06:59

they request are so that's the first

play07:00

article from the information about Orion

play07:03

that's the first time I've heard of it

play07:04

so hopefully I'm bringing it to you for

play07:06

the first time too now chubby from X who

play07:09

is an amazing follow if you're not

play07:11

already following him what chubby

play07:12

touches on here is that strawberry is

play07:14

slower because rather than just

play07:16

responding immediately with whatever it

play07:18

thinks is the next correct token it

play07:20

actually takes its time and does what is

play07:22

called system to thinking rather than

play07:24

just system one thinking now let's read

play07:27

more about strawberry so this is another

play07:28

article from from the information open

play07:30

AI raises to launch strawberry reasoning

play07:32

AI to boost chatbot business so it is

play07:35

reported that open AI is looking to

play07:36

raise even more Capital which is insane

play07:39

they've already raised so much but you

play07:41

know what it costs a lot of money to

play07:42

build these models at least for now its

play07:44

researchers are trying to launch a new

play07:45

artificial intelligence product they

play07:47

believe can reason through tough

play07:48

problems much better than its existing

play07:50

AI here they just touch briefly on qar

play07:54

we've already discussed it quite a bit

play07:56

when given additional time to think the

play07:57

strawberry model can also answer

play07:59

customers questions about more

play08:00

subjective topics such as product

play08:02

marketing strategies to demonstrate

play08:04

strawberry's prowess with language

play08:06

related tasks open AI employees have

play08:08

shown their co-workers how strawberry

play08:10

can for example solve New York Times

play08:12

connections a complex word puzzle so if

play08:15

it were given the puzzle and just asked

play08:17

to Output immediately what it thought

play08:19

system one thinking then it's not able

play08:21

to solve it nearly as effectively as if

play08:23

it had time to iterate and use maybe

play08:26

tree of thought or Chain of Thought or

play08:29

any these other more advanced

play08:31

capabilities where the model can

play08:33

actually look ahead plan and test things

play08:36

out come back and test other things out

play08:38

as they see things are working or not

play08:39

here the information talks about the

play08:41

open AI business and its sales of llms

play08:44

to corporations and of chat GPT

play08:46

subscriptions have roughly tripled to

play08:48

283 million in monthly Revenue compared

play08:51

to a year ago that's insane though its

play08:53

monthly losses are likely higher than

play08:55

that as reported by the information and

play08:57

I've already explained that they are

play08:58

probably still still losing money but

play09:00

they're not going to go bankrupt anytime

play09:02

soon the company is privately valued at

play09:03

86 billion but open AI prospects rest in

play09:07

part on the eventual launch of a new

play09:09

flagship llm it is currently developing

play09:11

Cod named Orion so we already talked

play09:13

about that now why is this next model so

play09:15

important well open source pretty much

play09:17

caught up with GPT 40 llama 370b llama

play09:20

345b is a fraction of the price you can

play09:23

run it locally you can fine-tune it and

play09:25

it's nearly as good or as good as GPT 4

play09:29

o for the majority of use cases and not

play09:32

only that we have the clad models we

play09:34

have grock 2 we have more grock models

play09:36

coming we have perplexity for search so

play09:39

the competition is heating up really

play09:42

quickly and open AI really needs to

play09:44

launch something incredible to jump

play09:46

ahead because intelligence is being

play09:48

driven down to a cost of nothing here

play09:51

they talk about the same things that we

play09:53

touched on in the last article but let

play09:54

me just read it again open AI is also

play09:56

using the bigger version of strawberry

play09:57

to generate data for training Orion and

play09:59

set a person with knowledge of the

play10:00

situation that kind of AI generated data

play10:02

is known as synthetic something we've

play10:04

touched on a lot on this channel it

play10:06

means that strawberry could help open AI

play10:08

overcome limitations on obtaining enough

play10:11

highquality data to train new models

play10:12

from Real World data such as text or

play10:14

images pulled from the internet and here

play10:17

apparently open aai is going to be

play10:19

launching agents soon so strawberry

play10:21

could Aid upcoming open aai agents that

play10:23

person said using strawberry to generate

play10:25

higher quality training data could help

play10:27

open AI reduce the number of Errors its

play10:29

model generate otherwise known as

play10:30

hallucinations now one of the biggest

play10:32

blockers one of the biggest hurdles for

play10:34

artificial intelligence in general to be

play10:36

adopted within Enterprise settings and

play10:38

more critical settings is the fact that

play10:40

it still hallucinates there's a bunch of

play10:43

things that you could do to reduce

play10:44

hallucinations whether that's improving

play10:46

your prompts having multiple agents talk

play10:48

to each other and verify kind of like an

play10:50

agentic system and pulling in

play10:52

information from the internet to verify

play10:54

and of course doing your own

play10:55

verification but at the end of the day

play10:57

if we want large language models to be

play10:59

fully autonomous and to really run at

play11:02

the scale that we believe it can it

play11:04

really needs to reduce hallucinations to

play11:06

nearly zero so the CEO of agent startup

play11:08

minion Ai and former Chief Architect of

play11:10

GitHub co-pilot says imagine a model

play11:12

without hallucinations a model where you

play11:14

ask it a logic puzzle and it's right on

play11:17

the first try the reason why the model

play11:19

is able to do that is because there is

play11:21

less ambiguity in the training data so

play11:23

it's guessing less so from Sam Alman at

play11:26

an event back in may we feel like we

play11:28

have enough data for this next model so

play11:31

they are close and I really cannot wait

play11:34

to see the next thing that they bring

play11:35

out I have said this a lot it's really

play11:38

hard for me to imagine that a private AI

play11:40

company is going to have some completely

play11:42

unique research or unique Tech that

play11:44

allows them to make a 50 100%

play11:46

Improvement on what's already out there

play11:49

maybe 10% maybe 20% but a large leap

play11:53

from what is already on the market today

play11:55

is really hard for me to imagine just

play11:57

because of the way that the scientific

play11:58

community operates all of these papers

play12:01

get published and unless open AI

play12:04

research scientists are the only ones in

play12:05

the world who actually thought of a new

play12:07

idea and it didn't leak out anywhere

play12:10

it's just very unlikely that they're

play12:12

going to have some completely new

play12:13

technology here they talk about why

play12:15

solving math problems could be so

play12:17

lucrative for open AI as a business AI

play12:20

that solves tough math problems could be

play12:21

a potentially lucrative application

play12:24

given that existing AI isn't great at

play12:26

math heavy Fields such as Aerospace and

play12:28

Structural Engineering now that isn't

play12:30

necessarily true Google's deepmind team

play12:32

actually got I think silver at the math

play12:35

olympiads so it's definitely possible

play12:38

today already that AI can be extremely

play12:41

good at math and right here they

play12:43

reference what I just mentioned Google

play12:45

deep mine said its AI would beat most

play12:47

human participants in the international

play12:49

mathematical Olympiad another major

play12:51

rival anthropic said its latest llm

play12:53

could write more complicated software

play12:55

code than its prior llms could and

play12:57

answer questions about charts and graphs

play12:58

than to Improvement in its reasoning

play13:01

capability so again competition is there

play13:03

and specifically about anthropic the

play13:05

cloud model is seemingly the favored

play13:07

large language model for coding use

play13:09

cases everybody seems to be using cursor

play13:12

the IDE that is AI native in conjunction

play13:15

with the claw models it continues with

play13:18

to improve models reasoning some

play13:19

startups have been using a cheap hack

play13:21

that involves breaking down a problem

play13:23

into smaller steps though the

play13:25

workarounds are slow and expensive it's

play13:27

kind of weird to call it a cheap hack

play13:28

these are these are discovered prompt

play13:30

techniques and Frameworks to wrap your

play13:32

large language model to make them more

play13:34

effective and I don't see that as a

play13:36

cheap hack at all and yeah they tend to

play13:38

be slower but what if you just power it

play13:40

with Gro grq and then all of a sudden

play13:43

you have all the benefits of these

play13:44

additional layers of technique to get

play13:46

the best response but you're also

play13:48

getting hundreds of tokens per second so

play13:51

it ends with what ilas saw strawberry

play13:53

has its roots in research it was started

play13:55

Years Ago by ilas suser then open ai's

play13:58

Chief scientist he recently left to

play13:59

start a competing AI lab before he left

play14:02

open AI researchers Jacob and Simon

play14:05

sedor built on SS's work by developing a

play14:07

new math solving model qar alarming some

play14:10

researchers focused on AI safety and

play14:13

basically the entire super alignment

play14:15

team has left since then so what did

play14:17

Ilia really see and last just to add a

play14:20

pinch of fun to this from New York

play14:22

Magazine the AI guys are driving

play14:24

themselves mad and this is in reference

play14:26

to Strawberry man and qar and the

play14:29

strawberry model Jimmy apples and all of

play14:32

this conjecture and speculation and you

play14:35

know what it's all in good fun it's the

play14:37

same thing that basically every other

play14:38

industry does when apple is about to

play14:40

release a new phone everybody who

play14:42

follows Apple starts talking about what

play14:44

it could possibly be and talking about

play14:46

the leaks and you know what we're so

play14:48

excited about what's to come why not add

play14:51

a little speculation and try to figure

play14:53

out what might come so this article is

play14:55

pretty funny it just talks about how I

play14:57

rule the world Mo was responded to by

play15:00

Sam Alman and all of his hypey tweets

play15:03

which is great and here's the awesome

play15:05

part they actually included one of my

play15:07

videos about qar which is flattering

play15:10

it's so cool they also go on to talk

play15:12

about Lily Ashwood who everybody's

play15:15

trying to figure out if she's AI or not

play15:17

we talked about that on the live stream

play15:18

last week so there's definitely a lot

play15:20

going on a lot of fun and as soon as the

play15:23

new open AI model comes out you know I'm

play15:25

going to cover it in depth and I'll give

play15:27

you all the information so be sure to

play15:29

subscribe to my channel and as a

play15:31

reminder I have an awesome newsletter so

play15:32

you can stay up to dat in the latest AI

play15:34

Trends and news Matthew burman.com check

play15:37

it out if you enjoyed this video please

play15:39

consider giving a like And subscribe and

play15:40

I'll see you in the next one

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
AI ModelsStrawberry AIOrion ModelReasoning AITech BreakthroughNational SecurityAI TransparencySynthetic DataChatbot BusinessAI Hallucinations