AI News: We're One Step Closer To AGI This Week!

Matt Wolfe
19 Jul 202426:03

Summary

TLDRThis week in AI news, OpenAI outlines five levels of AGI progress, with current tech near Level 2. A new reasoning tech, 'Strawberry,' is speculated to push towards Level 3. Controversies arise over OpenAI's employee practices, while advancements in AI-generated images, videos, and educational tools are highlighted. HubSpot offers a free AI resource bundle, and new models like GPT-40 Mini and MistrAL Nemo promise enhanced capabilities. The video also covers AI's role in forensics, e-commerce, and the upcoming Olympics.

Takeaways

  • 🧠 OpenAI has outlined five levels of progress towards AGI, with current technology at Level 1 for chatbots and near Level 2 for reasoners that can solve human-level problems.
  • 🔍 OpenAI is reportedly working on a new reasoning technology called 'Strawberry', which is believed to be a rebranded version of the previously mentioned 'QAR' project.
  • 🤖 'Strawberry' aims to perform deep research by autonomously navigating the internet, indicating a move towards Level 3 AI that can take actions on behalf of users.
  • 🗳️ Whistleblowers from OpenAI have raised concerns about the company's practices regarding employee rights and non-disclosure agreements, which OpenAI has refuted.
  • 🖼️ There is speculation that OpenAI's image model, DALL-E, may have received an update, as demonstrated by clearer text in generated images.
  • 🎥 Sora AI's video generation capabilities are generating excitement, though the release of other tools like Runway Gen 3 and Lum's Dream Machine have somewhat diminished the anticipation.
  • 🛍️ HubSpot has released a free resource bundle for using AI at work, including flowcharts, templates, and checklists to integrate AI into various professional tasks.
  • 🏫 Andrej Karpathy, former OpenAI figure, has announced a new venture, Eureka Labs, focusing on AI-assisted education.
  • 📱 Anthropic's CLA, an AI chat app, has been released for Android, expanding accessibility beyond iOS.
  • 🎨 Google's new app, Google Vids, is an AI-powered video creation tool integrated with Google Workspace, currently in testing with a select group of users.
  • 🎵 YouTube is testing 'Music Sound Search', a feature that identifies songs based on user humming or singing, akin to the Shazam app.

Q & A

  • What are the five levels of AI progress outlined by Open AI?

    -Open AI has defined five levels of AI progress: Level one includes chatbots with conversational language capabilities. Level two involves reasoners capable of human-level problem-solving. Level three consists of agents that can take actions on our behalf. Level four is about innovators that can aid in invention by creating novel ideas. Finally, level five is about organizations and AI that can do the work of an entire organization.

  • What is the current status of AI in terms of these levels?

    -As of the script's recording, AI is at level one, with capabilities of chatbots and conversational AI. It is very close to reaching level two, which involves reasoners that can solve problems at a human level.

  • What is the new reasoning technology being developed by Open AI?

    -Open AI is working on a new reasoning technology codenamed 'Strawberry'. It is designed to not only generate answers to queries but also to plan ahead and navigate the internet autonomously to perform deep research.

  • What is the aim of the 'Strawberry' project?

    -The aim of the 'Strawberry' project is to enable AI to perform long-horizon tasks or complex tasks that require planning ahead and performing a series of actions over an extended period of time. It is intended to conduct research by browsing the web autonomously.

  • What controversy has arisen regarding Open AI's practices with employees?

    -Whistleblowers have claimed that Open AI illegally prevents employees from talking to government regulators about problems at work and removes their rights to rewards for whistleblowing. Open AI refutes these claims, stating they have a policy that protects employees' rights to make protected disclosures.

  • What updates have been speculated about Dolly image model?

    -There is speculation that the Dolly image model may have received an update, as demonstrated by a post showing an image with legible text. Previously, Dolly struggled with generating clear text in images.

  • What is the significance of the new demo videos from Sora?

    -The new demo videos from Sora showcase impressive black and white clips, indicating advancements in AI-generated video technology. These demos are increasing anticipation for the release of Sora.

  • What is the new AI-powered video creation app announced by Google?

    -Google has announced 'Google Vids', an AI-powered video creation app designed for work and integrated with the Google Workspace Suite. It is currently being tested with a select group of trusted testers.

  • What is the controversy about the source of training data for AI models?

    -There is controversy over the use of data from Uther AI, which collects data from various sources, including YouTube videos, to train AI models. Some YouTubers have noticed their transcripts being used in the training data set, raising questions about data privacy and consent.

  • What new model has Open AI launched recently?

    -Open AI has launched a new model called GPT 40 Mini. This model is designed to be more cost-efficient and faster than its predecessor, GPT 3.5, and supports text and vision in the API with future support for text, image, video, and audio inputs and outputs.

  • What collaboration has resulted in the creation of Mistral Nemo?

    -Nvidia and Mistral have teamed up to create Mistral Nemo, a 12 billion parameter model designed for on-device deployment, catering to businesses with limited internet connectivity or stringent data privacy requirements.

Outlines

00:00

🤖 AI Progress Levels and New Reasoning Tech

Open AI has outlined five levels of AI development, starting with chatbots at level one and progressing towards AGI at level five. The current focus is on level two, which involves AI capable of human-level problem-solving. Open AI is reportedly close to achieving this. Additionally, a new reasoning technology codenamed 'Strawberry' is under development, aiming to perform deep research autonomously by navigating the internet. This technology is speculated to be a rebranded version of the previously mentioned QAR. It is designed to not only answer queries but also plan ahead and conduct research tasks. Open AI is also facing scrutiny over alleged suppression of employee whistleblowers, with claims that they illegally restrict communication with government regulators and remove rights to rewards for whistleblowing.

05:01

🖼️ Updates in AI Image and Video Generation

There are speculations about updates to the Dolly image model, which seems to have improved its text generation capabilities. Users can now generate images with clearer text using Dolly 3, accessible for free on Bing's website. New demo videos from Sora showcase impressive black and white clips, increasing anticipation for the tool. Meanwhile, Runway Gen 3 and Lum's Dream Machine have dampened some of the excitement as they also offer AI-generated video capabilities. HubSpot has released a free resource bundle for using AI at work, including flowcharts, templates, and checklists to ensure AI-generated content aligns with brand voice and quality standards. Andre Karpathy, a former Open AI employee, has announced a new venture, Eureka Labs, focusing on AI-assisted education.

10:02

📱 AI Apps and Tools Expansion

Anthropic's CLA has expanded to Android, offering an alternative to the Chat GPT app. Gemini, an AI assistant for Android, can now answer general questions even when the device is locked. Google has announced Google Vids, an AI-powered video creation app integrated with the Workspace Suite, currently in testing. YouTube is testing a new feature called 'Music Sound Search', similar to Shazam, and an AI-generated conversational radio. Controversy has arisen over the use of YouTube videos in training data for AI models, with claims that companies like Apple, Nvidia, and Anthropic have used data from Uther AI's 'pile', a dataset that includes copied transcripts from YouTube videos.

15:02

🎨 AI in Design and Education

Microsoft's Designer platform, similar to Canva, is being integrated into various Microsoft apps, allowing users to create images and edit them on mobile devices. New features include a restyle function and the ability to use the co-pilot sidebar for image creation. Mistral, a French AI company, has released a new model called Codstrol Mamba, designed for code generation and capable of handling large inputs. Amazon has introduced Rufus, an AI shopping assistant within the Amazon app, providing shopping and political information. Meta has decided not to offer multimodal models in the EU due to regulatory uncertainties, focusing instead on text models.

20:03

🏥 AI in Healthcare and Forensics

AI systems have achieved a 96% accuracy rate in determining sex from dental X-rays, primarily in forensic applications. This technology is less accurate with children under six, who have not yet lost their teeth. The news highlights the potential of AI in forensics and medical diagnostics. Additionally, Open AI has launched a new model called GPT 40 Mini, designed to be more cost-efficient and faster than its predecessor, GPT 3.5. The model supports text and vision in the API, with plans to include text, image, video, and audio inputs and outputs in the future.

25:05

🏅 AI in Sports and Future Developments

Google has become the official AI sponsor for Team USA in the upcoming Summer Olympics, promising a significant AI presence in related advertising. Nvidia and Mistral have collaborated on a new model, Mistral Nemo, designed for on-device deployment, suitable for environments with limited internet connectivity or strict data privacy requirements. The model is efficient and can be used on laptops and desktop PCs. Google's AI is expected to be prominently featured during the Olympics, indicating a growing integration of AI in various aspects of society.

📢 Staying Updated with AI News

The video concludes with a reminder to stay updated with the latest AI news by subscribing to the channel and visiting futur.tools, where AI tools and news are curated. The host expresses gratitude for the viewers and sponsors, hinting at exciting upcoming AI developments.

Mindmap

Keywords

💡AGI

AGI stands for Artificial General Intelligence, which is the ability of an AI system to understand, learn, and apply knowledge across a wide range of tasks at a level equal to or beyond that of a human. In the video, OpenAI outlines five levels towards AGI, indicating the progression from basic chatbots to systems capable of deep research and organizational work.

💡Conversational AI

Conversational AI refers to systems that can engage in dialogue with humans using natural language processing. The script mentions level one of AGI as chatbots and AI with conversational language, which is the current state of AI like Chat GPT and Claude.

💡Reasoners

Reasoners are AI systems that can perform human-level problem solving. The video script places the current state of AI close to reaching level two, which includes reasoners, suggesting that AI is on the verge of being able to solve complex problems like humans.

💡Autonomous Agents

Autonomous Agents are AI systems that can take actions on behalf of users, such as booking flights or responding to emails. The script describes level three of AGI as involving these agents, which can perform tasks without direct human intervention.

💡Innovators AI

Innovators AI refers to AI systems that can aid in invention by creating novel ideas. The video mentions level four of AGI, which includes these innovators, indicating a future where AI can contribute to creative and inventive processes.

💡Organizational AI

Organizational AI is capable of performing the work of an entire organization. The script describes the final level of AGI, level five, as involving organizational AI, suggesting a future where AI can manage and execute complex organizational tasks.

💡Strawberry

Strawberry is a code name for a new reasoning technology being developed by OpenAI. The script discusses this technology as an advancement towards level two of AGI, with the aim of not just answering queries but also planning ahead to perform deep research autonomously on the internet.

💡Whistleblowers

Whistleblowers are individuals who report misconduct or illegal activities within an organization to the public or authorities. The script mentions that some whistleblowers from OpenAI claim the company has questionable practices regarding employee rights and communication with government regulators.

💡AI-generated Videos

AI-generated videos are videos created using AI technology, which can produce content without human intervention. The script discusses tools like Runway Gen 3 and Lum's Dream Machine, which allow users to create AI-generated videos, as well as the anticipation for Sora's capabilities.

💡AI in the Workplace

The script touches on the integration of AI in the workplace, emphasizing the importance of understanding when to use AI tools like Chat GPT and providing resources to ensure AI-generated content aligns with a brand's voice and quality standards.

💡GPT-4

GPT-4 refers to a new model released by OpenAI, which is more powerful and efficient than its predecessor, GPT-3.5. The script mentions the release of GPT-4 Mini, a smaller and faster version of GPT-4, designed to compete with other smaller models from different AI companies.

💡Mistral Mamba

Mistral Mamba is an open-source model developed by the French AI company, Mistral, for code generation. With the ability to handle up to 256,000 tokens, it offers double the input and output capacity compared to OpenAI's current offerings, making it a significant development for programmers and developers.

💡Rufus

Rufus is an AI shopping assistant rolled out by Amazon, designed to answer questions about shopping and other topics based on the data available within Amazon's platform. The script mentions Rufus as an example of AI integration into consumer applications for enhanced user experience.

💡AI Act

The AI Act refers to the European Union's regulatory framework for AI, which is still being finalized. The script discusses how the unpredictable nature of this regulatory environment affects companies like Meta, which decide not to offer multimodal AI models in the EU due to compliance concerns.

Highlights

Open AI outlines five levels of progress towards AGI, with current technology at Level 1 for chatbots and nearing Level 2 for reasoners.

Open AI is developing a new reasoning technology code-named 'Strawberry', aiming for deep research capabilities and autonomous internet navigation.

Internal document leak suggests 'Strawberry' may have been previously known as QAR, with testing scores over 90% on a math dataset.

Whistleblowers allege Open AI suppresses employee communication with government regulators and removes rights to whistleblower rewards.

Dolly image model by Open AI may have received an update, as evidenced by clearer text in generated images.

Sora AI generates impressive black and white video demos, increasing anticipation for its release.

Runway Gen 3 and Lum's Dream Machine offer current AI-generated video creation, partially reducing excitement for Sora.

HubSpot offers a free bundle of resources for using AI at work, including flowcharts, templates, and checklists.

Andre Karpathy announces a new venture, Eureka Labs, focusing on AI-assisted education.

Anthropic's CLA is now available on Android, expanding accessibility beyond iOS.

Google introduces Google Vids, an AI-powered video creation app for work, integrated with Google Workspace.

YouTube Music Sound Search allows users to identify songs by humming or singing, similar to Shazam.

Controversy arises over the use of YouTube videos in AI training data without consent.

Microsoft rolls out Designer platform updates, integrating AI image creation into various Microsoft apps.

Mistral releases Codstrol Mamba, an open-source model for code generation with a large token input capacity.

Amazon launches Rufus, an AI shopping assistant within the Amazon app, providing recommendations and answers.

Meta will not offer multimodal AI models in the EU due to regulatory uncertainty and GDPR compliance concerns.

Google AI is the official sponsor for Team USA in the Summer Olympics, with ads promoting Google AI products.

Open AI releases GPT 40 Mini, a faster and smarter model than GPT 3.5, with support for text and vision inputs.

Nvidia and Mistral collaborate on MistrAL Nemo, a 12 billion parameter model designed for on-device deployment.

AI systems achieve 96% accuracy in determining sex from dental X-rays, with potential applications in forensics.

Transcripts

play00:00

here's the AI news that you might have

play00:01

missed this

play00:02

[Music]

play00:04

week starting with the fact that open AI

play00:08

mapped out their five levels towards the

play00:11

progress of AGI here's a quick breakdown

play00:13

of those five levels so level one they

play00:16

say would be chat Bots and AI with

play00:18

conversational language that's

play00:19

essentially what we're getting right now

play00:21

out of chat GPT Claude llama 3 things

play00:24

like that then you have level two which

play00:26

is reasoners that can do human level

play00:28

problem solving they claim their very

play00:30

very close to level two right now then

play00:33

it moves on to level three which is

play00:34

Agents or systems that can take actions

play00:36

on our behalf you know book flights for

play00:39

us respond to emails for us things like

play00:41

that then there's level four which they

play00:43

say is the innovators AI that can Aid an

play00:46

invention it's actually going to create

play00:48

novel ideas and then finally you have

play00:50

level five which is organizations and AI

play00:53

that can do the work of an organization

play00:56

so basically we're right here right now

play00:58

we're at level one almost on the level

play01:01

two we're right on that precipice of

play01:02

level two and open AI believes that

play01:04

we'll sort of move through each of these

play01:06

levels on our way to a true AGI now this

play01:09

was actually released on July 11th last

play01:12

week but it didn't make it into last

play01:13

week's video but it felt a little extra

play01:16

relevant this week because this week we

play01:19

got the news that open aai has been

play01:21

working on a new reasoning technology

play01:24

code named Strawberry now I've seen a

play01:26

lot of other YouTube videos and a lot of

play01:28

X posts about this with people

play01:30

speculating a lot of people believe that

play01:33

this was what was originally called qar

play01:36

and they've now rebranded it to

play01:38

Strawberry now this comes from a leaked

play01:40

internal document it says teams inside

play01:42

of open AI are working on strawberry

play01:45

according to a copy of a recent internal

play01:46

open AI document seen by Reuters Reuters

play01:49

couldn't ascertain the precise state of

play01:51

the document and they could not

play01:53

establish how close strawberry is to

play01:55

actually being publicly available likely

play01:57

not very close the aim of strawberry is

play02:00

to not just generate answers to queries

play02:02

but to plan ahead enough to navigate the

play02:04

internet autonomously and reliably to

play02:07

perform what open AI terms deep research

play02:10

the article does claim the strawberry

play02:12

project was formerly known as qar this

play02:14

exact article here was actually updated

play02:17

after it was published to add this

play02:19

section it says a different Source

play02:21

briefed on the matter said openai has

play02:22

tested AI internally that scored over

play02:25

90% on a math data set a benchmark of

play02:28

Championship math problems now writers

play02:31

couldn't actually figure out if they

play02:33

were referring to the strawberry project

play02:35

or not but it kind of sounds like

play02:37

they're probably the same project now

play02:39

outside of this little information that

play02:41

we have on it there's a lot of

play02:43

speculation around what this is but it

play02:45

sounds like this strawberry is pretty

play02:48

close to that level two towards AGI that

play02:51

we were just talking about and at the

play02:52

moment it sounds like the main purpose

play02:54

of This research is for this new model

play02:57

to essentially do research among the

play03:00

capabilities openi is aiming strawberry

play03:02

at is performing long Horizon tasks or

play03:06

complex tasks that require a model to

play03:07

plan ahead and perform a series of

play03:09

actions over an extended period of time

play03:12

open AI specifically wants its model to

play03:14

use these capabilities to conduct

play03:15

research by browsing the web

play03:17

autonomously with the assistance of a

play03:19

computer using agent or CUA that can

play03:22

then take actions based on its findings

play03:25

again not much more is known about this

play03:27

and open AI has notoriously been kind of

play03:29

hush H about their upcoming models and

play03:32

usually we don't know much about them

play03:34

until literally the day they make the

play03:35

announcement of them and while we're on

play03:37

the topic of open AI more people from

play03:40

open AI are coming out and talking about

play03:43

some of the questionable practices of

play03:46

open ai's business it came out this week

play03:48

that some whistleblowers are saying that

play03:50

open AI illegally keeps employees from

play03:52

talking to government Regulators about

play03:54

problems at work and removes their

play03:56

rights to rewards for blowing the

play03:58

whistle this comes from a letter that

play04:00

was sent to Gary gendler the chair of

play04:02

the SEC openai refutes the claims saying

play04:05

that they have a policy on

play04:06

whistleblowers that protects employees

play04:09

rights to make protected disclosures now

play04:11

this isn't the first time that open ai's

play04:14

policies and contracts with their

play04:16

employees have been under scrutiny

play04:17

several weeks ago it came out that open

play04:20

AI was forcing people to sign

play04:21

non-disparagement agreements and if they

play04:23

talked badly about open AI they could

play04:26

lose their vested equity in the company

play04:28

well now it sounds like people are

play04:29

coming forward and claiming that if we

play04:31

blow the whistle on anything we think

play04:33

open AI is doing that's somewhat

play04:36

suspicious we can also lose our vested

play04:38

equity and that's not legal now the

play04:40

sources are Anonymous open AI claims

play04:43

that that's not actually happening but I

play04:45

have a feeling open AI is probably in

play04:47

the process of overhauling a lot of

play04:49

their contracts that get signed by any

play04:51

new employees that join the company due

play04:54

to all this scrutiny again back when

play04:55

most of these people probably signed up

play04:57

for open AI the company wasn't nearly as

play04:59

big or in the public eye now that they

play05:01

are as big and in the public eye a lot

play05:03

of this stuff is kind of starting to

play05:05

come under the microscope and while

play05:06

we're on the topic of open II there's

play05:08

some speculation that maybe the dolly

play05:10

image model recently got an update this

play05:12

is a post from my buddy angry penguin

play05:14

over on X where he shows off an image

play05:16

that he created that has pretty legible

play05:19

writing in it this clearly says evolve

play05:21

all over it previously DOI struggled

play05:24

with words if I go to Dolly and say

play05:26

create an image of a robot holding a

play05:27

sign that says Please Subscribe I

play05:30

actually get an image that has the words

play05:32

kind of nailing it so I think DOI did

play05:34

make some updates because the text seems

play05:37

to be much more clear than it used to be

play05:39

and if you're interested in using Dolly

play05:41

but you don't have a chat GPT Plus

play05:43

account you can always go to bing.com

play05:45

slim imagesc create and use Dolly 3 for

play05:49

free over on Bing's website which if

play05:52

Dolly 3 did get an update it appears to

play05:54

have also rolled out here inside of

play05:56

being image Creator I mean two and a

play05:59

half out of four sort of nailed what I

play06:01

was going for we also got some new demo

play06:03

videos from Sora we can see this like

play06:06

black and white video showing all sorts

play06:08

of different clips in black and white

play06:09

that actually look pretty dang

play06:11

impressive these were shared on Matthew

play06:14

burman's X account here's another one

play06:16

that he shared of like ocean crashing

play06:18

and uh I don't know a gas station or

play06:21

motel or something uh but yeah we're

play06:24

getting more demos from Sora which is

play06:25

just making people more anxious to

play06:27

actually get their hands on it but right

play06:29

now we do sort of have that itch

play06:32

scratched in the form of Runway gen 3

play06:35

and lum's dream machine we can actually

play06:37

create some pretty good AI generated

play06:39

videos now with those tools it sort of

play06:42

damped down the excitement for Sora a

play06:45

little bit but the fact that this can

play06:46

create much longer videos and open AI

play06:49

tends to kind of set the bar for almost

play06:51

everything they put out I'm still

play06:52

excited about it but I have gotten that

play06:54

need met with some other tools recently

play06:57

if you're somebody that uses AI at work

play06:59

or or you're thinking about using AI at

play07:01

work you need to check out hubspot's

play07:03

completely free bundle called five

play07:06

essential resources for using chat GPT

play07:09

at work and honestly if you haven't

play07:10

embraced AI yet just remember what

play07:13

Nvidia CEO Jensen hang said AI will be

play07:16

the most transformative technology of

play07:18

the 21st century it will affect every

play07:21

industry and aspect of our lives so if

play07:23

you're not using AI to speed up and

play07:25

improve the quality of your work well

play07:27

your competitors probably are so the

play07:29

link to this totally free resource is

play07:31

down in the description below and trust

play07:33

me this is something you're going to

play07:34

want to look through it includes

play07:36

interesting flowcharts on when you

play07:37

should or shouldn't use chat GPT there's

play07:40

also a really cool template that you can

play07:42

use with chat GPT to make sure that any

play07:44

of the content it creates for you

play07:46

follows your Brand's voice you've got an

play07:49

AI generated content refinement

play07:51

checklist to double check the ai's work

play07:53

and ensure that you're putting something

play07:54

out into the world that you really want

play07:56

to be putting out there there's also a

play07:58

four-page check list that you can easily

play08:00

follow all about adopting AI in the

play08:03

workplace and a super comprehensive PDF

play08:06

guide on how to supercharge your day

play08:08

with chat GPT and here's what's really

play08:10

cool about this if you scroll all the

play08:13

way down to the bottom of this document

play08:15

they have 100 ways to try chat GPT today

play08:19

and it's got some really cool prompts to

play08:21

test out things like providing

play08:22

recommendations for improving customer

play08:24

service and support providing

play08:26

recommendations for improving SEO

play08:28

helping with email management and

play08:30

organization and so much more again it

play08:32

is really comprehensive and really

play08:35

helpful the link to this completely free

play08:37

resource from HubSpot is in the

play08:39

description below and thank you so much

play08:41

to HubSpot for sponsoring this video

play08:43

Andre karpathy who previously worked at

play08:46

open aai and then recently stepped away

play08:48

just announced a new Venture that he's

play08:50

working on he said excited to share that

play08:52

I'm starting an AI plus education

play08:54

company called Eureka Labs at Eureka

play08:57

Labs they're building a new kind of

play08:58

school that is AI I native they say that

play09:00

subject matter experts who are deeply

play09:02

passionate great at teaching infinitely

play09:04

patient and fluent in all of the world's

play09:06

languages are also very scarce and

play09:08

cannot personally tutor all 8 billion of

play09:11

us on demand so it sounds like he's

play09:13

creating a sort of online education

play09:16

where the teacher still designs the

play09:17

course materials but they are supported

play09:19

leveraged and scaled with an AI teaching

play09:21

assistant who is optimized to help guide

play09:24

the students through them so this

play09:26

announcement here is really all that we

play09:28

have he hasn't really talked a whole lot

play09:29

about this more than the announcement

play09:31

but what I'm sort of imagining is that a

play09:33

teacher with subject matter expertise

play09:36

goes in creates an entire course on

play09:39

their subject matter all of that

play09:41

information is then sort of trained in

play09:43

the AI I don't know if they're going to

play09:45

use you know retrieval augmented

play09:47

generation or they're going to fine-tune

play09:48

the model I don't know exactly how

play09:49

they're going to do it but all of the

play09:51

information that the teacher taught is

play09:52

now available inside of the model so

play09:55

anybody who wants to learn this stuff

play09:57

can then work with a tutor who

play09:59

understands all of the training material

play10:01

and can speak to the student in whatever

play10:03

language they want to learn in this will

play10:06

massively scale the ability of an

play10:08

individual teacher who can teach the

play10:10

concept once and then let their AI

play10:12

assistant teach it to everybody else who

play10:14

wants to learn that information again

play10:17

I'm just sort of speculating on what

play10:18

this is going to look like I don't know

play10:20

exactly but that's sort of what the

play10:22

concept sounds like to me if you're a

play10:24

fan of anthropics CLA and you don't have

play10:26

an iPhone well good news they just

play10:29

released it on Android it's been on iOS

play10:31

for a couple months now and they just

play10:33

now rolled out an Android version

play10:35

personally I'm still a fan of the chat

play10:36

GPT app a little bit more than the

play10:38

anthropic app just because the

play10:40

conversational voice portion of the chat

play10:42

GPT app is actually really really good

play10:44

when I'm on my computer I usually use

play10:46

either clot or perplexity when I'm using

play10:48

my phone I still go to the chat GPT app

play10:50

but I also understand most people

play10:51

probably don't want to pay for free

play10:53

separate chat subscriptions so if you

play10:55

really like the ability to have a voice

play10:57

conversation with an AI chat GPT still

play11:00

the way if you don't care about that you

play11:02

just want the best model in your hand

play11:04

clad is probably the best and they now

play11:06

have an Android app and since we're

play11:07

talking about Android phones Gemini now

play11:10

answers general questions when your

play11:11

Android phone is locked there's not too

play11:13

much more to share on this story it is

play11:15

exactly what it sounds like Google Now

play11:17

lets you get answers from Gemini without

play11:19

actually unlocking your device also this

play11:21

week Google announced Google vids vids

play11:24

is an AI powered video creation app

play11:25

that's designed for work and deeply

play11:27

integrated with the workspace Suite you

play11:29

use every day you can actually find it

play11:31

over at

play11:33

workspace.com

play11:35

product/ VDS right now it's not

play11:37

available to everybody they say we're

play11:39

currently testing this new application

play11:40

with a select group of trusted testers

play11:43

and according to the video on their

play11:44

website it looks like you give it a

play11:45

prompt like help me create a sales

play11:48

training video and then it will help

play11:50

create this like slide Style video for

play11:54

you there's a bunch of different styles

play11:56

that you can choose from and once you

play11:58

pick your St style you can speak out a

play12:01

script add a voice over to it and add

play12:04

stock footage to it to get the perfect

play12:06

sort of layout for your video and then

play12:08

it creates that sort of slide

play12:10

presentation video for you and since

play12:13

we're talking about Google and we're

play12:14

talking about video let's talk about

play12:15

this new feature that YouTube is rolling

play12:17

out called YouTube music sound search

play12:20

this sounds like a feature that's very

play12:21

similar to Shazam where you can have it

play12:24

listen to a snippet of music and it will

play12:26

figure out what song it is but you can

play12:27

also hum the song it'll be able to

play12:29

figure out what song it was based on

play12:31

your humming we can see some screenshots

play12:33

that they shared here they've got a

play12:34

little search box here with a microphone

play12:37

next to it I'm assuming they Click the

play12:39

microphone and then it says play sing or

play12:41

hum a song and then it figures out what

play12:44

song that you were trying to find just

play12:46

based on the singing or humming

play12:48

YouTube's also testing an AI generated

play12:51

conversational radio it'll let you

play12:53

create a custom radio by describing

play12:55

exactly what they want to hear this

play12:57

article goes on to say be on the lookout

play12:58

for ask for music any way you like card

play13:01

in your home feed this will open the

play13:03

chat-based UI with a field at the bottom

play13:05

that lets you ask for music there's been

play13:07

a little bit more controversy this week

play13:09

about the source of training data for

play13:12

various AI models this article on proof

play13:15

news here claims that Apple Nvidia and

play13:17

anthropic use thousands of swiped

play13:19

YouTube videos to train AI basically

play13:21

here's what's happening with this

play13:23

there's a company called Uther AI which

play13:25

is an open-sourced company that collects

play13:28

a whole bunch of data from from

play13:29

everywhere and puts it into what they

play13:31

call the pile and the pile is this giant

play13:34

data set that companies then use to

play13:38

train their AI models initially so that

play13:40

it can just sort of learn how the

play13:42

language works and just get injected

play13:44

with a ton of data to start well this

play13:47

pile is trained on publicly available

play13:49

data and it turns out that a lot of that

play13:51

publicly available data was transcripts

play13:54

that were copied and pasted straight

play13:57

from YouTube videos and a lot of

play13:59

YouTubers started to notice there's data

play14:01

in there from people like MKBHD Mr Beast

play14:04

PewDiePie and others and this site proof

play14:07

news.org actually put up a little search

play14:09

engine so that you can see if a video

play14:12

that you created or literally anybody's

play14:14

video is found within the piles data set

play14:17

now I did a search for my own name and

play14:20

no results were found I don't know

play14:22

whether I should be offended or relieved

play14:24

at the time the data was scraped my

play14:26

channel probably just wasn't big enough

play14:27

now after all this came out Apple

play14:29

stepped up to say yes we've used the

play14:31

pile for some research purposes and some

play14:34

training but the model that we're using

play14:36

inside of our Apple intelligence is not

play14:39

trained on the piles data so that

play14:41

information is not inside of Apple's

play14:42

training set according to them Microsoft

play14:45

has a platform called designer which if

play14:47

you're not familiar with it it's very

play14:49

similar to canva it's a platform to

play14:51

create things like YouTube thumbnails

play14:54

and banner ads and Instagram images and

play14:58

things like that well this designer

play15:00

platform is now being rolled out into a

play15:02

whole bunch of different Microsoft apps

play15:04

directly where you can use the co-pilot

play15:06

sidebar over here ask it to create a

play15:09

specific image in a specific style and

play15:12

it will actually use Microsoft's

play15:14

designer to create that image and allow

play15:17

you to pull it directly inside of your

play15:19

document or your PowerPoint or whatever

play15:22

Microsoft tool that you're using here's

play15:24

another example of it being shown off

play15:26

inside of Microsoft PowerPoint where

play15:29

they create some images with designer

play15:31

over here it generates some images and

play15:33

then they just pull that in as the

play15:35

background of the slide designer also

play15:37

got a free mobile app on both IOS and

play15:40

Android so you can easily create and

play15:42

edit images on the go on a mobile device

play15:45

now there's a whole bunch of other new

play15:47

features for designer if you want to

play15:49

dive deeper into it this is something

play15:50

that you're really interested in I will

play15:52

make sure it's linked up in the

play15:53

description so you can see all of the

play15:55

updates here the article is quite long

play15:58

and there are quite a few updates but it

play16:00

seems like it's got some other pretty

play16:01

cool features like this restyle feature

play16:03

you upload an image and it restyles it

play16:05

to a different style of image mistol the

play16:08

French AI company that develops large

play16:10

language models released a new model

play16:13

called Cod strol Mamba this is a model

play16:16

designed for code generation it is open

play16:19

source and it can handle an input of up

play16:21

to

play16:22

256,000 tokens which is double what open

play16:25

aai currently offers with chat GPT

play16:28

that's rough roughly 192,000 words

play16:31

between the amount of text inputed and

play16:33

the amount of text outputed this is a 7

play16:36

billion parameter model and offers a

play16:38

fast response time even with longer

play16:40

input text so if you're a coder and

play16:42

you're looking to try another large

play16:44

language model to see if it outperforms

play16:46

the other models you've tried maybe Cod

play16:48

stroll Mamba is a choice to try out

play16:50

Amazon started rolling out an AI

play16:52

shopping assistant called Rufus which

play16:54

apparently answers questions about

play16:56

shopping and also politics Rufus is

play16:59

essentially a chatbot just like chat GPT

play17:02

but it's built directly inside of the

play17:04

Amazon app and it's trained on the data

play17:07

that's in Amazon so you can ask what are

play17:10

the best lawn games for kids birthday

play17:12

parties and it will suggest lawn games

play17:14

as well as where to find them and buy

play17:16

them on Amazon this Verge article also

play17:18

tested some other questions and got it

play17:20

to answer questions about the candidates

play17:23

for the 2024 election I've got more bad

play17:25

news if you're in the EU it sounds like

play17:28

meta is not going to be offering their

play17:29

multimodal models in the European Union

play17:32

they will be offering their normal text

play17:34

input output models like llama but

play17:36

you're probably not going to be able to

play17:37

create AI images AI videos and anything

play17:41

other than more text if you're in the EU

play17:43

due to the eu's I guess unclear policies

play17:47

they say here we will release a

play17:48

multimodal llama model over the coming

play17:50

months but not in the EU due to the

play17:53

unpredictable nature of the European

play17:55

regulatory environment says here that

play17:57

meta's issue isn't with the the still

play17:59

being finalized AI act but rather with

play18:02

how it can train models using data from

play18:04

European customers while complying with

play18:06

gdpr the eu's existing data Protection

play18:09

Law the United Kingdom has nearly

play18:11

identical laws to gdpr but meta says it

play18:14

isn't seeing the same level of

play18:15

regulatory uncertainty and plans to

play18:18

launch its new model for the UK users

play18:20

here's something that I came across on X

play18:22

from johanis Stelzer I just thought it

play18:24

was really cool they hooked up a little

play18:26

uh mey device to their computer and they

play18:29

can turn the knobs to change different

play18:31

aspects of the images they appear to be

play18:33

using stable diffusion here and then

play18:36

using these knobs to change different

play18:38

elements within stable diffusion

play18:41

different sort of parameters and I just

play18:43

thought it looked really cool and I

play18:44

wanted to share it and they also put the

play18:46

code for it up on GitHub so if you want

play18:49

to play around with something like this

play18:51

and hook up sdxl to a mey device well

play18:54

that's available for you to do here's

play18:56

another article that I came across that

play18:57

I couldn't find a whole lot on I just

play18:59

thought it looked cool is from Gizmo

play19:01

China turn your selfie into a printable

play19:03

3D character with 10 cents AI powered

play19:06

app so apparently this is an app where

play19:08

you can upload a selfie and it will

play19:10

generate a 3D model based on that one

play19:13

selfie that is so good that you can 3D

play19:16

print it now I actually did some digging

play19:19

to try to find more info about what

play19:21

they're doing here and this was

play19:22

literally the only article I can find

play19:24

about it but as I learn more about it I

play19:26

do have a 3D printer I do love AI this

play19:29

is something I will be playing with if I

play19:30

can get my hands on it and here's

play19:32

something interesting AI systems achieve

play19:34

a 96% accuracy in determining the sex

play19:37

from dental X-rays so they basically

play19:39

trained an AI model on a whole bunch of

play19:42

dental images and then when they ran new

play19:45

dental images through it it was able to

play19:47

determine the sex of whose teeth those

play19:49

were at a rate of 96% accuracy and the

play19:53

ones that it wasn't accurate on that was

play19:55

mostly children the the article claims

play19:57

that it's less accurate if you're six or

play19:59

under or basically haven't lost your

play20:01

teeth yet now the main use case for

play20:03

something like this would be in

play20:04

forensics if they find you know skeletal

play20:07

remains or something they can actually

play20:08

identify the sex of the skeletal remains

play20:11

but I just thought it was fascinating so

play20:13

I thought I'd share with you so I

play20:14

started recording that video while I was

play20:16

in San Diego still and well now I'm on

play20:19

vacation in Colorado and a few more

play20:21

pieces of news came out that I wanted to

play20:23

make sure got shared in Friday's news

play20:25

video including the fact that open AI

play20:28

just launched a new model today on

play20:30

Thursday the day I record this called

play20:33

GPT 40 mini with pretty much every large

play20:37

language model Creator out there

play20:38

creating models that are smaller

play20:40

designed to be more cost-efficient and

play20:42

faster open AI needed to create a

play20:44

language model to compete so this new

play20:46

GPT 40 is replacing the old GPT 3.5 not

play20:52

quite as powerful as the full-on GPT 40

play20:55

but it is faster and smarter than the

play20:57

previous GPT 3.5 we can see that right

play21:00

now today GPT 40 mini supports text and

play21:03

vision in the API with support for text

play21:06

image video and audio inputs and outputs

play21:09

in the future it's got 128,000 token

play21:12

context window so you should still be

play21:14

able to put large amounts of text as

play21:17

your input however the output only

play21:19

supports 16,000 tokens we can see this

play21:23

comparison here of model evaluation

play21:25

scores with GPT 40 and pink being the

play21:29

best model and it pretty much performs

play21:31

the best across the board here in every

play21:34

test with GPT 40 mini this new model

play21:37

that was just released performing second

play21:40

best across pretty much all of these

play21:41

benchmarks here now keep in mind this is

play21:44

comparing it to these other companies

play21:46

smaller models it almost kind of feels

play21:48

unfair to be putting GPT 40 in here

play21:51

compared against you know Claude Haiku

play21:53

and Gemini flash which is both of those

play21:55

platforms smaller models while GPT 40 is

play21:58

open ai's current state-of-the-art model

play22:01

but nonetheless we can see how this new

play22:04

Mini version of GPT 4L outperforms all

play22:06

the other mini models that are out there

play22:09

if we log into our chat GPT account here

play22:11

up in the top left corner where you

play22:13

select the model you can see that we now

play22:15

have access to 40 40 mini and the Legacy

play22:20

gp4 now at the time of this recording

play22:22

when I try to not log in and just use

play22:25

the free version it's still claiming

play22:26

it's using chat GPT 3.5 although it does

play22:30

say here in chat GPT free plus and team

play22:34

users will be able to use GPT 40 mini

play22:36

starting today in other large language

play22:38

Model news Nvidia and mistol teamed up

play22:41

to create mistol Nemo this is a 12

play22:44

billion parameter model and it also has

play22:47

128,000 tokens just like the new GPT 40

play22:50

Mini model now what's cool about this

play22:52

model is it's actually designed to be

play22:54

run on device we can see here it says

play22:56

this model's efficiency and local

play22:58

deployment capabilities could attract

play23:00

businesses operating environments with

play23:02

limited internet connectivity or those

play23:05

with stringent data privacy requirements

play23:08

they do go on to say that it's more

play23:09

designed for laptops and desktop PCS

play23:11

than smartphones so companies that want

play23:14

to run a really really powerful large

play23:16

language model with a large context

play23:18

window that can take a lot of input and

play23:20

a lot of output text and maybe concerned

play23:22

about privacy or not have internet

play23:24

access well now they have a model that

play23:26

they can use that's going to provide

play23:28

pretty much everything you're going to

play23:29

need it says the model is immediately

play23:31

available and we have a link here with a

play23:34

downloadable version promised in the

play23:35

near future so you can actually try this

play23:38

model out over on nvidia's website if we

play23:41

come over to build. nvidia.com exlore

play23:45

slcover click on reasoning over on the

play23:47

left side we can see fresh off the press

play23:50

mral Nemo 12b instruct if we click in

play23:53

here we get a chat window where we can

play23:55

actually play around with this model if

play23:56

we want to again this is a cloud version

play23:59

where you can just sort of play around

play24:00

with it but a desktop version is coming

play24:03

soon and finally if you're planning on

play24:05

watching the Summer Olympics this year

play24:08

it looks like Google's AI is going to be

play24:10

everywhere Google is apparently the

play24:12

official AI sponsor for Team USA and

play24:16

claim that they're going to have ads all

play24:18

over for all of the various Google AI

play24:21

products so if you haven't heard enough

play24:23

about AI lately on TV well watching the

play24:26

Olympics you're going to see a lot of it

play24:28

and anyway that's all I got for you

play24:29

today again I record these videos on

play24:32

Thursday in this case some of it was

play24:33

recorded on Wednesday some of it was

play24:34

recorded on Thursday so if there was any

play24:36

news that came out late Thursday evening

play24:38

or on Friday it didn't make this video

play24:40

but it will be in next week's news video

play24:43

while I'm here on vacation I'm sort of

play24:45

slowing down on putting new videos out

play24:47

but I am going to publish my Friday news

play24:50

videos on schedule just like I always do

play24:53

just less other videos during the week

play24:55

this week if you like this video and you

play24:57

want to stay looped in on all the latest

play24:59

AI news the coolest AI tools interesting

play25:02

AI research and you know some of my own

play25:05

commentary and opinions around it make

play25:07

sure you like this video And subscribe

play25:08

to this channel it really helps my

play25:10

channel grow it will also ensure that

play25:12

you see more videos like this one inside

play25:14

your YouTube feed and if you haven't

play25:16

already make sure to check out futur

play25:18

tools. where I curate all of the coolest

play25:20

AI tools I come across I keep the AI

play25:23

news page up to date on pretty much a

play25:25

daily basis and we've got a free news

play25:28

newsletter where you can get all of the

play25:29

coolest AI tools and most interesting AI

play25:32

news delivered directly to your email

play25:34

inbox you can find it all over at futur

play25:37

tools. completely free thank you so much

play25:40

for tuning into this video I really

play25:41

appreciate you thank you so much to

play25:43

HubSpot for sponsoring this one I have a

play25:45

feeling the AI news is really going to

play25:47

start heating up again real soon there's

play25:49

a lot of cool things in the works that

play25:51

uh I've sort of been getting some sneak

play25:53

peeks of and I'm excited to share what's

play25:54

on the way so make sure you're

play25:56

subscribed make sure you're tuning into

play25:57

these videos I really appreciate you

play25:59

thanks so much for nning out with me

play26:01

I'll see you in the next video bye-bye

Rate This

5.0 / 5 (0 votes)

Ähnliche Tags
AI NewsOpenAIAGI LevelsStrawberry TechAI EthicsWhistleblowerAI EducationDolly ImageSora DemosAI Video CreationHubSpot AIAnthropic CLAGoogle VidsYouTube MusicAI Training DataMicrosoft DesignerMistral MambaAI ShoppingRufus AssistantEU AI PolicyMeta LlamaAI ForensicsGPT 40 MiniNvidia NemoAI Olympics
Benötigen Sie eine Zusammenfassung auf Englisch?