AI News: We're One Step Closer To AGI This Week!
Summary
TLDR: This week in AI news, OpenAI outlines five levels of AGI progress, with current tech near Level 2. A new reasoning technology, 'Strawberry,' is speculated to push towards Level 3. Controversies arise over OpenAI's employee practices, while advancements in AI-generated images, videos, and educational tools are highlighted. HubSpot offers a free AI resource bundle, and new models like GPT-4o mini and Mistral NeMo promise enhanced capabilities. The video also covers AI's role in forensics, e-commerce, and the upcoming Olympics.
Takeaways
- 🧠 OpenAI has outlined five levels of progress towards AGI, with current technology at Level 1 for chatbots and near Level 2 for reasoners that can solve human-level problems.
- 🔍 OpenAI is reportedly working on a new reasoning technology called 'Strawberry', which is believed to be a rebranded version of the previously reported 'Q*' project.
- 🤖 'Strawberry' aims to perform deep research by autonomously navigating the internet, indicating a move towards Level 3 AI that can take actions on behalf of users.
- 🗳️ Whistleblowers from OpenAI have raised concerns about the company's practices regarding employee rights and non-disclosure agreements, which OpenAI has denied.
- 🖼️ There is speculation that OpenAI's image model, DALL-E, may have received an update, as demonstrated by clearer text in generated images.
- 🎥 OpenAI's Sora video generation demos are generating excitement, though the release of other tools like Runway Gen-3 and Luma's Dream Machine has somewhat diminished the anticipation.
- 🛍️ HubSpot has released a free resource bundle for using AI at work, including flowcharts, templates, and checklists to integrate AI into various professional tasks.
- 🏫 Andrej Karpathy, former OpenAI figure, has announced a new venture, Eureka Labs, focusing on AI-assisted education.
- 📱 Anthropic's Claude, an AI chat app, has been released for Android, expanding accessibility beyond iOS.
- 🎨 Google's new app, Google Vids, is an AI-powered video creation tool integrated with Google Workspace, currently in testing with a select group of users.
- 🎵 YouTube is testing 'Music Sound Search', a feature that identifies songs based on user humming or singing, akin to the Shazam app.
Q & A
What are the five levels of AI progress outlined by OpenAI?
-OpenAI has defined five levels of AI progress: Level one includes chatbots with conversational language capabilities. Level two involves reasoners capable of human-level problem-solving. Level three consists of agents that can take actions on our behalf. Level four is about innovators that can aid in invention by creating novel ideas. Finally, level five is about organizations: AI that can do the work of an entire organization.
What is the current status of AI in terms of these levels?
-As of the script's recording, AI is at level one, with capabilities of chatbots and conversational AI. It is very close to reaching level two, which involves reasoners that can solve problems at a human level.
What is the new reasoning technology being developed by OpenAI?
-OpenAI is working on a new reasoning technology code-named 'Strawberry'. It is designed to not only generate answers to queries but also to plan ahead and navigate the internet autonomously to perform deep research.
What is the aim of the 'Strawberry' project?
-The aim of the 'Strawberry' project is to enable AI to perform long-horizon tasks or complex tasks that require planning ahead and performing a series of actions over an extended period of time. It is intended to conduct research by browsing the web autonomously.
What controversy has arisen regarding OpenAI's practices with employees?
-Whistleblowers have claimed that OpenAI illegally prevents employees from talking to government regulators about problems at work and removes their rights to rewards for whistleblowing. OpenAI denies these claims, stating they have a policy that protects employees' rights to make protected disclosures.
What updates have been speculated about the DALL-E image model?
-There is speculation that the DALL-E image model may have received an update, as demonstrated by a post showing an image with legible text. Previously, DALL-E struggled with generating clear text in images.
What is the significance of the new demo videos from Sora?
-The new demo videos from Sora showcase impressive black and white clips, indicating advancements in AI-generated video technology. These demos are increasing anticipation for the release of Sora.
What is the new AI-powered video creation app announced by Google?
-Google has announced 'Google Vids', an AI-powered video creation app designed for work and integrated with the Google Workspace Suite. It is currently being tested with a select group of trusted testers.
What is the controversy about the source of training data for AI models?
-There is controversy over the use of the Pile, a dataset from EleutherAI that collects data from various sources, including YouTube video transcripts, to train AI models. Some YouTubers have noticed their transcripts being used in the training data set, raising questions about data privacy and consent.
What new model has OpenAI launched recently?
-OpenAI has launched a new model called GPT-4o mini. This model is designed to be more cost-efficient and faster than its predecessor, GPT-3.5, and supports text and vision in the API with future support for text, image, video, and audio inputs and outputs.
What collaboration has resulted in the creation of Mistral Nemo?
-Nvidia and Mistral have teamed up to create Mistral Nemo, a 12 billion parameter model designed for on-device deployment, catering to businesses with limited internet connectivity or stringent data privacy requirements.
Outlines
🤖 AI Progress Levels and New Reasoning Tech
OpenAI has outlined five levels of AI development, starting with chatbots at level one and progressing towards AGI at level five. The current focus is on level two, which involves AI capable of human-level problem-solving. OpenAI is reportedly close to achieving this. Additionally, a new reasoning technology code-named 'Strawberry' is under development, aiming to perform deep research autonomously by navigating the internet. This technology is speculated to be a rebranded version of the previously reported Q* project. It is designed to not only answer queries but also plan ahead and conduct research tasks. OpenAI is also facing scrutiny over alleged suppression of employee whistleblowers, with claims that it illegally restricts communication with government regulators and removes rights to rewards for whistleblowing.
🖼️ Updates in AI Image and Video Generation
There is speculation about an update to the DALL-E image model, which seems to have improved at rendering text in images. Users can now generate images with clearer text using DALL-E 3, accessible for free on Bing's website. New demo videos from Sora showcase impressive black and white clips, increasing anticipation for the tool. Meanwhile, Runway Gen-3 and Luma's Dream Machine have dampened some of the excitement as they also offer AI-generated video capabilities. HubSpot has released a free resource bundle for using AI at work, including flowcharts, templates, and checklists to ensure AI-generated content aligns with brand voice and quality standards. Andrej Karpathy, a former OpenAI employee, has announced a new venture, Eureka Labs, focusing on AI-assisted education.
📱 AI Apps and Tools Expansion
Anthropic's Claude app has expanded to Android, offering an alternative to the ChatGPT app. Gemini, Google's AI assistant on Android, can now answer general questions even when the device is locked. Google has announced Google Vids, an AI-powered video creation app integrated with the Workspace suite, currently in testing. YouTube is testing a new feature called 'Music Sound Search', similar to Shazam, and an AI-generated conversational radio. Controversy has arisen over the use of YouTube videos in training data for AI models, with claims that companies like Apple, Nvidia, and Anthropic have used data from EleutherAI's 'the Pile', a dataset that includes transcripts copied from YouTube videos.
🎨 AI in Design and Education
Microsoft's Designer platform, similar to Canva, is being integrated into various Microsoft apps, allowing users to create images and edit them on mobile devices. New features include a restyle function and the ability to use the Copilot sidebar for image creation. Mistral, a French AI company, has released a new model called Codestral Mamba, designed for code generation and capable of handling large inputs. Amazon has introduced Rufus, an AI shopping assistant within the Amazon app, which answers shopping questions and, in one reported test, questions about politics. Meta has decided not to offer multimodal models in the EU due to regulatory uncertainties, focusing instead on text models.
🏥 AI in Healthcare and Forensics
AI systems have achieved a 96% accuracy rate in determining sex from dental X-rays, a capability aimed mainly at forensic applications. This technology is less accurate with children aged six or under, who have not yet lost their baby teeth. The news highlights the potential of AI in forensics and medical diagnostics. Additionally, OpenAI has launched a new model called GPT-4o mini, designed to be more cost-efficient and faster than its predecessor, GPT-3.5. The model supports text and vision in the API, with plans to include text, image, video, and audio inputs and outputs in the future.
🏅 AI in Sports and Future Developments
Google has become the official AI sponsor for Team USA in the upcoming Summer Olympics, promising a significant AI presence in related advertising. Nvidia and Mistral have collaborated on a new model, Mistral Nemo, designed for on-device deployment, suitable for environments with limited internet connectivity or strict data privacy requirements. The model is efficient and can be used on laptops and desktop PCs. Google's AI is expected to be prominently featured during the Olympics, indicating a growing integration of AI in various aspects of society.
📢 Staying Updated with AI News
The video concludes with a reminder to stay updated with the latest AI news by subscribing to the channel and visiting futur.tools, where AI tools and news are curated. The host expresses gratitude for the viewers and sponsors, hinting at exciting upcoming AI developments.
Keywords
💡AGI
💡Conversational AI
💡Reasoners
💡Autonomous Agents
💡Innovators AI
💡Organizational AI
💡Strawberry
💡Whistleblowers
💡AI-generated Videos
💡AI in the Workplace
💡GPT-4
💡Mistral Mamba
💡Rufus
💡AI Act
Highlights
OpenAI outlines five levels of progress towards AGI, with current technology at Level 1 for chatbots and nearing Level 2 for reasoners.
OpenAI is developing a new reasoning technology code-named 'Strawberry', aiming for deep research capabilities and autonomous internet navigation.
A Reuters report based on an internal document says 'Strawberry' was previously known as Q*; a separate source says OpenAI has internally tested AI that scored over 90% on a math dataset, though it is unclear whether that was Strawberry.
Whistleblowers allege OpenAI suppresses employee communication with government regulators and removes rights to whistleblower rewards.
The DALL-E image model by OpenAI may have received an update, as evidenced by clearer text in generated images.
New Sora demos show impressive black and white video clips, increasing anticipation for its release.
Runway Gen-3 and Luma's Dream Machine already offer AI-generated video creation, partially reducing excitement for Sora.
HubSpot offers a free bundle of resources for using AI at work, including flowcharts, templates, and checklists.
Andrej Karpathy announces a new venture, Eureka Labs, focusing on AI-assisted education.
Anthropic's Claude is now available on Android, expanding accessibility beyond iOS.
Google introduces Google Vids, an AI-powered video creation app for work, integrated with Google Workspace.
YouTube Music Sound Search allows users to identify songs by humming or singing, similar to Shazam.
Controversy arises over the use of YouTube videos in AI training data without consent.
Microsoft rolls out Designer platform updates, integrating AI image creation into various Microsoft apps.
Mistral releases Codestral Mamba, an open-source model for code generation with an input capacity of up to 256,000 tokens.
Amazon launches Rufus, an AI shopping assistant within the Amazon app, providing recommendations and answers.
Meta will not offer multimodal AI models in the EU due to regulatory uncertainty and GDPR compliance concerns.
Google AI is the official sponsor for Team USA in the Summer Olympics, with ads promoting Google AI products.
OpenAI releases GPT-4o mini, a faster and smarter model than GPT-3.5, with support for text and vision inputs.
Nvidia and Mistral collaborate on Mistral NeMo, a 12 billion parameter model designed for on-device deployment.
AI systems achieve 96% accuracy in determining sex from dental X-rays, with potential applications in forensics.
Transcripts
here's the AI news that you might have
missed this
[Music]
week starting with the fact that OpenAI
mapped out their five levels towards the
progress of AGI here's a quick breakdown
of those five levels so level one they
say would be chatbots and AI with
conversational language that's
essentially what we're getting right now
out of ChatGPT Claude Llama 3 things
like that then you have level two which
is reasoners that can do human level
problem solving they claim they're very
very close to level two right now then
it moves on to level three which is
Agents or systems that can take actions
on our behalf you know book flights for
us respond to emails for us things like
that then there's level four which they
say is the innovators AI that can aid in
invention it's actually going to create
novel ideas and then finally you have
level five which is organizations and AI
that can do the work of an organization
so basically we're right here right now
we're at level one almost on to level
two we're right on that precipice of
level two and OpenAI believes that
we'll sort of move through each of these
levels on our way to a true AGI now this
was actually released on July 11th last
week but it didn't make it into last
week's video but it felt a little extra
relevant this week because this week we
got the news that OpenAI has been
working on a new reasoning technology
code-named Strawberry now I've seen a
lot of other YouTube videos and a lot of
X posts about this with people
speculating a lot of people believe that
this was what was originally called Q*
and they've now rebranded it to
Strawberry now this comes from a leaked
internal document it says teams inside
of OpenAI are working on Strawberry
according to a copy of a recent internal
OpenAI document seen by Reuters Reuters
couldn't ascertain the precise date of
the document and they could not
establish how close Strawberry is to
actually being publicly available likely
not very close the aim of strawberry is
to not just generate answers to queries
but to plan ahead enough to navigate the
internet autonomously and reliably to
perform what open AI terms deep research
the article does claim the Strawberry
project was formerly known as Q* this
exact article here was actually updated
after it was published to add this
section it says a different source
briefed on the matter said OpenAI has
tested AI internally that scored over
90% on a MATH dataset a benchmark of
championship math problems now Reuters
couldn't actually figure out if they
were referring to the Strawberry project
or not but it kind of sounds like
they're probably the same project now
outside of this little information that
we have on it there's a lot of
speculation around what this is but it
sounds like this Strawberry is pretty
close to that level two towards AGI that
we were just talking about and at the
moment it sounds like the main purpose
of this research is for this new model
to essentially do research among the
capabilities OpenAI is aiming Strawberry
at is performing long horizon tasks or
complex tasks that require a model to
plan ahead and perform a series of
actions over an extended period of time
OpenAI specifically wants its model to
use these capabilities to conduct
research by browsing the web
autonomously with the assistance of a
computer using agent or CUA that can
then take actions based on its findings
again not much more is known about this
and OpenAI has notoriously been kind of
hush-hush about their upcoming models and
usually we don't know much about them
until literally the day they make the
announcement of them and while we're on
the topic of OpenAI more people from
OpenAI are coming out and talking about
some of the questionable practices of
OpenAI's business it came out this week
that some whistleblowers are saying that
OpenAI illegally keeps employees from
talking to government regulators about
problems at work and removes their
rights to rewards for blowing the
whistle this comes from a letter that
was sent to Gary Gensler the chair of
the SEC OpenAI refutes the claims saying
that they have a policy on
whistleblowers that protects employees
rights to make protected disclosures now
this isn't the first time that OpenAI's
policies and contracts with their
employees have been under scrutiny
several weeks ago it came out that
OpenAI was forcing people to sign
non-disparagement agreements and if they
talked badly about OpenAI they could
lose their vested equity in the company
well now it sounds like people are
coming forward and claiming that if we
blow the whistle on anything we think
OpenAI is doing that's somewhat
suspicious we can also lose our vested
equity and that's not legal now the
sources are anonymous OpenAI claims
that that's not actually happening but I
have a feeling OpenAI is probably in
the process of overhauling a lot of
their contracts that get signed by any
new employees that join the company due
to all this scrutiny again back when
most of these people probably signed up
for OpenAI the company wasn't nearly as
big or in the public eye now that they
are as big and in the public eye a lot
of this stuff is kind of starting to
come under the microscope and while
we're on the topic of OpenAI there's
some speculation that maybe the DALL-E
image model recently got an update this
is a post from my buddy angry penguin
over on X where he shows off an image
that he created that has pretty legible
writing in it this clearly says evolve
all over it previously DALL-E struggled
with words if I go to DALL-E and say
create an image of a robot holding a
sign that says Please Subscribe I
actually get an image that has the words
kind of nailing it so I think DALL-E did
make some updates because the text seems
to be much more clear than it used to be
and if you're interested in using DALL-E
but you don't have a ChatGPT Plus
account you can always go to
bing.com/images/create and use DALL-E 3 for
free over on Bing's website which if
DALL-E 3 did get an update it appears to
have also rolled out here inside of
Bing Image Creator I mean two and a
half out of four sort of nailed what I
was going for we also got some new demo
videos from Sora we can see this like
black and white video showing all sorts
of different clips in black and white
that actually look pretty dang
impressive these were shared on Matthew
Berman's X account here's another one
that he shared of like ocean crashing
and uh I don't know a gas station or
motel or something uh but yeah we're
getting more demos from Sora which is
just making people more anxious to
actually get their hands on it but right
now we do sort of have that itch
scratched in the form of Runway Gen-3
and Luma's Dream Machine we can actually
create some pretty good AI generated
videos now with those tools it sort of
damped down the excitement for Sora a
little bit but the fact that this can
create much longer videos and OpenAI
tends to kind of set the bar for almost
everything they put out I'm still
excited about it but I have gotten that
need met with some other tools recently
if you're somebody that uses AI at work
or you're thinking about using AI at
work you need to check out HubSpot's
completely free bundle called five
essential resources for using ChatGPT
at work and honestly if you haven't
embraced AI yet just remember what
Nvidia CEO Jensen Huang said AI will be
the most transformative technology of
the 21st century it will affect every
industry and aspect of our lives so if
you're not using AI to speed up and
improve the quality of your work well
your competitors probably are so the
link to this totally free resource is
down in the description below and trust
me this is something you're going to
want to look through it includes
interesting flowcharts on when you
should or shouldn't use ChatGPT there's
also a really cool template that you can
use with ChatGPT to make sure that any
of the content it creates for you
follows your brand's voice you've got an
AI generated content refinement
checklist to double check the AI's work
and ensure that you're putting something
out into the world that you really want
to be putting out there there's also a
four-page checklist that you can easily
follow all about adopting AI in the
workplace and a super comprehensive PDF
guide on how to supercharge your day
with ChatGPT and here's what's really
cool about this if you scroll all the
way down to the bottom of this document
they have 100 ways to try ChatGPT today
and it's got some really cool prompts to
test out things like providing
recommendations for improving customer
service and support providing
recommendations for improving SEO
helping with email management and
organization and so much more again it
is really comprehensive and really
helpful the link to this completely free
resource from HubSpot is in the
description below and thank you so much
to HubSpot for sponsoring this video
Andrej Karpathy who previously worked at
OpenAI and then recently stepped away
just announced a new venture that he's
working on he said excited to share that
I'm starting an AI plus education
company called Eureka Labs at Eureka
Labs they're building a new kind of
school that is AI native they say that
subject matter experts who are deeply
passionate great at teaching infinitely
patient and fluent in all of the world's
languages are also very scarce and
cannot personally tutor all 8 billion of
us on demand so it sounds like he's
creating a sort of online education
where the teacher still designs the
course materials but they are supported
leveraged and scaled with an AI teaching
assistant who is optimized to help guide
the students through them so this
announcement here is really all that we
have he hasn't really talked a whole lot
about this more than the announcement
but what I'm sort of imagining is that a
teacher with subject matter expertise
goes in creates an entire course on
their subject matter all of that
information is then sort of trained in
the AI I don't know if they're going to
use you know retrieval augmented
generation or they're going to fine-tune
the model I don't know exactly how
they're going to do it but all of the
information that the teacher taught is
now available inside of the model so
anybody who wants to learn this stuff
can then work with a tutor who
understands all of the training material
and can speak to the student in whatever
language they want to learn in this will
massively scale the ability of an
individual teacher who can teach the
concept once and then let their AI
assistant teach it to everybody else who
wants to learn that information again
I'm just sort of speculating on what
this is going to look like I don't know
exactly but that's sort of what the
concept sounds like to me if you're a
fan of Anthropic's Claude and you don't have
an iPhone well good news they just
released it on Android it's been on iOS
for a couple months now and they just
now rolled out an Android version
personally I'm still a fan of the
ChatGPT app a little bit more than the
Anthropic app just because the
conversational voice portion of the
ChatGPT app is actually really really good
when I'm on my computer I usually use
either Claude or Perplexity when I'm using
my phone I still go to the ChatGPT app
but I also understand most people
probably don't want to pay for three
separate chat subscriptions so if you
really like the ability to have a voice
conversation with an AI ChatGPT's still
the way if you don't care about that you
just want the best model in your hand
Claude is probably the best and they now
have an Android app and since we're
talking about Android phones Gemini now
answers general questions when your
Android phone is locked there's not too
much more to share on this story it is
exactly what it sounds like Google now
lets you get answers from Gemini without
actually unlocking your device also this
week Google announced Google Vids Vids
is an AI powered video creation app
that's designed for work and deeply
integrated with the Workspace suite you
use every day you can actually find it
over at
workspace.google.com/products/vids
right now it's not
available to everybody they say we're
currently testing this new application
with a select group of trusted testers
and according to the video on their
website it looks like you give it a
prompt like help me create a sales
training video and then it will help
create this like slide style video for
you there's a bunch of different styles
that you can choose from and once you
pick your style you can speak out a
script add a voice over to it and add
stock footage to it to get the perfect
sort of layout for your video and then
it creates that sort of slide
presentation video for you and since
we're talking about Google and we're
talking about video let's talk about
this new feature that YouTube is rolling
out called YouTube music sound search
this sounds like a feature that's very
similar to Shazam where you can have it
listen to a snippet of music and it will
figure out what song it is but you can
also hum the song it'll be able to
figure out what song it was based on
your humming we can see some screenshots
that they shared here they've got a
little search box here with a microphone
next to it I'm assuming they Click the
microphone and then it says play sing or
hum a song and then it figures out what
song that you were trying to find just
based on the singing or humming
YouTube's also testing an AI generated
conversational radio it'll let you
create a custom radio by describing
exactly what they want to hear this
article goes on to say be on the lookout
for ask for music any way you like card
in your home feed this will open the
chat-based UI with a field at the bottom
that lets you ask for music there's been
a little bit more controversy this week
about the source of training data for
various AI models this article on Proof
News here claims that Apple Nvidia and
Anthropic used thousands of swiped
YouTube videos to train AI basically
here's what's happening with this
there's a company called EleutherAI which
is an open-source company that collects
a whole bunch of data from
everywhere and puts it into what they
call the Pile and the Pile is this giant
data set that companies then use to
train their AI models initially so that
it can just sort of learn how the
language works and just get injected
with a ton of data to start well this
pile is trained on publicly available
data and it turns out that a lot of that
publicly available data was transcripts
that were copied and pasted straight
from YouTube videos and a lot of
YouTubers started to notice there's data
in there from people like MKBHD MrBeast
PewDiePie and others and this site
proofnews.org actually put up a little search
engine so that you can see if a video
that you created or literally anybody's
video is found within the Pile's data set
now I did a search for my own name and
no results were found I don't know
whether I should be offended or relieved
at the time the data was scraped my
channel probably just wasn't big enough
now after all this came out Apple
stepped up to say yes we've used the
Pile for some research purposes and some
training but the model that we're using
inside of our Apple Intelligence is not
trained on the Pile's data so that
information is not inside of Apple's
training set according to them Microsoft
has a platform called Designer which if
you're not familiar with it it's very
similar to Canva it's a platform to
create things like YouTube thumbnails
and banner ads and Instagram images and
things like that well this Designer
platform is now being rolled out into a
whole bunch of different Microsoft apps
directly where you can use the Copilot
sidebar over here ask it to create a
specific image in a specific style and
it will actually use Microsoft's
Designer to create that image and allow
you to pull it directly inside of your
document or your PowerPoint or whatever
Microsoft tool that you're using here's
another example of it being shown off
inside of Microsoft PowerPoint where
they create some images with designer
over here it generates some images and
then they just pull that in as the
background of the slide designer also
got a free mobile app on both IOS and
Android so you can easily create and
edit images on the go on a mobile device
now there's a whole bunch of other new
features for designer if you want to
dive deeper into it this is something
that you're really interested in I will
make sure it's linked up in the
description so you can see all of the
updates here the article is quite long
and there are quite a few updates but it
seems like it's got some other pretty
cool features like this restyle feature
you upload an image and it restyles it
to a different style of image Mistral the
French AI company that develops large
language models released a new model
called Codestral Mamba this is a model
designed for code generation it is open
source and it can handle an input of up
to
256,000 tokens which is double what
OpenAI currently offers with ChatGPT
that's roughly 192,000 words
between the amount of text inputted and
the amount of text outputted this is a 7
billion parameter model and offers a
fast response time even with longer
input text so if you're a coder and
you're looking to try another large
language model to see if it outperforms
the other models you've tried maybe
Codestral Mamba is a choice to try out
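The token and word figures quoted here can be checked with a little arithmetic. The sketch below is only illustrative: the 0.75 words-per-token ratio is an assumption (a common rule of thumb for English text that happens to match the 192,000-word figure), and the ChatGPT limit is not stated in the video, it is simply what "double" would imply.

```python
# Rough token-to-word arithmetic behind the figures quoted in the transcript.
# Assumption: about 0.75 English words per token (a rule of thumb, not an exact value).
WORDS_PER_TOKEN = 0.75


def tokens_to_words(tokens: int, words_per_token: float = WORDS_PER_TOKEN) -> int:
    """Estimate how many words fit in a given number of tokens."""
    return round(tokens * words_per_token)


# Context size quoted for Codestral Mamba in the video.
codestral_mamba_tokens = 256_000

# "Double what OpenAI currently offers" implies half this figure for ChatGPT.
implied_chatgpt_tokens = codestral_mamba_tokens // 2

print(tokens_to_words(codestral_mamba_tokens))  # 192000
print(implied_chatgpt_tokens)  # 128000
```

Real tokenizers vary by language and content (code usually needs more tokens per word than prose), so these numbers are best read as ballpark estimates.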
Amazon started rolling out an AI
shopping assistant called Rufus which
apparently answers questions about
shopping and also politics Rufus is
essentially a chatbot just like ChatGPT
but it's built directly inside of the
Amazon app and it's trained on the data
that's in Amazon so you can ask what are
the best lawn games for kids birthday
parties and it will suggest lawn games
as well as where to find them and buy
them on Amazon this Verge article also
tested some other questions and got it
to answer questions about the candidates
for the 2024 election I've got more bad
news if you're in the EU it sounds like
Meta is not going to be offering their
multimodal models in the European Union
they will be offering their normal text
input output models like Llama but
you're probably not going to be able to
create AI images AI videos and anything
other than more text if you're in the EU
due to the EU's I guess unclear policies
they say here we will release a
multimodal Llama model over the coming
months but not in the EU due to the
unpredictable nature of the European
regulatory environment says here that
Meta's issue isn't with the still
being finalized AI Act but rather with
how it can train models using data from
European customers while complying with
GDPR the EU's existing data protection
law the United Kingdom has nearly
identical laws to GDPR but Meta says it
isn't seeing the same level of
regulatory uncertainty and plans to
launch its new model for the UK users
here's something that I came across on X
from Johannes Stelzer I just thought it
was really cool they hooked up a little
uh MIDI device to their computer and they
can turn the knobs to change different
aspects of the images they appear to be
using Stable Diffusion here and then
using these knobs to change different
elements within Stable Diffusion
different sort of parameters and I just
thought it looked really cool and I
wanted to share it and they also put the
code for it up on GitHub so if you want
to play around with something like this
and hook up SDXL to a MIDI device well
that's available for you to do here's
another article that I came across that
I couldn't find a whole lot on I just
thought it looked cool it's from
Gizmochina turn your selfie into a printable
3D character with Tencent's AI-powered
app so apparently this is an app where
you can upload a selfie and it will
generate a 3D model based on that one
selfie that is so good that you can 3D
print it now I actually did some digging
to try to find more info about what
they're doing here and this was
literally the only article I could find
about it but as I learn more about it I
do have a 3D printer I do love AI this
is something I will be playing with if I
can get my hands on it and here's
something interesting AI systems achieve
a 96% accuracy in determining the sex
from dental X-rays so they basically
trained an AI model on a whole bunch of
dental images and then when they ran new
dental images through it it was able to
determine the sex of the person whose teeth
those were at a rate of 96% accuracy and the
ones that it wasn't accurate on were
mostly children the article claims
that it's less accurate if you're six or
under or basically haven't lost your
teeth yet now the main use case for
something like this would be in
forensics if they find you know skeletal
remains or something they can actually
identify the sex of the skeletal remains
but I just thought it was fascinating so
I thought I'd share with you so I
started recording that video while I was
in San Diego still and well now I'm on
vacation in Colorado and a few more
pieces of news came out that I wanted to
make sure got shared in Friday's news
video including the fact that OpenAI
just launched a new model today on
Thursday the day I record this called
GPT-4o mini with pretty much every large
language model creator out there
creating models that are smaller
designed to be more cost-efficient and
faster OpenAI needed to create a
smaller language model to compete so this new
GPT-4o mini is replacing the old GPT-3.5 not
quite as powerful as the full-on GPT-4o
but it is faster and smarter than the
previous GPT-3.5 we can see that right
now today GPT-4o mini supports text and
vision in the API with support for text
image video and audio inputs and outputs
in the future it's got a 128,000 token
context window so you should still be
able to put large amounts of text as
your input however the output only
supports 16,000 tokens we can see this
comparison here of model evaluation
scores with GPT-4o in pink being the
best model and it pretty much performs
the best across the board here in every
test with GPT-4o mini this new model
that was just released performing second
best across pretty much all of these
benchmarks here now keep in mind this is
comparing it to these other companies'
smaller models it almost kind of feels
unfair to be putting GPT-4o in here
compared against you know Claude Haiku
and Gemini Flash which are both of those
platforms' smaller models while GPT-4o is
OpenAI's current state-of-the-art model
but nonetheless we can see how this new
mini version of GPT-4o outperforms all
the other mini models that are out there
if we log into our ChatGPT account here
up in the top left corner where you
select the model you can see that we now
have access to GPT-4o GPT-4o mini and the legacy
GPT-4 now at the time of this recording
when I try to not log in and just use
the free version it's still claiming
it's using GPT-3.5 although it does
say here in ChatGPT Free Plus and Team
users will be able to use GPT-4o mini
starting today in other large language
model news Nvidia and Mistral teamed up
to create Mistral NeMo this is a 12
billion parameter model and it also has
a 128,000 token context window just like the new GPT-4o
mini model now what's cool about this
model is it's actually designed to be
run on device we can see here it says
this model's efficiency and local
deployment capabilities could attract
businesses operating in environments with
limited internet connectivity or those
with stringent data privacy requirements
they do go on to say that it's more
designed for laptops and desktop PCs
than smartphones so companies that want
to run a really really powerful large
language model with a large context
window that can take a lot of input and
a lot of output text and may be concerned
about privacy or not have internet
access well now they have a model that
they can use that's going to provide
pretty much everything you're going to
need it says the model is immediately
available and we have a link here with a
downloadable version promised in the
near future so you can actually try this
model out over on Nvidia's website if we
come over to build.nvidia.com/explore/discover
and click on reasoning over on the
left side we can see fresh off the press
Mistral NeMo 12B Instruct if we click in
here we get a chat window where we can
actually play around with this model if
we want to again this is a cloud version
where you can just sort of play around
with it but a desktop version is coming
soon and finally if you're planning on
watching the Summer Olympics this year
it looks like Google's AI is going to be
everywhere Google is apparently the
official AI sponsor for Team USA and
claims that they're going to have ads all
over for all of the various Google AI
products so if you haven't heard enough
about AI lately on TV well watching the
Olympics you're going to see a lot of it
and anyway that's all I got for you
today again I record these videos on
Thursday in this case some of it was
recorded on Wednesday some of it was
recorded on Thursday so if there was any
news that came out late Thursday evening
or on Friday it didn't make this video
but it will be in next week's news video
while I'm here on vacation I'm sort of
slowing down on putting new videos out
but I am going to publish my Friday news
videos on schedule just like I always do
just fewer other videos during the week
this week if you like this video and you
want to stay looped in on all the latest
AI news the coolest AI tools interesting
AI research and you know some of my own
commentary and opinions around it make
sure you like this video and subscribe
to this channel it really helps my
channel grow it will also ensure that
you see more videos like this one inside
your YouTube feed and if you haven't
already make sure to check out
futuretools.io where I curate all of the coolest
AI tools I come across I keep the AI
news page up to date on pretty much a
daily basis and we've got a free
newsletter where you can get all of the
coolest AI tools and most interesting AI
news delivered directly to your email
inbox you can find it all over at
futuretools.io completely free thank you so much
for tuning into this video I really
appreciate you thank you so much to
HubSpot for sponsoring this one I have a
feeling the AI news is really going to
start heating up again real soon there's
a lot of cool things in the works that
uh I've sort of been getting some sneak
peeks of and I'm excited to share what's
on the way so make sure you're
subscribed make sure you're tuning into
these videos I really appreciate you
thanks so much for nerding out with me
I'll see you in the next video bye-bye