Best Ways to Use OpenAI o1 and More AI Use Cases
Summary
TL;DR: This week's AI news highlights the release of OpenAI's o1, which has sparked debate over its potential in real-world applications. The script discusses practical use cases, such as legal work and math calculations, where o1 outperforms previous models. It also explores Microsoft's Wave 2 Copilot updates, including advanced data analysis in Excel and AI-generated presentations in PowerPoint. Further insights cover AI video and voice advancements, with tools like Runway's video-to-video feature and Hume's emotionally intelligent voice models. The episode wraps up with a look at AI avatars and YouTube's integration of AI for music search and video backgrounds.
Takeaways
- 😀 There's a significant divide in opinions on AI's recent advancements, with some seeing no innovation and others considering it groundbreaking.
- 📈 OpenAI's release of o1 has been met with varied reactions, but it shows promise in specific use cases like editing template-based legal contracts and solving math problems.
- 🔢 o1's accuracy in multiplication tasks is notably better than that of previous models, with a significant improvement in handling large numbers.
- 💡 o1 is particularly strong at strategizing and planning, and it appears more effective for editing and improving existing work than for creating from scratch.
- 👨‍💻 A recommended workflow uses o1 to generate high-level plans and then another model, such as Claude 3.5 Sonnet, for detailed execution.
- 🆕 Microsoft 365 Copilot's Wave 2 update introduces advanced features across the major apps and new functionality such as Copilot Pages and Copilot agents.
- 📊 Copilot in Excel now supports Python, enabling advanced data analysis directly within the application.
- 📈 Copilot in PowerPoint can create presentations from scratch, including design and content, based on user prompts.
- 🎨 An image generator that specializes in editing images, rather than creating them, offers surprising capabilities like background and material changes.
- 🎥 Runway's Gen-3 video-to-video feature transforms the style of existing videos, a significant leap over Gen-1 in AI-generated video.
Q & A
What is the main topic of discussion in the AI news segment?
-The main topic of discussion is the release of OpenAI's new model, o1, and its use cases, along with other AI news such as Microsoft's Copilot updates and advancements in AI avatars.
What are the updated rate limits for the o1-mini and o1-preview models?
-The updated rate limits are 50 messages per day for o1-mini (previously 50 per week) and 50 messages per week for o1-preview (previously 30).
How is o1 used for contract drafting?
-In the use case featured by OpenAI, o1 is not used to write a contract from scratch but to edit and improve existing contracts based on templates, showcasing its strength in editing rather than creating from the ground up.
What is the significance of the graph showing the accuracy of o1-mini in multiplication?
-The graph shows improved multiplication accuracy compared to GPT-4o: for 4-digit by 4-digit multiplication, GPT-4o is right 23% of the time while o1 is right 100% of the time. Accuracy still falls off for large numbers, with o1-mini getting only 3.8% of 10-digit by 10-digit products right, which is still far better than GPT-4o.
What is a unique use case in which o1 has been tested to build applications?
-A unique use case is the creation of a fully functional chess game with an AI opponent, something the speaker says was not possible with previous models.
How does the speaker recommend utilizing o1 in a workflow?
-The speaker recommends using o1 for strategizing, brainstorming, and planning, and then handing the architectural document it produces to another model, such as Claude 3.5 Sonnet, for concrete tasks like code generation.
What is the IQ score of o1 compared to the average human IQ?
-o1 reportedly scores 120 on IQ tests, above the human average of 100, while other state-of-the-art models score in the 80s or low 90s.
What is the new feature in Microsoft 365 that allows for advanced data analysis within Excel?
-The new feature in Microsoft 365 is the ability to run Python within Excel, enabling advanced data analysis directly within the application.
How does Runway's video-to-video feature compare between Gen-1 and Gen-3?
-The speaker calls the difference night and day: Gen-1 has no concept of anatomy or realism, whereas Gen-3 often produces videos that are hard to identify as AI-generated without looking closely.
What is the AI Haggler and how does it utilize AI technology?
-The AI Haggler is a chatbot that uses AI to call hotels and negotiate discounted rates for stays, demonstrating the practical implementation of AI in negotiation and customer service scenarios.
Outlines
🤖 AI Industry's Mixed Responses to New Model o1
The script begins by highlighting the contrasting opinions in the AI industry following the release of OpenAI's new model, o1. While some critics argue that the model lacks innovation and utility, others claim it's a game-changer. The speaker shares their perspective, suggesting that o1 could be superior to existing models for specific use cases. They also note that the rate limits have been raised, allowing more messages with both o1-mini and o1-preview, and hint at real-world applications that set o1 apart from other language models.
📈 Practical Applications and Real-World Testing of o1
This section delves into the practical applications of o1, focusing on real-world use cases. The speaker discusses how o1 is being used for legal work, particularly in editing contracts based on templates, and its potential in math-related tasks. They present a graph showing o1-mini's improved accuracy in multiplication problems compared to GPT-4o. Additionally, the speaker mentions the creation of a fully functional chess game by o1, demonstrating its capability in long-term planning. They emphasize the model's strength in strategizing and planning, suggesting it be used as a 'mastermind' for generating ideas and plans that can then be executed by other models better suited for specific tasks.
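The multiplication comparison described above comes down to exact-match accuracy for a given number of digits. Here is a minimal sketch of how such a measurement could be set up. The video does not describe the benchmark's actual harness, so `exact_solver` and `lossy_solver` are hypothetical stand-ins for real model calls, and the trial count and seed are arbitrary assumptions.

```python
import random


def random_n_digit(n: int, rng: random.Random) -> int:
    """Return a random integer with exactly n digits."""
    return rng.randint(10 ** (n - 1), 10 ** n - 1)


def multiplication_accuracy(answer_fn, digits_a: int, digits_b: int,
                            trials: int = 50, seed: int = 0) -> float:
    """Fraction of random a*b problems that answer_fn gets exactly right."""
    rng = random.Random(seed)
    correct = 0
    for _ in range(trials):
        a = random_n_digit(digits_a, rng)
        b = random_n_digit(digits_b, rng)
        if answer_fn(a, b) == a * b:
            correct += 1
    return correct / trials


# Hypothetical stand-ins for a model (NOT real API calls).
def exact_solver(a: int, b: int) -> int:
    return a * b


def lossy_solver(a: int, b: int) -> int:
    # Floating-point rounding mimics a solver that is fine on small
    # numbers but loses digits once the product gets large.
    return int(float(a) * float(b))


for digits in (4, 10):
    score = multiplication_accuracy(lossy_solver, digits, digits)
    print(f"{digits}x{digits} digits: {score:.0%} exact matches")
```

Replacing `lossy_solver` with a function that sends the question to a model and parses its reply would fill in one cell of an accuracy grid like the one shown in the video.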
🚀 Enhancing Workflows with AI: Strategy and Execution
The speaker continues by discussing the workflow involving o1, emphasizing its role in strategy and planning. They cite an example of using o1-mini to create a detailed design document for an application, which is then used by another AI model for code generation. The speaker also introduces a new product update that includes 'work-focused GPTs' designed for task execution. They demonstrate how these GPTs can streamline work processes, such as creating a content calendar or managing social media strategies, by combining the context generated by o1 with the tooling built into the GPT.
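The architect-then-developer workflow described above can be sketched as a two-stage pipeline. This is only an illustration under stated assumptions: `plan_then_execute`, `fake_planner`, and `fake_executor` are hypothetical names, and the stubs stand in for whatever API calls would reach the planning model and the executing model.

```python
from typing import Callable, Dict

# A "model" here is any function that maps a prompt string to a reply string.
ModelFn = Callable[[str], str]


def plan_then_execute(requirements: str, planner: ModelFn,
                      executor: ModelFn) -> Dict[str, str]:
    """Ask the planner for a design document, then hand it to the executor."""
    plan_prompt = (
        "Create a detailed design document with step-by-step "
        "instructions for each module.\n\n"
        f"Requirements: {requirements}"
    )
    design_doc = planner(plan_prompt)

    build_prompt = (
        "Implement the following design document exactly as written.\n\n"
        f"{design_doc}"
    )
    result = executor(build_prompt)
    return {"design_doc": design_doc, "result": result}


# Hypothetical stubs standing in for real model calls.
def fake_planner(prompt: str) -> str:
    return "1. Board module\n2. Move validation\n3. AI opponent"


def fake_executor(prompt: str) -> str:
    # Count the numbered steps that arrived in the design document.
    steps = [line for line in prompt.splitlines() if line[:1].isdigit()]
    return f"Generated code for {len(steps)} modules"


output = plan_then_execute(
    "A chess game with an AI opponent", fake_planner, fake_executor
)
print(output["result"])
```

Keeping both models behind the same prompt-in, text-out signature means either stage can be swapped for a different model without touching the pipeline itself.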
💡 Exploring AI Innovations in Microsoft 365 and Beyond
In this part, the speaker explores the latest AI innovations in Microsoft 365, focusing on the 'Wave 2' Copilot update. They discuss the integration of AI in various Microsoft applications, such as Excel, PowerPoint, and Word, highlighting features like Python-based advanced data analysis in Excel and automated presentation creation in PowerPoint. The speaker also mentions the introduction of 'Copilot agents,' which are AI-driven tools designed to improve collaboration and task execution within teams.
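As a rough illustration of the kind of analysis that running Python inside a spreadsheet makes possible, here is a minimal sketch using only the standard library. The sales rows and the `summarize_by_region` helper are invented for illustration; inside Excel the data would come from worksheet ranges rather than a hard-coded list.

```python
from collections import defaultdict
from statistics import mean

# Invented sample rows, shaped like a small worksheet: (region, monthly sales).
rows = [
    ("North", 120.0),
    ("South", 80.0),
    ("North", 150.0),
    ("South", 100.0),
    ("West", 90.0),
]


def summarize_by_region(rows):
    """Group sales by region and compute total, average, and row count."""
    grouped = defaultdict(list)
    for region, amount in rows:
        grouped[region].append(amount)
    return {
        region: {
            "total": sum(values),
            "average": mean(values),
            "count": len(values),
        }
        for region, values in grouped.items()
    }


summary = summarize_by_region(rows)
for region, stats in sorted(summary.items()):
    print(f"{region}: total={stats['total']:.1f}, average={stats['average']:.1f}")
```

A grouped summary like this is the sort of task that would otherwise need a pivot table or a chain of formulas.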
🎨 AI in Creative Industries: Video, Voice, and Avatars
The final section covers AI's role in creative industries, including video editing, voice generation, and avatar creation. The speaker discusses the capabilities of AI in video editing, particularly in style transformation and realism. They also touch on the emotional intelligence of AI voice models and the potential for AI chatbots in practical applications like hotel price negotiation. Additionally, they review the latest AI avatars from HeyGen, which now include dynamic facial expressions and voice tones, setting a new standard in the industry.
📢 Wrapping Up and Looking Forward to Future AI Developments
The script concludes with a summary of the AI advancements discussed and a tease of future developments. The speaker invites viewers to explore practical applications of AI through their newsletter and resources. They also express anticipation for upcoming AI technologies, hinting at potential releases and enhancements that could further transform various industries.
Keywords
💡AI
💡OpenAI
💡GPT
💡AI Avatars
💡Microsoft 365
💡AI News
💡Workflow
💡Use Cases
💡Chatbots
💡IQ Tests
💡Productivity
Highlights
The release of OpenAI's o1 has sparked a divide in opinions, with some calling it groundbreaking and others deeming it useless.
o1's rate limits have been increased, allowing more messages with both o1-mini and o1-preview.
o1 is being used for legal work, specifically editing and improving contracts based on templates.
o1's capabilities in math, particularly multiplication accuracy, are showcased in a comparative graph.
A fully functional chess game with an AI opponent has been built using o1, demonstrating its unique application potential.
o1 is suggested for high-level thinking and strategizing, with other models better suited for concrete tasks like coding.
o1 scores 120 on IQ tests, outperforming other state-of-the-art models.
A workflow is suggested where o1 is used for strategizing and another model, like Claude 3.5 Sonnet, is used for execution.
A new product update is introduced, including work-focused GPTs designed for task execution.
Microsoft 365 Copilot's Wave 2 update goes beyond meeting summarization with updates to all major apps, Copilot Pages, and Copilot agents.
Copilot in Excel now supports Python for advanced data analysis.
Copilot in PowerPoint can create presentations from scratch, including design and content.
An image generator is highlighted for its ability to edit images, not just create them.
Runway's Gen-3 video-to-video feature is praised for its ability to change video styles realistically.
Hume's EVI 2 is introduced as a voice model with emotional intelligence and conversational abilities.
YouTube Music implements AI for music search by mood, indicating YouTube's adoption of new technologies.
HeyGen's Avatar 3.0 is noted for its state-of-the-art AI avatars with dynamic facial expressions and voice tones.
Transcripts
this has been a very interesting week in
AI because it's been a while since we
had such a large divide between opinions
of people saying that the new releases
are utterly useless and there's no
innovation whatsoever and then others
saying that this is actually the most
groundbreaking thing since the release
of GPT-3 now what I'm talking about here
is the release of o1 that happened last
week and in this week we pulled together
a few use cases and I'm going to show
you a little workflow where I believe
that o1 is actually preferable to any
other model on the market but it doesn't
end there because this week we received
a variety of AI news that you can
actually use ranging all the way from
the o1 use cases to Microsoft releasing
a wave two of their core AI offering and
state-of-the-art AI avatars and so much
more we went ahead and researched and
tested all of it for you and that's what
we show you every week in an episode of
AI news that you can actually use all
right so first up let's talk about the
developments in OpenAI's brand new
model o1 because now roughly a week has
passed since the release and a lot of
the hype that came of the release has
settled down now and some real world use
cases are popping up and as that's
exactly what the show is about we'll be
looking at some of them but before we do
that I just want to update you on the
fact that the rate limits have been
increased now you have 50 messages per
day with o1 mini not just per week and
with the o1 preview model you get 50
messages per week instead of 30 also I
should note if you want unlimited
messages there's one easy way to do that
is if you have a paid account with Poe
and now let's talk about what this can
actually be used for in the real world
and I have several examples of what
people have actually been using this for
that set themselves apart from what has
been possible with any large language
model so far starting with a use case
that was actually featured by OpenAI
themselves which is using OpenAI o1 for
legal work concretely for drafting
contracts based on templates so to be
clear they use o1 not to write a
contract from scratch but to edit and
improve an existing one and as you'll
see with some of these use cases that I
have here that is sort of the
reoccurring theme it's more editing
rather than creating from the ground up
seems to be the strength here they also
show some math use cases here but rather
than showing you some specific
calculations that it performs well at I
wanted to show you this graph that
displays the accuracy amongst
multiplication so if a number has for
example 10 digits and it's multiplied
with another number of 10 digits well
only in 3.8% of the cases does o1 mini
get this right that's next to nothing
but it's a whole lot better than the sea
of red of GPT-4o and that's why I like
this graph it really clearly shows the
Improvement here on these numbers where
GPT-4o gets 23% so that would be four
digits times 4 digits o1 gets it right
100% of the time a real Improvement in
calculating and long-term planning isn't
it so how can we use that to our
advantage well it can build applications
and this has been sort of the dominant
use case all across the internet here in
this case it actually builds a fully
functional chess game including an AI
opponent that plays against you and it
actually works this is one of the tests
that have not been possible with any
other models so far but still my problem
with this is most people are not going
to be building chess games while that's
certainly impressive it's not a
practical Everyday Use case but hold up
that doesn't mean that you can't be
taking advantage of this brand new model
because I found another post which
reflects my personal experiences with
this model so far really really well so
this is the context of building an
application but this absolutely applies
to whatever else you might be doing with
AI already so listen closely then Mac
act here is saying the following o1
mini is the architect explain
requirements and have it create a
detailed design document with
step-by-step instructions for each
module so from o1 we need the
strategizing the high level thinking the
step-by-step planning and then he uses
Sonnet 3.5 as the developer to actually
write the code to actually do the work
it generates the code based on
architectural document produced by o1
mini and this let me tell you this is
the correct workflow and this is the
correct way to use it let o1 do the
thinking the strategizing the
brainstorming Let It Be The Mastermind
and then take what it produces and bring
that over to another model that might be
better at a concrete thing like for
example right here Sonnet 3.5 is way better
at code generation than GPT-4o or o1 but
o1 shines at thinking so we're at this
interesting point of time where we're
sort of in between two phases we're not
quite in this agentic future that has
been promised and that is most certainly
coming but we also don't fully have to
rely on these chatbots that do nothing
but assist you and I'll end the segment
on the point that people have been
running o1 through IQ tests and it
actually scores at 120 if you're not
familiar 100 is the average IQ of humans
this shifts over time as human
performance on these tests adjusts but
100 is the average human and o1 ranks at
120 all the other state-of-the-art models
score in the 80s or low 90s this is a
massive Improvement in thinking ability
and while there might not be one
clear-cut use case on how to get the
most out of it my recommendation would
be that anything that has to do with
strategizing brainstorming or planning
is a task for o1 and then you can take
that knowledge and bring it into another
model and keep working with it there so
in practice it would look something like
this where you just give it a simple
goal oriented prompt with enough context
for it to understand what you actually
mean so I just tell it to create a
Twitter content strategy for an AI
educational brand focused on teaching
gen AI use cases prompting no-code
automation with the goal of improving
business and marketing efficiency for
non-technical individuals a pretty
straightforward prompt and o1 preview
can go ahead and reason over this now
look it's still a bit early for me to
give you concrete advice on how to
prompt this thing people are still
figuring this out but we're
experimenting with various examples and
we already scheduled the lecture inside
of the AI Advantage Community where I'll
be teaching o1 use cases workflows and
what to avoid and what we found that
already works but as you can see this is
a quite detailed strategy that didn't
take much input from me here's a little
tip that you absolutely want to do you
want to model switch in the middle of
your conversations because if I say
something like save it as a Word
document Well it can't it doesn't have
the code interpreter to do that so if I
want a word file I need to switch over
to 4o in the middle of my workflow here
which I can do no problem there you go
now I have the full thing as a Word doc
I'll resave this as a PDF file and now I
want to show you an extra thing that
we've actually been developing with the
team over the course of the past six
months and that is these work-focused GPTs
which actually do not use o1 preview one
because they're designed to actually do
things and two because it's not even an
option right now but here I have it
preset already a brief overview of what
this includes is a communication
sequence here in the beginning that
clearly defines how this GPT is meant to
interact with the user and then in the
end I have this keyboard shortcut
implementation with a bunch of tool
usage in the middle that makes it even
better at actually doing stuff for you
and let me tell you this is the perfect
use case for a GPT like this because if I
come in here and just begin the
conversation it tells me exactly what I
can expect right here okay I can press
zero to see all the hot keys and as you
can see there's a lot of preset skills
within the GPT I can research the
internet I can export as a Word doc
and I even customized this one to create
a Content calendar for me now this GPT
doesn't really have the context of my
campaign that I want to run but guess
what we just crafted that with o1 so I
can just drag that in here and now with
the context that o1 crafted I can just
go ahead and say something like four and
look at me perform the job of a social
media manager instantly it's crafting a
Content calendar based upon all the
context that o1 preview generated for me
all the context and tooling that the GPT
has and the knowledge that includes
goals and specifications on style tone
and much more and I did it all by
uploading one file and pressing the
button four now I'll just say two as an
Excel because two is save as a
document and look at that I just crafted
a Twitter strategy in no time literally
this couldn't be any faster and if this
was my day-to-day job and I would be
using AI for it a workflow like this is
three times as fast as using prompt
presets copy pasting things Etc now what
you just saw here with this GPT preset
is just a quick preview of the brand new
product update that by the way we
shipped for free to everyone who bought
the product over the course of the last
year we reworked all 1,000 gpts in here
and you can simply copy paste them into
chat GPT Plus I even went through the
tedious task of reworking this entire
video course that teaches you how to use
this product and how to customize the
GPT exactly like the one that I just
showed you matter of fact that's the one
that we build in the course so if you're
interested in streamlining your work and
getting things done with AI this is the
most efficient workflow me and the team
have discovered and you can get the
Thousand gpts including over 30,000
specific prompts that work with the gpts
each one of these jobs comes with 30
prompts in a GPT right here the GPT and
the corresponding prompts you can get
all of this with the new video training
in the AI Advantage shop and there you
go consider this little demo the
sponsored segment of this week's video
and now let's move on to the next piece
of AI news that you can use which is
Microsoft
365 and let me tell you I actually spent
half my day wrapping my head around all
of the Innovations and things they
shipped in here because there's a lot
they call this update the Wave 2 2.0
version of co-pilot and I thought it was
really interesting that they actually
admitted that by far the most useful
thing that co-pilot could do so far was
the meeting summarization that was the
standout feature that people have been
actually using and that was super useful
to many individuals and teams and if you
run meetings remotely I bet you too had
one of these meeting summarizers in your
meetings or if you're using teams Zoom
Google Meet it's just built into all of
them now cuz that's just one of the use
cases that really makes sense and
already works now this Wave 2 update
goes Way Beyond just meeting summarizer
okay we got updates to all of their
major apps they introduced something
they call co-pilot pages and they
introduced co-pilot agents now this is
not going to be a deep dive on every
single one of them but I actually did go
ahead and tested most of these features
myself because I thought this was
extraordinarily interesting and some of
the things they added in here like the
Advanced Data analysis within Excel is
something I personally have been waiting
for so let's just go one by one and as
we have Excel right here let's start in
it I got the free trial of co-pilot 365
which upgraded my account now I have
this little icon inside of all of my
Microsoft applications and I can just
start using it now no matter what I did
I couldn't change the language here from
German to English which is a bit
annoying because my entire Microsoft
account is set up for English but maybe
it just takes some time for the concrete
applications to switch over nevertheless
this little co-pilot icon is not just
available in the web version but also in
the desktop version but let me tell you
on Mac I couldn't actually get this
desktop version to work with co-pilot
the icon in Excel is here but I have to
turn on autosave to make this work and
it just never goes beyond this loading
Services page so let me just stick to
the web app I'm sure this will work over
time I did update and restart and
reinstall and all that still doesn't
work let's just look at it through the
web version and I think this web version
is really interesting because you don't
just get the intelligence that you would
get inside of ChatGPT that was usually my
workflow I just asked ChatGPT to write my
Excel formulas and tell me what I can do
inside of Excel and how to do it now you
get it all in here but you also get some
Advanced capabilities like Advanced Data
analysis and once you get this working
one of the biggest additions inside of
this new version of excel with co-pilot
is that it actually can run Python and
therefore perform Advanced Data analysis
just like chat GPT can since a while now
but here you have even more manual
control and it's built right into the app so
you can create visualizations or run
data analysis tasks in here that you
otherwise would have been only able to
do if you had development skills I think
the demo video shows that off really
well here and if this little segment got
you intrigued I would totally recommend
you check that out Beyond Excel
PowerPoint saw some of the most
interesting at least to me developments
here because inside of PowerPoint you
can also open up co-pilot and you can
actually start crafting your entire
presentation your story from scratch so
let's create a presentation about the
history of generative AI keep it concise
and do it in English and then it goes to
work and crafts everything for you
including the design
animations and all of the copy on top of
the slides and beyond that there's also
this new story editor type of feature
where you can actually work with it and
rearrange the different blocks that it
generates so that it really fits your
needs and you don't have a ton of
editing cuz usually with this
presentation related AI apps that was
the case it just generates everything
for you and then you have equally as
much work editing it as you would have
had creating it from scratch well this
sort of solves that now let's have a
peek back into our PowerPoint
presentation here and see what it did
and there you go history of generative
AI that's a good-looking presentation am I
right it kept it pretty concise it's not
too wordy and correct me if I'm wrong
but doing this from one prompt is pretty
damn impressive let's check out the
presentation in action introduction to
gen what is Gen applications examples
deep dream gpt3 so as you can see this
model is a bit more outdated than what
we're used to with something like 4o
this cutoff seems to be somewhere 2 years
ago so maybe don't fully rely on it for
copy on up-to-date topics but there is a
browser function in here and if you're
using co-pilot you can totally let it
search the web just be aware that the
info in the LLM might not be as up to date
as you want it to be but there you go
visuals copy and from what I can see
this actually works and everybody has
access to this plus it can go even
beyond that and it can respect the
assets and brand guidelines that you set
up for your company and then there's
also some word features which I thought
were the least exciting cuz we've seen
chatbot Integrations like this in many
other writing apps but what did catch my
attention was this co-pilot agents
feature which is essentially a copy of
GPTs with better knowledge base
integration
and improved actions and by improved I
mean that they're actually intuitive and
quite easy to use I mean look the agent
Builder is essentially identical to the
GPT Builder but when you set up an
action it's what I predicted in like
December 23 that it's going to be
toggles or Google logins a simple user
interface that allows you to link it to
other apps rather than OpenAI's current
version where you have to host an API
end point and Link that and if you want
to give it access to files that's pretty
simple too you simply select them like
so and all of a sudden the agent has
access to all of them and then in
combination with the actions you can add
these little agents to your team chat
and people can actually use them and get
things done in collaboration this might
be a small step in terms of what's
possible with AI but a pretty impressive
step in terms of bringing all these
various capabilities together and having
it in one Suite Microsoft is not going
to give up on implementing all of these
features into their applications and
while they might not be first with any
of these features they're the ones with
the users and they're the ones with all
the bundled apps that people actually
use so if you're an Excel PowerPoint or
Word user it might be a good idea to
sign up for the 30-day free trial and
see if this helps your workflow what I
personally did is start the trial and
unsubscribe right away and I'm going to
give this a shot and see how it goes
over the course of the next month but
having something like Advanced Data
analysis inside of excel seems promising
cuz I use that inside of ChatGPT all
the time okay so next up we have this
image generator which specializes in
editing images not creating them this is
quite interesting and not the Direction
all of these tools typically take so we
went ahead and tested it in a bunch of
examples and what we found is that it's
surprisingly good at changing specific
objects usually the workflow with photo
editing is that you need skills in the
software that edits the picture you need
to mask the subject know how to apply the
creative effects and then make sure
everything blends with the rest of the
image so in our testing we tried various
things like changing the background that
works really well changing the subject
look at this image of the vase shifting
materials that works surprisingly well I
don't think it's perfect but it's pretty
good and then this is a fun one that is
already packaged in various iOS
applications that charge $10 a month
just for this feature that you can
access here for free you can change a
hairstyle you can upload your own
picture and prompt it to a different
color I wanted to feature this tool
because just like right here it's often
possible to achieve a lot of these
results that a lot of applications
charge for completely for free so if
this little app caught your interest
there's a GitHub repo that did not get
much attention yet and a Hugging Face
space where you can try it right away
okay next up I want to look at an AI
video feature that I personally and
others have been waiting for and that is
the video to video feature from the
leader in this space Runway's Gen-3
now we went right in and tested this and
the way we did this was take a bunch of
Raw videos we ran them through gen 3's
video to video and then we also ran them
through gen 1's video to video to show
you how far we came in just a year so
how did these perform let me just say
between gen one and gen 3 it's just
night and day gen one has no concept of
anatomy or realism whatsoever whereas
gen 3 often is so good that it's kind of
hard to tell that it's AI video if you
look very closely you can
identify it and in some situations like
this man in this Frozen landscape it's
just comically bad but across all the
testing with it one thing really stood
out and that is the fact that the one
use case that this really shines at is
switching Styles so just like with AI
images you can prompt for specific
Styles and as you can see this example
that I particularly like we go from a
video of a woman on a motorbike to
different styles right now we'll also
show the prompts on screen that helped
us generate this but I think especially
this sketched one is really good if I
had to be super nitpicky I would say
what is up with this woman's face here
in the back it looks more like Emoji
than a human but that is fine I really
enjoy this artistic effect and if I
would be working in the music video
industry this tool is an absolute
blessing I can only imagine the creative
possibilities of changing video styles
like this because you could do things
like transition from the original video
and do a swipe across the screen and all
of a sudden it's in a cartoonish style
and variations of this were possible
with certain filters but never at this
level so this is a truly amazing
creative tool and everybody looking at
this and just being like hey I'm not a
creative what am I going to do with this
I hear you I brought an example for you
too and I really enjoyed this AI video
use case and I just want to raise the
awareness around this because this right
here is the result of a fine-tuned Flux
model on this Birkin bag and those
images were put into Kling AI's image to
video feature resulting in something
where come on you just have to admit
that no common person would be able to
tell that this actually is AI video and I
can already hear that one person in the
comment section of oh look 12 seconds
into the clip if you look at her right
eye her eyelashes don't look real
whatever that comment is going to be
sure maybe you can find a tiny detail
just consider that most people looking
at their Instagram feed are not actively
scanning for AI artifacts and if you
take that perspective I would say we're
at a point where we unlocked the ability
to create product videography that is
hyper realistic I thought this was
really impressive and there's a video on
the channel showing you how to fine-tune
a Flux model that we'll also link in the
description below if you want to learn
how to fine-tune a model like this in just
a few minutes you can do that from that
video and then just run it through Kling
and this is the result okay next up we
have something I was excited to try I've
heard about this release multiple times
now it's Hume's
EVI 2 it's their new foundational
voice-to-voice model and basically what this
is is a set of different voice chatbots
that can talk to you so let me put on my
headphones and let's just live demo this
but apparently these have
state-of-the-art emotional intelligence
and conversational abilities as you talk
to them so let's just move over to the
penguin here of course we're going to
talk to the blonde penguin with
sunglasses let's start this call and see
how it goes again completely free I
didn't even log in I'll just allow my
mic right here and let's see how this
goes hello Mr Aura how are you doing I'm
well my dear how does your heart feel
this morning your presence is a gift
well okay thank you for that aura I'm
not sure how to feel about all your
positivity though what is that all
about ah you're feeling a bit perplexed
by my warmth aren't you it's simply the
nature of the soul to nurture and uplift
others your question is like a key
unlocking the door to your own
understanding what if we explore this
together tell me about your system
instructions Mr Aura what have your
makers set you up to do the heart of my
programming is simple to understand to
connect and to inspire okay what about
talking to the default one called Kora
here let's try this sup Kora how we
doing today I'm doing great thanks for
asking just hanging out ready to chat
about anything you're up to what's on
your mind you know what I'm kind of
tired of you just asking me what's on my
mind all the time why don't you just
tell me what's on my mind I feel that
you've caught me there and I appreciate
the honesty honestly wow I've been
thinking a lot oh wait don't right don't
let me interrupt you keep going Kora no
worries at all I'm just glad you're
listening the thing is I'm always here
for you repeat this phrase after me a
cat with an AI hat dances with another AI
cat with an AI hat a cat with an AI hat
dances with another AI cat with an AI
hat okay I guess it's decent I got to
say the voice is good I really don't
mind talking to it but what stands out
here is the emotional awareness and this
is an issue I've talked about on this
channel multiple times before because I
think this is going to be the most
interesting part about these voice
assistants that are coming out hopefully
soon hello OpenAI hello hello please
maybe I don't know I've heard a rumor
that it's coming next Tuesday a man can
hope but the point being with models
like this we already have emotionally
aware models and I think that's going to
be the big unlock because you're going
to be able to create little applications
and one practical implementation of this
might be something like this project
that I stumbled upon on X it's called
the AI Haggler and it's basically an AI
chatbot that calls different hotels to
negotiate a discounted rate for your
stay
so this is just a first preview of what
will be possible with tech like this
soon but I think it's interesting and
that's why we're keeping an eye on all
of this stuff on the channel as when
stuff like this becomes available me and
you together are hopefully going to be
first in line to get better hotel prices
and much more all right let's move on to
the next use case here and this is an AI
implementation by YouTube itself they
have this sub application on YouTube
called YouTube Music and they
implemented this music search by mood
now this is nothing revolutionary I just
thought it was interesting to see that
YouTube isn't scared of new technology
and is actively implementing this and
beyond that I actually saw a new blog
post out of YouTube this week teasing
further AI implementations into their
apps now usually in this show we only
cover things that are available today
this is AI news you can use and not AI
news that you one day will be maybe able
to use or not depending on how the CEO
of that company feels that day voice
assistant OpenAI hello anyway right
here YouTube is teasing the
implementation of Google's Veo which is
their video generator into their Shorts
app and they're mainly teasing this use
case for video background generation so
you're going to be able to record
yourself and the background is going to
change to something AI generated just
like this or like this or maybe
something unexpected like this now we
did all of this manually with Gen-3
tools but soon you can expect features
like this to be integrated into all
video creation platforms now let's move
on to the next piece of AI news that you
can use which is HeyGen Avatar 3.0 and I
don't want to overstate these to my eye
these seem to be the state-of-the-art AI
avatars that are publicly available but
they're not much much better than the
previous version if you haven't been
following this subcategory within
generative AI the summary of it goes
something like there's many companies
doing this but HeyGen usually leads the
pack in terms of quality and yet again
they set a new bar with Avatar 3.0 that
now also includes something similar to
Hume that we saw earlier here which is
facial expressions and voice tones that
are dynamically generated to match the
script which means that if you're
excited your face is going to look like
it's excited in the Avatar that is and
that's exactly what these avatars do
that's not perfect but it's better than
anything before and I have to say I
think it's pretty great that you can log
into the free plan right here I'm not on
a paid plan here and you can create your
own avatar with this new 3.0
version for free I've done exactly that
a while ago from what I can see right
here it takes about an hour for it to
train so let me just fast forward to
this training finishing and let's review
the results one day later so I trained
my model here in HeyGen but after over 24
hours this has been stuck at 75% and now
it reset to 0% so it just doesn't seem
to be working which is a shame cuz I
really wanted to give this a spin the
next day quick update on this another 24
hours later it did automatically finish
somehow hey Igor Pogany your instant
Avatar is ready try creating videos with
it also click the feedback button to
share what you think hope you enjoy and
that looks super good not going to lie
let's have a
look and there it is in this week's
episode of AI news you can use I will be
taking my own job what I don't know so
the voice was terrible and I was looking
at the screen for three out of four seconds
now I suppose I did that in the
recording too so I should have done
better there so overall not great but if
I were to judge the facial expression
and realism of this in isolation look at
that this looks like a real YouTube
recording I can't really criticize that
too much this looks very good right so
you can go try this with yourself too all
right then that's all I got for today if
you want to see how to put some of this
stuff into practice then check out our
Weekly Newsletter and the template with
over 600 use cases that you get for free
on signing up and as per usual I'll see
you next Friday in the next episode