26 Incredible Use Cases for the New GPT-4o

The AI Advantage
15 May 202421:57

Summary

TLDRThe video script discusses the myriad of use cases for the newly released GPT-40 model, highlighting its advanced capabilities such as understanding and expressing emotions, multi-personality conversations, voice modulation, and professional applications like medical diagnosis assistance and data analysis. It also touches on the potential of GPT-40 in creative fields, including music composition and 3D object synthesis, as well as its implications for education, customer support, and accessibility for the visually impaired. The script also mentions a community challenge that invites users to share their unique GPT-40 use cases, fostering a collaborative exploration of the model's potential.

Takeaways

  • 🚀 GPT 40 model introduces a wide range of new capabilities and use cases, enhancing its utility for various applications.
  • 📱 The model allows for hands-free operation by integrating with smartphones, providing instant responses without interrupting the user's workflow.
  • 🎭 GPT 40's advanced AI companion features include a more human-like interaction, with the ability to express and understand emotions.
  • 📊 It offers improved data analysis through the code interpreter, enabling users to upload files for deep technical and statistical analysis, and generating visualizations.
  • 🤖 The model can simulate multiple personas, facilitating mock conversations or debates, and can modulate its voice to sound robotic or human-like.
  • 🧑‍⚕️ There's potential for GPT 40 to be used in professional fields such as medical diagnosis, although it's more suited for diagnosis than treatment.
  • 📈 The model's performance is benchmarked against other AI models, showing improvements in vision and code interpretation, making it more effective for tasks like analyzing spreadsheets.
  • 🎓 It has educational applications, where it can act as a tutor guiding users through problems step by step, offering an alternative to traditional classroom learning.
  • 🎭 The model's ability to understand and use sarcasm is a significant advancement in AI's understanding of human language and communication.
  • 👶 Accessibility features are highlighted, such as describing visual scenes to those with no eyesight, offering a new level of assistance to users with disabilities.
  • 👩‍💼 For businesses, GPT 40 can act as a customer support representative, handling tasks and simulating conversations, signaling a future where AI has a more integrated role in business operations.

Q & A

  • What is the main focus of the video?

    -The main focus of the video is to explore various use cases for the GPT 40 model, as demonstrated by the company's announcement and additional examples found across the internet.

  • What is the challenge issued at the end of the video?

    -The challenge is to find GPT 40 use cases that work for the participants and to share these in a public space for review and potential prizes.

  • How does the GPT 40 model enhance the user experience?

    -The GPT 40 model enhances the user experience by being more human-like, understanding and expressing emotions, providing instant responses without changing the user's workflow, and offering capabilities such as voice modulation and multi-personality conversations.

  • What is the potential application of GPT 40 in professional fields?

    -In professional fields, GPT 40 can be used for tasks such as medical diagnosis assistance, deep technical and statistical analysis of data, and facilitating meetings with summarization.

  • How can GPT 40 be used in the context of education?

    -GPT 40 can act as a tutor, guiding users through solving problems step by step, which can be particularly helpful for individuals who struggle in school or need an alternative to traditional teaching methods.

  • What new capability does the GPT 40 model have regarding voice interaction?

    -The GPT 40 model has a new capability for voice interaction that allows it to be sarcastic and modulate its voice, making it more versatile and human-like in conversation.

  • How does GPT 40 assist users with accessibility needs?

    -GPT 40 can assist users with accessibility needs by providing visual descriptions of scenes for those with no eyesight and potentially acting as a second set of eyes in situations where the user's attention is divided.

  • What is the significance of GPT 40's integration into AI-powered IDEs?

    -The integration of GPT 40 into AI-powered IDEs allows for faster and more efficient code writing and testing, with reported improvements in coding abilities and cost savings for developers.

  • How does GPT 40's new 3D object synthesis feature work?

    -GPT 40's 3D object synthesis feature works by generating multiple images of an object from different views and reconstructing the object in 3D using those images.

  • What is the purpose of the AI Advantage Community mentioned in the video?

    -The AI Advantage Community is a platform that provides learning materials and resources to help members stay up to date with AI skills and tools, and to share and explore various use cases for AI models like GPT 40.

  • What is the timeline for the new capabilities of GPT 40 to become available to all users?

    -Many of the new capabilities of GPT 40, including the voice assistant feature, will be rolling out over the next week and will be available to all users soon.

Outlines

00:00

🚀 Introduction to GPT 40 Model Use Cases

The video script introduces the GPT 40 model, highlighting its diverse use cases. It discusses the model's potential applications, as demonstrated in videos released by OpenAI, and invites viewers to share their own use cases. The host also proposes a challenge for the audience to find personalized GPT 40 use cases and provides a link to a separate video for more technical details on the model's announcement and functionality.

05:01

📱 AI Companion and Multi-Persona Conversations

The script covers the AI's ability to act as a companion, providing instant responses without interrupting the user's workflow. It also touches on the model's advanced human-like characteristics, including emotional understanding and expression. The script then delves into the model's capability to conduct multi-personas conversations using two phones, simulating debates or arguments, and its potential applications in professional fields such as medical diagnosis and data analysis.

10:02

🎓 Educational Applications and Contextual Limitations

The video script explores the GPT 40 model's potential in education, suggesting it could assist students struggling with complex subjects by guiding them through problems step by step. However, it acknowledges the controversy surrounding its use in education, including concerns about a lack of human touch and the potential for cheating. The script also mentions the model's ability to understand and use sarcasm, thanks to its multimodal capabilities.

15:02

👓 Accessibility and Real-time Data Analysis

The script discusses the GPT 40 model's potential to assist individuals with visual impairments by describing their surroundings in real-time, offering a second pair of eyes in situations where attention is divided. It also covers the model's use in customer support, simulating a conversation between a customer and a support representative, and its integration into development tools, leading to significant cost savings and improved coding abilities.

20:02

🌐 Community Challenge and Future Directions

The video script concludes with an invitation for viewers to participate in a community challenge to share their GPT 40 use cases and potentially win prizes. It outlines the steps to participate and judges' criteria. The host also shares a vision for an AI learning community, offering free guides and event recordings to keep members updated on AI advancements. The script teases upcoming features and capabilities, emphasizing the rapid pace of development in AI technology.

Mindmap

Keywords

💡GPT 40 Model

The GPT 40 Model refers to an advanced version of an AI language model developed by OpenAI. It is characterized by its ability to understand and generate human-like text based on given prompts. In the video, it is discussed as having various use cases, from acting as an AI companion to performing complex data analysis, showcasing its versatility and potential impact on different fields.

💡Use Cases

Use cases are specific scenarios or applications where a particular technology, in this instance, the GPT 40 Model, can be utilized. The video explores a wide range of use cases, demonstrating the model's capabilities in areas such as customer support, education, accessibility for visually impaired individuals, and creative tasks like generating 3D models.

💡AI Companion

An AI companion is an artificial intelligence system designed to interact with humans in a natural, conversational manner. In the context of the video, the GPT 40 Model is described as an AI companion that can understand and express emotions, making it more human-like. It is used as an example of how the model can enhance user experience by providing instant responses without interrupting their workflow.

💡Voice Assistant

A voice assistant is a software agent that understands and responds to natural language commands given by users. The video mentions the GPT 40 Model's ability to function as a voice assistant, highlighting its potential to provide a more seamless and interactive user experience, such as guiding users through tasks or answering questions in real-time.

💡Multimodal Capabilities

Multimodal capabilities refer to the ability of a system to process and understand multiple forms of input, such as text, voice, and images. The GPT 40 Model's multimodal capabilities allow it to interpret and generate responses that can include elements of sarcasm and tone, making interactions with the AI more nuanced and human-like.

💡Accessibility

Accessibility in the context of the video refers to the use of AI to assist individuals with disabilities. The GPT 40 Model is shown to have features that can help visually impaired users by describing visual scenes or objects, thereby providing a more inclusive experience for people with different needs.

💡Code Interpreter

A code interpreter is a feature that allows an AI to understand, execute, and generate code. The video discusses how the GPT 40 Model can be used to analyze and manipulate code, which can be particularly useful for developers looking to automate certain programming tasks or create software more efficiently.

💡3D Object Synthesis

3D object synthesis is the process of creating three-dimensional models from two-dimensional inputs or descriptions. The video highlights the GPT 40 Model's ability to generate 3D models from simple prompts, which can be used in various creative and professional contexts, such as graphic design or virtual reality.

💡Educational Tool

An educational tool is any instrument or technology used to facilitate learning. The video presents the GPT 40 Model as an educational tool that can guide students through solving problems step-by-step, potentially transforming how education is delivered, especially for those who may struggle with traditional teaching methods.

💡Customer Support

Customer support refers to the assistance provided to customers in relation to a company's products or services. The video suggests that the GPT 40 Model could be used to enhance customer support by simulating conversations between a customer and a support representative, potentially improving efficiency and user satisfaction.

💡Technical Reports

Technical reports are documents that communicate the findings of technical research or analysis. In the video, it is mentioned that users are testing the GPT 40 Model's ability to process and summarize massive technical reports, which could save time and provide a more digestible format for complex information.

Highlights

GPT 40 model offers a wide range of use cases from personal productivity to professional fields like medical diagnosis and education.

The model can act as an AI companion, providing instant responses without interrupting the user's workflow.

Users can simulate conversations between multiple personas, potentially useful for debates or arguments.

The AI can modulate its voice, offering a range of vocal expressions from human-like to robotic tones.

GPT 40 model has the potential for medical applications, such as melanoma detection and pulmonary distress analysis.

The model can analyze spreadsheets and generate charts and visualizations, enhancing its utility in data analysis.

AI Advantage Community member used GPT 40 to analyze the conflict between Drake and Kendrick Lamar, showcasing its ability to handle complex data sets.

The model can act as a game host or meeting facilitator, summarizing events and directing conversations.

GPT 40 can serve as an educational tool, guiding users through solving problems step by step.

The model has the ability to understand and replicate sarcasm, thanks to its multimodal capabilities.

GPT 40's vision feature can assist people with no eyesight by describing the environment and events.

The model can act as a customer support representative, simulating conversations between customers and support agents.

Developers can integrate GPT 40 into AI-powered IDEs, leading to faster and more efficient code writing and testing.

GPT 40 can generate consistent text and styles, creating fonts and visual representations of logos and words.

The model can create images representing original characters with a single reference image, maintaining character consistency.

GPT 40 introduces 3D object synthesis, generating images that can be reconstructed into a simple 3D model.

The model can create 3D objects using the code interpreter, with examples including the quick creation of an STL file for a table.

A community challenge has been set up to explore and share use cases for GPT 40, encouraging innovation and practical application.

Transcripts

play00:00

open eyes GPT 40 model is here and one

play00:02

of the main questions is what can I use

play00:04

this for and that's why today we'll be

play00:05

looking at all the different use cases

play00:06

that open eyes showed us but also what

play00:08

the internet has come up with so far if

play00:10

you want to learn about the details of

play00:11

this announcement what it includes and

play00:13

how it works I created a separate video

play00:14

on that that I'll link on screen right

play00:16

now but with that being said let's get

play00:17

into some of the first use cases

play00:19

starting out with their announcement

play00:20

block post but I'll be adding in various

play00:21

use cases that I found across the

play00:23

internet in between some of these but

play00:24

this is a great place to start because

play00:26

they created a batch of videos showing

play00:28

off different things that you can do

play00:29

with it and I'll just be giving you

play00:31

quick summaries and my take you can

play00:32

watch all of these by yourself in full

play00:34

length if you so desire links below but

play00:36

hold up it doesn't even end there

play00:37

because look a lot of people are viewing

play00:39

this Channel and all of us are AI users

play00:40

some of us are AI power users and I was

play00:42

just wondering wouldn't it be amazing to

play00:44

find out what everybody that is watching

play00:46

this video actually uses this for so

play00:48

we're doing a brand new thing where we

play00:49

issue a challenge and then we have a

play00:51

public space where you can review all

play00:52

the people's submission to the challenge

play00:54

and the challenge is all about finding

play00:55

GPT 40 use cases that work for you so

play00:58

stick around because at the end of the

play00:59

video I'll give you more details and

play01:00

I'll tell you exactly how to participate

play01:02

and how to view what other people have

play01:04

been doing with gbt 40 so let's actually

play01:06

start with a clip that came out in the

play01:08

recent hours it's an interview of Sam

play01:10

Alman and he was asked the following

play01:12

question now check out his answer which

play01:13

hints at something that all of us might

play01:15

be doing in a few weeks here are there

play01:16

use cases that you've gravitated to one

play01:19

surprising is putting my phone on the

play01:22

table while I'm like really in the zone

play01:25

of working and then without having to

play01:27

like change Windows or change what I'm

play01:28

doing using it just like like another

play01:30

channel so I'm like working on something

play01:32

I would normally like stop what I'm

play01:34

doing switch to another tab Google

play01:36

something click around or whatever but

play01:38

while I'm like still doing it to just

play01:40

ask and get like an instant response

play01:42

without changing from what I was looking

play01:44

at on my computer that's been a

play01:45

surprisingly cool thing so let's talk

play01:47

about this one on top I think they

play01:48

picked this because it has the most

play01:50

human characteristics and it really

play01:52

shows the capability of this new version

play01:54

of jat GPT to be an AI companion matter

play01:57

of fact the voice is so hot that people

play02:00

across Twitter have been complaining

play02:01

what is this this is not an assistant

play02:02

this is a flirty girlfriend from what I

play02:05

can see it looks like you're in some

play02:07

kind of recording or production setup AI

play02:09

girlfriends have arrived and look this

play02:12

is one of the big new improvements it's

play02:13

way more human it doesn't just Express

play02:15

emotion but it also understands emotion

play02:17

from the phone's camera right here and

play02:19

here's just guessing something based on

play02:20

the information you provide it with

play02:22

which might not be very practical but

play02:24

some of these others definitely are so

play02:25

let's move on because in this example

play02:27

Greg Brockman actually uses two phones

play02:28

with two GPD 4 O's talking to each other

play02:31

and this shows of a capability that we

play02:33

already had in GPT 4 but now it's

play02:34

upgraded to the next level namely I'm

play02:36

talking about the capability of setting

play02:37

up multiple personas have't talked to

play02:39

each other with two phones you can

play02:40

simulate various conversations and you

play02:42

could apply this to your very own

play02:43

context this conversation might be a

play02:45

debate you might be pairing for or an

play02:47

argument you could just play it out

play02:48

between the two phones I mean that's

play02:50

amazing but also kind of freaky right

play02:52

and in the end they also show the

play02:53

ability for the app to modulate the

play02:56

voice so it can SN for you it can sound

play02:58

like a robot like they showed in the

play02:59

demo have a quick

play03:05

listen this essentially hints at the

play03:07

fact that you're going to be able to

play03:07

filter The Voice just like I have a

play03:09

soundboard here and I can do things like

play03:10

this these capabilities are going to be

play03:13

very surprising to most I can't wait to

play03:15

show my grandmother these updates now

play03:18

your AI assistant is going to be able to

play03:19

do that too and look some of these use

play03:21

cases might seem like a nice toy to have

play03:23

but if you get a little serious about it

play03:25

and you think about how this is

play03:26

applicable to some professional Fields

play03:28

you get something like this top comment

play03:29

on my latest YouTube video which as I

play03:31

mentioned summarizes this entire update

play03:33

as I asked for use cases that people are

play03:34

excited for male care here says melanoma

play03:37

detection retina exams pulmonary

play03:39

distress analysis this one I'll quickly

play03:41

have to look up ah okay so it's a

play03:42

diagnosis for breathing

play03:45

difficulties and yeah comments point out

play03:47

this is going to be for diagnosis not

play03:49

treatment but these things are amazing

play03:51

and admittedly this comment is a bit

play03:53

speculative but as you can see it

play03:54

clearly captured the imagination of a

play03:56

lot of people and talking about

play03:57

work-related capabilities apparently the

play03:59

benchmarks do translate to use cases

play04:01

because as we looked at in the summary

play04:02

video it performs better on almost all

play04:04

benchmarks when you compare it to the

play04:06

other best AI models in the market right

play04:07

now and this includes upgrades to Vision

play04:10

but also things like the code

play04:11

interpreter so you can actually upload

play04:13

files and do things like this now more

play04:15

effectively where you give it an Excel

play04:16

sheet and it analyzes the spreadsheet

play04:18

and does deep Technical and statistical

play04:20

analysis and generates chart and

play04:22

visualization this is a very simple

play04:24

prompt that you could reapply to your

play04:25

own charts and get these results

play04:27

immediately and this is not the voice

play04:28

assistant that might not be available to

play04:30

you yet this is the GPT 40 model that

play04:32

all plus users have now and all free

play04:34

users will have very soon okay and

play04:35

here's one that really caught my

play04:36

interest this comes from within the AI

play04:38

Advantage Community where a member used

play04:39

it to analyze the conflict between Drake

play04:41

and Kendrick not sure if you caught this

play04:43

recently they're beefing publicly now

play04:45

and they created dis tracks about each

play04:47

other and if you're not following this

play04:48

closely it gets messy very quickly

play04:50

there's a lot of news coming out I

play04:51

personally am not following this closely

play04:53

but look at this conversation with gbt

play04:55

40 he uploaded two CSV files that

play04:57

included the different events that

play04:58

happened with dates next to them now

play05:00

here's an important detail these data

play05:02

sets also include Google Trends data

play05:04

with how popular were both Drake and

play05:06

Kendrick in Google search over the last

play05:09

few months so how would you compile this

play05:10

into some visualization I'm not exactly

play05:12

sure but we don't have to worry about

play05:14

that let's just let gp4 oh figure this

play05:16

out so the whole interaction starts by

play05:18

Daniel uploading these files and then

play05:19

prompting it and then with simple

play05:20

conversational prompts he managed to

play05:22

make sense of all of the data and for

play05:24

the conversation if you want to read

play05:25

this you can stop the video at any point

play05:26

in time I'll slowly scroll over this he

play05:28

managed to create visualizations on top

play05:30

of which he could iterate I'll show you

play05:32

the final result here in a second okay

play05:33

because this chat history doesn't

play05:35

include images but it basically goes

play05:36

ahead it Maps out everything that is in

play05:38

the Excel sheets it reconstructs a

play05:40

timeline from all the events scattered

play05:42

across the Excel sheets and then using

play05:44

the web browsing tool that is super

play05:46

Snappy now because we have increased

play05:48

speeds it adds further context so you

play05:50

can look for new stories add that into

play05:52

your conversation like so look it pulls

play05:54

up Wikipedia and Hollywood Life articles

play05:57

that recently happened and then it

play05:58

extends the the timeline that you

play06:00

provide it with now you have all of this

play06:02

in your conversations you can prompt it

play06:03

with something like redo your analysis

play06:05

with the added context and then it

play06:06

finally creates a visualization that

play06:08

wraps all of that information into one

play06:11

image and then here it is and I think

play06:13

this is actually really useful if you're

play06:14

interested in this topic you see a

play06:16

timeline of events dating back to the

play06:17

13th of April 2024 with the first

play06:20

freestyle releasing and then it's all

play06:22

mapped against the Google Trends data on

play06:24

how popular the terms Drake and Kendrick

play06:26

were in Google search matter of fact it

play06:29

creates two different views with two

play06:30

different scales of the timelines isn't

play06:32

this incredible I think it is and sure

play06:34

it's not perfect look it made a little

play06:36

mistake down here with the date but this

play06:37

data does track with the original files

play06:39

that were provided to it and just think

play06:41

about the possibilities here you could

play06:42

easily be pulling Google Trends data and

play06:44

mapping that onto different events by

play06:46

yourself now really simply and it's not

play06:48

going to take you 20 minutes anymore

play06:49

because before running the code

play06:50

interpreter it took about a minute or

play06:51

two every single time running web search

play06:54

gave you one or two results and that

play06:55

also took about a minute or two if you

play06:57

didn't like the results you had to

play06:58

reprompt it and sit there for another

play07:00

full minute that's not the case anymore

play07:01

look in real time I'll put in a simple

play07:03

prompt and it's already searching the

play07:05

internet it found five sites and before

play07:07

I managed to finish the sentence it gave

play07:09

me all the links and a summary of what

play07:10

happens in those articles this is a

play07:12

massive change in user experience and

play07:14

the fact that they're making this

play07:15

available to everybody at this speed

play07:17

level is going to open a lot of people's

play07:20

eyes so how about this next video where

play07:21

he's preparing for an interview and he

play07:23

puts on a hat that might be a bit

play07:24

inappropriate for job interview in this

play07:26

interview prep use case the thing that

play07:27

fascinated me was actually the fact that

play07:29

it picked up on the fact that it had to

play07:30

deploy empathy to talk to him just

play07:33

listen to this answer this is not a

play07:35

neutral factual answer like take off the

play07:37

hat this is inappropriate have a listen

play07:39

what do you

play07:42

think Rocky that's quite a statement

play07:46

piece I I mean you you'll definitely

play07:48

stand out though maybe not in the way

play07:51

you're hoping for an interview okay I

play07:54

got it what a fantastic response and the

play07:56

display of emotional awareness of this

play07:58

new jet gbt model impressive let's move

play08:00

on to the next one which is the fact

play08:01

that it can act as a Game host and as a

play08:04

parallel to this they also show this

play08:05

other use case where you can use it as a

play08:07

meeting AI essentially where it

play08:10

facilitates the meeting and then in the

play08:11

end summarizes everything and I think

play08:13

this is absolutely amazing if you give

play08:14

it access to your screen and all the

play08:17

audio it can effectively direct the

play08:18

conversation and wrap up the meeting I

play08:21

don't know how well this will work in

play08:22

practice the main thing that I would be

play08:23

looking at here is the context

play08:25

limitation they only show a 2-minute

play08:26

demo how is this going to perform on a

play08:28

1our meeting it probably doesn't have

play08:29

have enough context length but we have

play08:31

yet to find out this voice assistant is

play08:32

rolling out over the next weeks right

play08:34

now we just have the GPT 40 model

play08:36

without the voice assistant either way

play08:37

you're going to be able to use this as a

play08:39

game master whether it's rock paper

play08:40

scissors Dungeons and Dragons or the

play08:42

same concept but in a work context which

play08:44

would be a meeting let's have a brief

play08:45

look get ready and three two one shoot

play08:49

let's see those hands who won and it's

play08:52

another tie going back to the voice

play08:53

assistant use cases I even featured this

play08:55

one in the summarization video because I

play08:57

just figured that this would be so

play08:58

groundbreaking for Education young

play09:01

people or anybody trying to learn a new

play09:02

skill you can just open up chat GPT on

play09:04

let's say your iPad on one part of the

play09:06

screen and on the rest of the screen

play09:07

you're solving some problem and then by

play09:09

talking to it and using the pen and

play09:11

highlighting having a conversation like

play09:13

you would have with a tutor it can guide

play09:15

you through the entire problem step by

play09:17

step kind of like a human would can you

play09:19

find which one is the hypotenuse um I

play09:22

think the hypotenuse is this really long

play09:24

side from A to B would that be correct

play09:27

exactly well done amazing and look I

play09:29

know educational is a controversial

play09:31

topic we actually hosted a little post

play09:32

event discussion in our private AI

play09:34

Advantage Community where several people

play09:35

brought up that actually the educational

play09:37

sector is the one that has the most

play09:39

resistance to this technology because

play09:40

the teachers just feel like this is a

play09:42

direct replacement to what they're doing

play09:43

it lacks the human touch plus students

play09:45

can cheat on all these assignments

play09:47

homeworks Etc right so look I think that

play09:49

makes perfect sense but simply put a lot

play09:51

of people that struggle in school will

play09:52

have this technology available to them

play09:54

and they will have an alternative to

play09:56

Simply failing out of their classes I

play09:58

remember moments in high school and

play10:00

University were particularly on

play10:01

accounting and then later in University

play10:03

during statistics classes I was so lost

play10:06

I didn't even know where to start and at

play10:07

that point the tutor wasn't really an

play10:08

option cuz we simply couldn't afford it

play10:10

and I very vividly remembered these

play10:12

moments of pure desperation where I just

play10:14

didn't know where to go from that point

play10:15

I looked at the textbooks but I didn't

play10:17

understand anything I attended the class

play10:19

it didn't make any sense to me the

play10:20

friends that I had were busy thinking

play10:21

about the weekend so I was completely

play10:23

lost and if I had something like this I

play10:25

would have gave it a shot and it would

play10:26

have probably helped in some of those

play10:28

classes so just from that experience

play10:29

experence alone I do see the benefit of

play10:31

this for many people while I don't think

play10:32

it replaces a human teacher I think for

play10:35

now it's a fantastic addition to how our

play10:37

system works and let's be real the

play10:38

school system is broken in many places

play10:40

maybe rethinking it while considering

play10:42

tools like this could be the change we

play10:44

need just think about you learning a new

play10:46

piece of software and this guiding you

play10:47

while you do that amazing but hey at the

play10:49

end of the day I'm just here to point

play10:51

out what's new and I think this is one

play10:52

of the most amazing use cases now let's

play10:54

move on to the next one okay here's an

play10:56

interesting one that really was a big

play10:58

problem up until last it's sarcasm

play11:00

rarely was AI able to pick up on sarcasm

play11:02

and now it can actually replicate it and

play11:04

use sarcasm this is possible because

play11:06

this new model is natively multimodal or

play11:09

omn modal as they name it it's not fre

play11:11

separate steps of transcribing The Voice

play11:13

to Text then using chat to process the

play11:15

text and then afterwards turning the

play11:17

text back into voice it all happens in

play11:19

one and you get things like the

play11:20

capability to be sarcastic things

play11:23

sarcastic all the time isn't exhausting

play11:25

or anything I'm so excited for this H

play11:29

kind of incredible more of a capability

play11:31

than a use case but interesting

play11:33

nevertheless if you're enjoying this

play11:34

video that obviously took a lot of work

play11:35

to put together hit the like button it

play11:37

really helps the channel but with that

play11:38

being said let's move on to the next one

play11:40

okay this next one is particularly

play11:41

interesting and that is this

play11:42

accessibility feature where you're going

play11:44

to be using gp40 Vision to help people

play11:47

with no eyesight try and tell me exactly

play11:50

what they're doing right now please um

play11:52

right now the Ducks are gently gliding

play11:54

across the water this last part left be

play11:56

a little speechless I'm just going to

play11:58

play too I E even know when a taxi is

play12:00

coming with its orange light on I think

play12:02

I'll hail it to get

play12:04

home yes I spotted one just now it's

play12:08

heading you away on the left side of the

play12:10

road get ready to wave it

play12:16

down great job hailing that taxi I mean

play12:19

wow you can only imagine how

play12:21

transformative this would be for people

play12:23

without eyesight or other limitations

play12:25

and one interesting thought I had here

play12:26

was that there's situations in life

play12:28

where you yourself can't really see

play12:29

something or your attention might be

play12:31

divided between multiple tasks and it

play12:32

would be just great to have a second

play12:34

pair of eyes right and I found a similar

play12:37

idea on Twitter here where kit I suppose

play12:39

came up with this use case for GPD 4

play12:41

where as a parent you could set up your

play12:42

phone to watch your kids for a second

play12:44

and look I can already see a comment

play12:46

section of like seriously you want your

play12:48

AI assistant to watch over your kid look

play12:50

fair enough maybe not in this version

play12:51

then you absolutely have to test it

play12:52

first but just this idea of like hey

play12:54

just let me know if my son starts

play12:56

crawling off to the side and you place

play12:57

him in the middle of the living room and

play12:59

then you're going to be able to leave

play13:00

the room for like 20 seconds I think

play13:01

that sounds pretty feasible but again

play13:03

this is for people themselves to decide

play13:05

and over time the tech is obviously

play13:06

going to be good enough for that to be a

play13:07

reasonable use case overall I just want

play13:09

you to realize that phones are soon

play13:11

going to turn into a second set of eyes

play13:13

with a certain amount of intelligence in

play13:15

them and that's just going to open up

play13:16

opportunities that we haven't even

play13:17

considered right now oh and while we're

play13:19

kind of on the topic of child care

play13:20

there's this one use case in here where

play13:22

he actually lets the phone sing a laabi

play13:25

and then he adjusts the voice to be

play13:26

softer more silent a little louder and

play13:29

then it comes up with a song on the spot

play13:32

I think this one is indisputably amazing

play13:34

and I guess this falls into the

play13:35

entertainment category but that also is

play13:37

a thing that you could use this for oh

play13:39

Majestic potato spoons of clo okay okay

play13:43

it's it's a little too whispery maybe

play13:45

maybe go like a little

play13:47

louder got it let's find that sweet

play13:51

spot oh Majestic

play13:54

potato in the moon

play13:57

sof creepy all right so let's switch

play13:59

gears and let's talk about something

play14:01

that will inarguably be useful for

play14:03

businesses which is a customer support

play14:05

rep that is going to be able to handle

play14:06

tasks now they do have this two-minute

play14:08

video in here where they use two phones

play14:09

to simulate a conversation between a

play14:11

customer and a customer support rep

play14:13

awesome all right I've just sent the

play14:16

email can you check if Joe received it

play14:19

and I think this one is particularly

play14:20

interesting to talk about because it

play14:21

hints at the future of these products

play14:23

right because for this you absolutely

play14:25

need Integrations with other tools and

play14:27

that is something that chat doesn't

play14:29

really offer you have actions within the

play14:31

gpts but doing a full customer support

play14:33

agent in there is just not feasible as

play14:35

of now you would need multiple actions

play14:36

you would need reliability you would

play14:38

need longer context length but just by

play14:40

them uploading this video this clearly

play14:42

signals to me that this is a direction

play14:43

that they'll take it in not as if that

play14:45

was any news to me I was already when

play14:46

gpts came out back in November I

play14:48

remember saying that this is the

play14:50

clearest sign by them that this is the

play14:51

future of the product it's this

play14:52

independent AI that have a set of tools

play14:54

that they can access and then use those

play14:56

tools to act on their own behalf by the

play14:58

way this was even pointed out by Sam

play15:00

Alman in a recent interview he said that

play15:02

there's really two directions this can

play15:03

go in one of them is kind of this

play15:05

assistant that will help you do your

play15:06

work better and the second one is a

play15:08

senior employee where it will not just

play15:09

act by itself but also have a certain

play15:11

level of autonomy to override your

play15:13

decision- making and your prioritizing

play15:15

because it's a senior employee so that's

play15:16

what you can expect from this overtime I

play15:18

think for now this isn't really there

play15:20

yet they even pointed out that this is

play15:21

just a proof of concept nevertheless I

play15:23

wanted to talk about it as it might show

play15:24

you a glimpse of the future and what

play15:25

we'll be getting soon here okay so

play15:27

here's another use case and this one is

play15:28

more development focused and this is all

play15:30

about integrating GPT 40 into this AI

play15:33

powered ID if you're not familiar that's

play15:35

a software that developers use to write

play15:37

and test code basically they were

play15:38

extremely fast in integrating GPT 40 I

play15:41

mean they took less than 24 hours to

play15:42

integrate it and every single chat GPT

play15:45

rapper as people call them will have

play15:47

this upgrade very quickly because all

play15:49

you need to do is exchange one line of

play15:51

code and you have this better model and

play15:53

all the use cases we talked about in

play15:54

this video will very quickly arrive in

play15:57

the applications that you might be using

play15:59

already Plus for the developers it's a

play16:00

50% saving in cost as it's 50% cheaper

play16:03

to use all of this overnight that's

play16:05

pretty amazing anyway why is it

play16:06

significant because people across the

play16:08

internet report that the coding

play16:09

abilities are actually improved but this

play16:11

did cause a little discussion because it

play16:13

seems to be improved in certain areas

play16:14

and worse than others and here's just a

play16:16

quick example of what Sawyer Hood

play16:18

managed to do with this in the first 24

play16:19

hours and that is rebuild Facebook

play16:21

Messenger with one prompt and he's

play16:24

reporting that GPT 40 did this in 6

play16:26

seconds I mean look at that that's wild

play16:28

this is one HTML file we're getting

play16:30

closer and closer to people being able

play16:32

to create games of their own in just a

play16:34

single sentence we might not be there

play16:35

yet but this is going to get wild soon

play16:37

and talking about technical capabilities

play16:39

there's also this brand new ability to

play16:41

generate consistent text matter of fact

play16:43

it's so good that you can do something

play16:45

like text to font where you prompt it to

play16:47

create the basic letters and then you

play16:48

can keep prompting it to give you

play16:50

various Styles look at that futuristic

play16:52

here it created a Victorian style font

play16:54

and what that means is that they solved

play16:55

consistency of transferring text to

play16:57

other objects so as you can see here

play16:59

image of a coaster their logo it

play17:00

perfectly combines the two like a week

play17:02

ago you needed to know Photoshop to do

play17:04

this stuff pretty wild and there's all

play17:05

these other use cases here like taking a

play17:07

logo and mapping a word on top of it or

play17:09

writing a poem and then visualizing it

play17:11

as if it was handwritten look at that

play17:12

want it in a different color no problem

play17:14

and while we're on a topic character

play17:16

consistency is one of the main things

play17:17

that has been missing in many tools mid

play17:18

Journey added it recently but now also

play17:20

with GPT 40 you're able to generate a

play17:23

robot and then create multiple images of

play17:25

it like so while maintaining the same

play17:27

character so you can tell stories now

play17:29

instead of just generating one image and

play17:31

then if you regenerate the robot it

play17:32

looks like a whole different character

play17:34

oh and here's one more interesting

play17:35

capability and that is creating images

play17:37

that represent original with a single

play17:38

reference image this is not the easiest

play17:40

problem it has been solved by some other

play17:42

tools but now you will have all of this

play17:43

packaged into one tool so you can upload

play17:45

an image of yourself and then recreate

play17:46

it as a caricature then just a quick

play17:48

note before you jump into chat GPT a lot

play17:50

of these things will be shipping over

play17:51

the course of the next week so you might

play17:53

not have it yet the voice assistant or

play17:54

the availability for free to all users

play17:57

all of these things are coming over the

play17:58

next week but that's not going to stop

play17:59

us from exploring what's possible

play18:01

because I have a few more here and one

play18:02

of them is this 3D object synthesis okay

play18:04

this is something nobody really expected

play18:06

from them in this announcement and they

play18:08

didn't even mention it in the live

play18:09

stream it's just hidden in their blog

play18:11

post under this one option here so from

play18:13

a simple prompt and giving it view zero

play18:15

view 1 2 3 4 and then view five it

play18:17

generates five images of the same thing

play18:19

and then you can reconstruct that object

play18:21

from the six generated images so this is

play18:23

a combo of the consistency that we

play18:24

talked about a second ago with the robot

play18:26

and then this new capability to

play18:27

construct multi images into a simple 3D

play18:30

model here you have the same thing

play18:32

happening with a c line and if it wasn't

play18:33

clear yet they really did solve how to

play18:35

represent letters inside of these images

play18:38

all of the AI image generators could not

play18:40

solve text yet maybe with the exception

play18:42

of ideogram but their quality was not up

play18:44

to par with some of the leading models

play18:45

like Firefly mid Journey or stable

play18:47

diffusion Excel all of these top models

play18:49

couldn't do text but now you can prompt

play18:51

it with the exact text you want there

play18:53

and create mockups like this where it

play18:54

gets every single letter right and sure

play18:56

if you look at the buttons here you get

play18:57

some funkiness like what what kind of

play18:59

space bar is this what's up with these

play19:00

buttons but that is being nitpicky if

play19:02

you're creating social media content the

play19:03

main thing you want to get right is the

play19:05

fact that the text is correct and that

play19:07

is a capability now there's actually one

play19:09

more thing that I found in reference to

play19:10

the 3D object generation and that is the

play19:12

fact that you can actually use the code

play19:14

interpreter to create 3D objects in

play19:16

around 20 seconds as Min Choy reports

play19:18

here this table is the result of the

play19:20

generation and he simply created it with

play19:22

a simple prompt that says make a STL

play19:24

file of a table with four legs and

play19:26

random attributes 20 seconds later

play19:28

finished work working and here you have

play19:30

the result and just mind you this is the

play19:31

beginning of the 3D capabilities so I

play19:33

just want to point towards one of these

play19:34

top comments which is like hey just

play19:35

remember this was mid Journey version

play19:37

one in January of 2022 okay these things

play19:40

move fast and a simple table like this

play19:42

is version one so I had an idea on how

play19:44

to really superpower this video and how

play19:46

to show you what not just you can do but

play19:48

what also other people are already doing

play19:50

with this what I did is I set up a

play19:51

community space where you can simply

play19:53

sign up for free and you can share what

play19:55

you're using GPT 404 and at the very

play19:58

least you can reach what other people

play19:59

are using it for matter of fact I

play20:00

packaged this in a challenge that is now

play20:02

running for one week if you want to

play20:03

check out the details and participate in

play20:05

win prizes you can do so it's going to

play20:06

be the first link in the description but

play20:08

basically we created a step-by-step

play20:10

guide and clear judging criterias and

play20:12

what tools you will be using and how we

play20:13

recommend you present the challenge it's

play20:15

all in this brief guide if you want to

play20:16

participate but what you really need to

play20:18

know is this gp4 to GPT 4 is a major

play20:20

leap and they made GPT 40 available to

play20:24

everyone for free now fair enough a lot

play20:26

of the capabilities pointed out today

play20:27

like the voice assistant are not going

play20:29

to be available to everyone but as you

play20:30

can clearly see this opens up a whole

play20:32

new world of use cases and here in this

play20:34

public space that is freely available

play20:36

all you need to do is create a free

play20:37

account on the platform couldn't be any

play20:38

simpler and then you can share your very

play20:40

own use cases here win prizes or review

play20:43

what other people are doing with GPT 40

play20:45

today like s here that is using it to

play20:47

reorganize his bookshelf or Justin

play20:49

giving it massive technical reports and

play20:51

seeing how well it can handle it he

play20:52

uploaded a detail file on the exact

play20:54

prompts and the workflow that he used

play20:56

and yes this challenge idea is just a

play20:58

small part of of a wider Vision that I

play20:59

had for AI learning community that makes

play21:01

sure you stay up to date with all these

play21:03

skills that are going to change the

play21:04

world over the coming months and years

play21:06

and we starting to do more and more in

play21:07

public like this challenge but as you

play21:09

can see this is Challenge number 19

play21:10

we've been doing this for months in our

play21:12

paid community that is all about

play21:13

learning and applying these tools just

play21:15

one more thing to point out the main

play21:16

focus of the community is actually

play21:17

providing you with learning materials

play21:19

like these free guides that are also

play21:20

available on the website or some of

play21:21

these event recordings in this one for

play21:23

example I show you how to integrate

play21:24

stripe checkout into a GPT and I

play21:26

basically bundled everything I know I

play21:28

have in to here so in the Learning

play21:29

Center for paid community members you

play21:30

actually get the full prompt engineering

play21:32

course you get 40 hours of event

play21:33

recordings and much more so there you go

play21:36

a lot of use cases generated by the AI

play21:37

Advantage Community a bunch of free AI

play21:39

learning resources and will keep you

play21:41

updated on the YouTube as a lot of these

play21:43

capabilities shipped to the public I

play21:45

sincerely hope this video was helpful to

play21:46

you and if you want even more use cases

play21:47

I'll be doing a live stream on the 21st

play21:49

of May where we evaluate all the results

play21:51

of this Challenge and have a look at all

play21:52

the use cases that people submitted to

play21:54

this challenge that I issued all right I

play21:56

hope you have a great day

Rate This

5.0 / 5 (0 votes)

相关标签
AI CompanionProfessional UseInteractive LearningVoice AssistantMultimodal AIEmotional AITechnical AnalysisEducational ToolSarcasm DetectionAccessibility AidCustomer SupportCoding Assistance3D Object SynthesisCommunity ChallengeAI LearningGPT 40 ModelWebinar Insights
您是否需要英文摘要?