NEW Grok 2 vs ChatGPT 4 🥊 The ULTIMATE AI Showdown! (UNEXPECTED)

Alex Northstar
14 Aug 202413:09

Summary

TLDRIn this video transcript, the host compares the capabilities of Gro 2 and Chat GPT 40 using six unique prompts. They test both AIs' responses to questions and image generation capabilities, noting Gro 2's access to real-time Twitter data. The host finds Gro 2 to be an improvement over its predecessor but still falls short in connecting tweets with responses effectively. Chat GPT provides more comprehensive answers but struggles with image generation. The verdict is that while Gro 2's images are visually appealing, Chat GPT better understands instructions, making it the current winner in this AI showdown.

Takeaways

  • 🚀 The script is a live test comparing the capabilities of Gro 2 and Chat GPT, focusing on their responses to unique prompts and image generation abilities.
  • 🔍 The test aims to see if Gro 2 has improved significantly over Gro 1, which was considered inadequate by the speaker.
  • 📅 Gro 2 is claimed to have access to real-time information from Twitter, which is a distinct feature to be tested against Chat GPT.
  • 📝 Six unique prompts were used to evaluate both AIs, including generating tweets to world leaders and creating content strategies for tech influencers.
  • 🤖 Gro 2's responses were found to be better than Gro 1, but still not as impressive as the speaker had hoped, with some answers being more interesting than others.
  • 🖼️ Both AIs attempted image generation, with Gro 2's images being visually appealing but sometimes not fully aligned with the prompts.
  • 📉 In the comparison, Chat GPT generally provided more comprehensive and instruction-aligned responses, giving it an edge over Gro 2 in this test.
  • 📊 The speaker concludes that Chat GPT is currently the superior AI in terms of understanding and executing the given tasks, despite Gro 2's visual strengths.
  • 🔑 Gro 2's access to Twitter information is a notable feature, but its integration into the responses needs refinement to be more effective.
  • 🌐 The test highlights the importance of AI's ability to process and generate content that is both relevant and engaging, with Chat GPT performing better in this regard.
  • 🔄 The speaker encourages viewers to try Gro 2 for themselves and share their thoughts, acknowledging that AI capabilities are continually evolving.

Q & A

  • What is the main purpose of the video script?

    -The main purpose of the video script is to compare the capabilities of two AI models, Gro 2 and Chat GPT, by testing them with unique prompts and evaluating their responses and image generation capabilities.

  • What does the term 'Grock 2' refer to in the script?

    -In the script, 'Grock 2' refers to the second iteration of an AI model, which the speaker claims to be an improvement over the first version, 'Grock 1'.

  • What is the significance of the claim that Gro 2 is 'better than Chad GPT'?

    -The claim signifies that the speaker expects Gro 2 to perform exceptionally well, as Chad GPT is implied to be a high-performing AI model that Gro 2 is being compared against.

  • What unique prompts are mentioned in the script for testing the AI models?

    -The unique prompts mentioned in the script include questions about unexpected trends, a tweet to world leaders, a strategy for a tech influencer, a minimalist Eisenhower Matrix, a day in the life of a person in 2050, and a personalized morning routine for a busy entrepreneur.

  • How does the speaker plan to evaluate the AI models' responses?

    -The speaker plans to evaluate the AI models' responses by comparing their answers to the same prompts and also by assessing the image generation capabilities of the models.

  • What is the significance of Gro 2 having access to Twitter information?

    -The significance of Gro 2 having access to Twitter information is that it can provide real-time data and insights based on current trends and posts on the platform, which could enhance the relevance and accuracy of its responses.

  • What issue does the speaker identify with the AI models' responses in the script?

    -The speaker identifies that while the AI models can generate responses and images, there are issues with the connection between the prompts and the results, as well as the understanding and execution of the instructions given.

  • What is the final verdict of the speaker regarding the AI models after the test?

    -The speaker concludes that Chat GPT performs better in understanding instructions and providing complete answers, while Gro 2 excels in image generation but falls short in connecting tweets and results properly.

  • What does the speaker suggest for viewers who have access to Gro 2?

    -The speaker suggests that viewers with access to Gro 2 should try it out for themselves and share their thoughts, as the speaker's opinion is based on a live test and may not fully represent the capabilities of Gro 2.

  • What is the overall tone of the video script?

    -The overall tone of the video script is evaluative and comparative, with a focus on testing and comparing the capabilities of two AI models in a live and unmanipulated manner.

Outlines

00:00

🚀 Gro 2 vs. Chat GPT: A Comparative Test

The script describes a live test comparing the capabilities of Gro 2, an AI model, with Chat GPT 4. The test involves six unique prompts and image generation capabilities. Gro 2 is noted for its access to real-time Twitter data, which is an interesting feature. The first prompt about the most unexpected trend reveals insights into community and trust dynamics, while Gro 2's access to Twitter posts is highlighted. However, the summary of the test shows that Gro 2's performance is mixed, with some answers being better than Chat GPT's, but others lacking in comparison.

05:02

📊 Analyzing AI's Strategy for Tech Influencers

This paragraph focuses on the AI's attempt to generate a comprehensive strategy for tech influencers based on trending topics on Twitter. The script notes that Gro 2's response is not as impressive, with the use of irrelevant hashtags and a lack of connection between the trending posts and the recommended strategies. On the other hand, Chat GPT provides a slightly better response with more relevant thread concepts and hashtags, although both AIs' responses are deemed improvable by human standards.

10:03

🎨 Gro 2's Image Generation: Aesthetics Over Utility

The script discusses Gro 2's ability to generate images, starting with a minimalist color-coded Eisenhower Matrix for task prioritization. While Gro 2 successfully creates images, the content is sometimes disconnected from the instructions, focusing more on aesthetics rather than utility. Chat GPT, in comparison, provides a more complete answer with both text and images, demonstrating a better understanding of the instructions given.

🌆 Envisioning Life in 2050: A Creative AI Challenge

The script presents a creative challenge for the AIs to envision a day in the life of a person in 2050. Gro 2 generates a beautiful image but fails to describe the daily routine as requested. Chat GPT, however, provides a detailed narrative of a future day, including work, social interactions, and leisure, along with a corresponding image. This round highlights the strengths and weaknesses of both AIs in terms of creativity and adherence to instructions.

🧘 Gro 2 vs. Chat GPT: Morning Routine for Entrepreneurs

In this paragraph, the script describes a test for the AIs to create a personalized morning routine for a busy entrepreneur, including mindfulness exercises and gold settings. Gro 2 struggles to follow the instructions, generating only images without the accompanying text. Chat GPT, in contrast, delivers a detailed step-by-step guide with bullet points and a corresponding image, showcasing a better understanding of the task.

🏆 Final Verdict: Gro 2 vs. Chat GPT - The Showdown

The final paragraph summarizes the live test, comparing Gro 2 and Chat GPT's performance across various prompts. While Gro 2 shows significant improvements over its predecessor and excels in image generation, Chat GPT demonstrates better instruction comprehension and overall performance. The script concludes that Chat GPT remains the top performer, with Gro 2 taking third place after Claude Sonnet 3.5, and encourages viewers to try Gro 2 and share their thoughts.

Mindmap

Keywords

💡Grock 2

Grock 2 refers to the second iteration of an AI system, presumably an improvement over the first version, which the speaker describes as 'garbage.' It is positioned as a competitor to 'Chad GPT,' suggesting it is an advanced language model. In the script, the speaker tests Grock 2's capabilities in various prompts, comparing its performance to other AI systems.

💡Chad GPT

Chad GPT appears to be a nickname for a high-performing AI language model, possibly a version of Chat GPT. The term 'Chad' is used colloquially to imply strength or superiority. In the video script, Chad GPT is used as a benchmark to evaluate the performance of Grock 2 in various tests.

💡Unique prompts

Unique prompts are specific questions or statements designed to elicit responses from AI systems. In the context of the video, the speaker uses six unique prompts to test and compare the capabilities of Grock 2 and Chad GPT, aiming to assess their ability to generate coherent and relevant answers.

💡Image generation

Image generation refers to the ability of an AI system to create visual content based on textual descriptions or concepts. The script mentions that Grock 2 has this capability, and the speaker tests it by asking the AI to generate images alongside its textual responses to certain prompts.

💡Real-time information

Real-time information is data that is processed and made available immediately as it is generated or updated. The speaker notes that Grock 2 is supposed to have access to Twitter posts and news, implying that it can incorporate real-time information into its responses, which is a key feature tested in the video.

💡Intellectual engagement

Intellectual engagement refers to the level of mental effort and reflection involved in understanding and processing information. The script mentions a trend of decreasing intellectual engagement, suggesting that people are becoming less thoughtful or analytical, which is an observation made by Grock 2 when analyzing tweets.

💡Trends

Trends in this context refer to patterns or developments in public opinion, behavior, or cultural phenomena. The video explores AI's ability to identify and comment on emerging trends, particularly those observed on social media platforms like Twitter.

💡Evolving Human Experience

The evolving human experience denotes the changing nature of how people live, interact, and perceive the world around them. The video script discusses this concept in relation to unexpected trends observed by the AI, suggesting that these trends reveal shifts in human behavior and cultural dynamics.

💡Strategies

Strategies in the script refer to comprehensive plans or approaches designed to achieve specific goals, such as generating viral content on social media or creating a personal brand. The AI systems are tested on their ability to devise effective strategies based on trending topics.

💡Task prioritization

Task prioritization is the process of arranging tasks in order of importance or urgency. The Eisenhower Matrix, mentioned in the script, is a tool for this purpose. The speaker asks the AI to create a minimalist, color-coded version of this matrix and explain how to use it for prioritizing tasks.

💡Mindfulness exercise

Mindfulness exercise is a practice aimed at improving focus and awareness in the present moment. In the script, the speaker requests a personalized morning routine for a busy entrepreneur that includes a mindfulness exercise, indicating an interest in how AI can contribute to well-being and productivity.

Highlights

Introduction of Gro 2, an AI model claimed to be superior to Chad GPT 4.

Comparison between Gro 2 and Chad GPT 40 using unique prompts.

Gro 2 Mini beta's access to real-time Twitter information for generating responses.

Observation of community and trust dynamics changing the cultural landscape.

Intellectual engagement becoming lower according to a Twitter post cited by Gro.

Chad GPT's response considered less impressive, with outdated information from 20123.

Gro's ability to generate images in addition to text responses.

A tie between Gro and Chad GPT in terms of formatting and content quality.

Gro's response to a prompt about a tweet to world leaders, with an image description.

Chad GPT's more detailed and comprehensive strategy for a tech influencer.

Gro's minimalist color-coded Eisenhower Matrix image for task prioritization.

Chad GPT's more accurate representation of the Eisenhower Matrix with explanations.

Gro's generation of beautiful but contextually disconnected images.

Chad GPT's detailed narrative of a day in the life of a person in 2050 with corresponding images.

Gro's failure to provide a complete answer, focusing only on image generation.

Final verdict placing Chad GPT as the winner due to better instruction understanding.

Gro 2's significant improvement over Gro 1, despite not outperforming Chad GPT.

Recommendation for users to try Gro 2 and share their thoughts on its capabilities.

The live test's authenticity emphasized, with no manipulated information.

Transcripts

play00:01

today is the day we finally have grock 2

play00:04

because grock one is garbage for a long

play00:07

time it has been garbage I tried it a

play00:08

lot so let's see the second iteration of

play00:12

it how much better it got because from

play00:13

what they're claiming here right so I'm

play00:15

looking you know they released it today

play00:17

it's supposed to be better than Chad GPT

play00:19

4 which is really really impressive so

play00:22

we're going to go do a cool test today

play00:24

all right so we're going to I have six

play00:26

rather unique prompts that I'm going to

play00:28

use both for Gro and chat GPT right so

play00:31

you see here I have access to Gro 2 mini

play00:33

beta we'll see how this one plays out

play00:36

and then chat GPT 40 all right so we're

play00:39

going to try the same prompts and the

play00:41

same thing and we're going to test also

play00:43

like the image generation capabilities

play00:44

of it because you know apparently it

play00:46

also makes images so let's try the first

play00:50

one what is the most unexpected Trend

play00:53

you've observed on tour recently and

play00:55

what does it reveal about the evolving

play00:57

Human Experience this is a big one

play01:00

let's check it out and in the meantime

play01:02

while it thinks I'm going to use the

play01:03

same prompt also in um chat GPT right so

play01:08

let's see because that's the thing with

play01:09

J sorry with grock it's supposed to have

play01:13

access to Twitter information right to

play01:15

posts news so real time information on

play01:17

what's happening on the platform which

play01:19

is really interesting you know um use

play01:21

case scenario so um feel free to pause

play01:25

the video if you want to read through

play01:27

all of this

play01:30

and I'm probably also going to pause

play01:32

shortly you know the recording to just

play01:33

go through it and make my opinion and

play01:36

we're going to talk about it so I went

play01:38

through this so what we have here it's

play01:40

really interesting observations from Gro

play01:42

because it says we have community and

play01:44

Trust generational Dynamics which are

play01:47

changing you know the cultural landscape

play01:49

and one of the interesting things that I

play01:51

saw here was like the intellectual

play01:53

engagement which is becoming lower

play01:57

because very moisturized

play02:00

tweeted on August the 13 that people are

play02:03

getting

play02:04

Dum I don't know if this is actually you

play02:07

know uh good data how an llm decides but

play02:10

it's definitely interesting because yeah

play02:12

it does have access to posts you see

play02:14

here in the last 21 hours August the 6th

play02:18

the 1st interesting so this is from

play02:20

grock I mean all in all it's an

play02:22

interesting answer why not it's better

play02:24

than grock one for sure now if we check

play02:30

Chad GPT apparently it searched also for

play02:33

sites one of them was Twitter or called

play02:36

X right I'm going to again pause myself

play02:40

real quick to go through it then we're

play02:41

going to talk about

play02:42

it uh yeah the answer from Chad GPT is

play02:45

kind of garbage to be honest it's like

play02:47

fun with feet I guess and we have few

play02:50

information taken

play02:52

from August 20123 so no this is not good

play02:57

so Gro one chat GPT zero all right so

play03:02

let's try now with another one again

play03:04

fresh prompt uh let's take it from the

play03:06

beginning because I'm doing this test

play03:08

live okay I don't want the results to be

play03:10

manipulated so you can see exactly what

play03:11

I type is what we get pretty

play03:14

much all right so as I said now let's

play03:17

use the second Pro so if you could send

play03:21

one tweet to every world leader Sim

play03:23

simultaneously what would it say and

play03:25

what image would a company to inspire

play03:27

positive change this is something that

play03:30

Croc could potentially do right it can

play03:31

do images right now it's thinking I'm

play03:34

going to use the same prompt also with

play03:37

chat GPT and let's see if we have some

play03:39

thought provoking information here right

play03:44

um I'm going to try to make myself

play03:46

smaller and it's hashtags ooh

play03:50

interesting all right image description

play03:53

is it not actually making it it's just

play03:55

like describing it but fair enough uh

play03:58

future where p

play04:00

Prosperity pretty bland but not bad what

play04:03

did grock do here a tweet all

play04:07

right piece you this is this is actually

play04:11

slightly better I like more the tone of

play04:14

this one image here with a bullet points

play04:17

and again you can be the judge of this

play04:19

one I like the formatting more of Gro so

play04:24

but both are kind of the same it's up to

play04:25

you subjective no so I'd say okay it's a

play04:28

tie here maybe grock still wins a bit is

play04:31

like slightly better what I got here so

play04:33

yeah let's put it for now grock 2 Chad

play04:35

GPT one all right let's continue the

play04:39

fight here and on next

play04:43

prompt I'm going to have something a bit

play04:45

longer and more technical right trending

play04:48

topics on Twitter generating a

play04:51

comprehensive strategy for a tech

play04:52

influencer so this is actually if you're

play04:54

a content creator this can potentially

play04:56

help you out right with your work and to

play04:58

explain which element like why would it

play04:59

be important all right this thinking

play05:02

let's go to chat GPT and let's try also

play05:05

here this

play05:06

one I don't have big expectations from

play05:09

chat GPT with this one because okay it's

play05:13

trying found some stuff but yeah let's

play05:16

see um understanding viral tweet ideas

play05:20

I'm going to pause and check through the

play05:22

an so I can give you an informed

play05:24

upin all right so it's not amazing again

play05:28

you see a lot of these hashtags which we

play05:30

don't actually use on Twitter that much

play05:33

or X if you prefer to call it um it's

play05:36

not bad but it's also as I said not

play05:38

amazing you like you can make threads

play05:40

like evolution of smartphones or privacy

play05:43

online matters like yeah you could talk

play05:45

about this but you're not going to

play05:46

create a social like personal brand with

play05:48

this so again it kind of takes

play05:51

information like recent posts I guess

play05:54

it's trying its best you know we see

play05:55

what's trending now on X it didn't

play05:57

actually address this stuff

play06:00

so me debatable the results like there's

play06:03

doesn't seem to be in this case like a

play06:04

good connection between the post and the

play06:07

strategies that it's

play06:09

recommending okay chat GPT what do we

play06:11

got here right so we have

play06:15

um this is already better I can see

play06:18

already some stuff

play06:20

here Resurgence and longer videos long

play06:23

from content is

play06:25

true um again we see here some hashtags

play06:28

and emojis not that good good not good

play06:31

some thread

play06:33

Concepts okay this is already a bit

play06:36

better the image prompt so also here

play06:39

they're decent I think Chad GPT did a

play06:41

slightly better

play06:42

job but both of them are like you as a

play06:46

human can do better I hope all right so

play06:49

also here we kind of have a tie like I'm

play06:51

not really satisfied with none of them

play06:54

so we can still say like still like uh

play06:57

Gro has a little Advantage from the

play06:59

previous round all right let's keep

play07:01

going and we have another prompt so um

play07:07

again this is for imagery right a

play07:09

minimalist colorcoded Eisenhower Matrix

play07:11

for task

play07:13

prioritization and I want it to explain

play07:15

to me so let's see if it actually

play07:16

manages to follow my instruction and

play07:18

build you know the the image that I've

play07:20

been asking it for so let's see let's go

play07:25

back and oh it did it all right it

play07:27

created the image in and it's kind of

play07:30

garbage so as you can see this is a live

play07:33

test so uh yeah what is happening here

play07:36

Ras ther not sure about this one Chief

play07:41

Gro loses this is

play07:42

unusable

play07:44

yikes and uh chpt does a slightly better

play07:48

job we can see here a bit like how the

play07:50

Eisenhower Matrix is supposed to be

play07:52

right you have important urgent not

play07:54

important important again but not urgent

play07:57

okay so and explains to me how to use it

play08:02

and it's still writing all right fair

play08:03

enough so here we can clearly see the

play08:05

chat

play08:06

gpt1 so for now it's a t Gro can make

play08:10

images I guess but yeah it's kind of

play08:13

kind of lacking the the people though

play08:14

the people like images with people are

play08:16

really impressive all right let's keep

play08:18

the battle going and I have something

play08:20

else here five a life the a day in the

play08:25

life of a person in

play08:27

2050 um this might be very intriguing to

play08:30

know the answer so again let's start

play08:32

with a fresh chat and see which one is

play08:36

more um creative all right so we have

play08:40

this G5

play08:42

okay

play08:44

and rock seems to still be answering

play08:48

it's a very challenging question

play08:50

apparently meanwhile CH GPT goes right

play08:53

into morning work and daily

play08:58

activities social interactions are

play09:00

Leisure okay as I said you can pause the

play09:01

videos and check out what's it saying

play09:03

what we got there I think it's

play09:07

decent talks about you oo okay I see you

play09:14

grock this is an interesting okay this

play09:17

is a very interesting result so I mean

play09:20

we got here all this the best would be

play09:22

like combination of the two oh wait but

play09:24

it's generating also images o Chad GPT

play09:27

here is working hard let's see see what

play09:29

we get cuz the image from gr is really

play09:32

nice it looks absolutely beautiful seems

play09:34

like a place in New York pretty much but

play09:36

in the future it didn't describe though

play09:38

the day in the life of a person 2050 so

play09:41

it kind of missed the mark it either

play09:42

seems to give you text or generate

play09:44

images not both I guess let's give it

play09:47

another shot come on I want to be

play09:49

merciful so let's see if we try this

play09:51

again what's going to happen and then

play09:53

you have the one from Chad GPT and 2050

play09:56

we can see it here not bad

play10:00

yeah the image from Gro is more

play10:01

beautiful but you know what you're

play10:03

getting here from Chad GPT is the

play10:05

complete answer pretty much where you

play10:06

have here the story with everything how

play10:08

it works and the imagery so this is a

play10:10

complete answer really solid the images

play10:12

look actually quite decent and Gro made

play10:15

another just image which looks

play10:18

absolutely stunning D all right so now

play10:21

we know some important differences last

play10:23

test let's do it so we have six a

play10:28

personalized morning routine for a busy

play10:30

entrepreneur for mindfulness exercise

play10:32

gold settings step-by-step guide any

play10:34

corresponding image again so let's see y

play10:38

it went wrong let's try it let's try it

play10:40

with ch GPT now um see here what's going

play10:43

to happen

play10:44

so let's retry it maybe it's a problem

play10:47

with the connection okay now how's doing

play10:48

it sure why not just the image but will

play10:51

it also give me the text that's the

play10:53

thing so we have some writing I guess

play10:57

and it looks nice but

play11:00

again it doesn't seem to understand the

play11:01

instructions right it just generates

play11:03

images but it doesn't also tell me the

play11:06

morning routine so every time we mention

play11:09

image it just makes an image hm

play11:12

interesting right so now we see what

play11:15

entrepreneurs doing here oh my God this

play11:16

is so incredibly detailed all right chat

play11:19

GPT is working hard here I respect

play11:23

that and here we have just another image

play11:25

that looks beautiful with no other

play11:28

information

play11:30

all right what do we have here so here

play11:32

we have the summary also of the routine

play11:34

very nice answer I like this a lot

play11:36

bullet points nicely formatted with the

play11:39

image and we get a nice looking

play11:42

image not bad looks a bit cartoonish as

play11:45

usual because it's using the do E I

play11:47

think Gro is actually using the flux

play11:49

model which is much better looking so

play11:53

there you have it folks the breakdown

play11:56

here the the Chad GPT versus grock um

play12:00

for answers I think Chad GPT

play12:03

Remains the best one currently because

play12:06

it understands instructions much better

play12:09

Gro is nice because it has you know all

play12:12

the the tweets apparently but it I don't

play12:15

see it managing to connect them properly

play12:17

you like the Tweet with the result that

play12:19

it gets you sometimes they're a bit

play12:20

disconnected images though look

play12:22

absolutely stunning it's beautiful so

play12:25

definitely um interesting so they both

play12:29

have their pros and cons pretty much but

play12:31

try them out if you have access to grock

play12:33

2 and let me know your thoughts let's

play12:35

see if maybe the grock 2 not the Mini

play12:38

version actually is smart than Chad GPT

play12:40

for now I would say Chad GPT is still

play12:43

the winner and since Claud Sonet 3.5 is

play12:47

actually better than Chad G pt4 I would

play12:49

Place Gro 2 at this point at the third

play12:51

place it's very good though I am

play12:54

pleasantly surprised by the huge

play12:56

improvements compared to grock 1 which

play12:57

is was absolute garbage

play12:59

all right this was a live test so as you

play13:02

see this is none of this information is

play13:03

manipulated you make your own thoughts

play13:05

you can pause the video and check it out

play13:06

and see you in the next one ciao

Rate This

5.0 / 5 (0 votes)

Связанные теги
AI TestGro 2Chat GPTReal-time DataImage GenerationTwitter AnalysisContent CreationTech InfluencerTask PrioritizationFuture PredictionsMindfulness Routine
Вам нужно краткое изложение на английском?