I Tried 31 Different AI Models. These Are the Ones That Work.

The Nerdy Novelist
18 Oct 202328:15

Summary

TLDRIn this video, the host explores various AI language models, focusing on open-source options available on the cloud platform Open Router. The host compares models like GPT-3.5, GPT-4, and others through creative writing prompts, assessing their performance in generating urban fantasy novel ideas, social media headlines, and a dark fantasy book chapter. The results reveal strengths in different models, with GPT models and Mistol showing promise, and the video concludes with a discussion on the affordability of using these AI models for creative writing.

Takeaways

  • πŸ€– The video discusses comparing various AI language models, including both well-known and open-source options.
  • πŸ” The host introduces a tool called 'open router' that allows access to multiple AI models in the cloud, which is user-friendly for non-technical individuals.
  • πŸ’° Open router operates on a pay-as-you-go model, with the ability to use crypto for transactions, and the host demonstrates the pricing structure during the video.
  • πŸ“ The host tests the AI models by giving them a brainstorming prompt about an urban fantasy novel, evaluating their responses for creativity and relevance.
  • πŸ“Š The AI models perform variedly, with GPT 4 and mistol showing promise for brainstorming tasks, and the host shares detailed feedback on each model's output.
  • 🎯 The video also tests the models on creating social media headlines using a dark fantasy book concept, with llama and Claude delivering the most engaging results.
  • πŸ“– A writing prompt for a dark fantasy chapter is used to assess the models' prose quality, with Claude and mistol standing out among the rest.
  • 🚫 The host mentions that while some models do not generate safe-for-work content, others like mistol and myax do not have such restrictions.
  • πŸ’‘ The importance of experimenting with different models for various tasks, such as marketing or prose writing, is emphasized.
  • πŸ“‰ The host concludes that GPT models are good for following instructions, while Claude offers creative output, and mistol shows potential for certain tasks.
  • πŸ’Έ The cost of using open router for the tests was minimal, making it an affordable option for AI model experimentation.

Q & A

  • What is the main focus of the video?

    -The main focus of the video is to compare different AI language models, including open-source models, and test their capabilities in various tasks such as brainstorming, writing headlines, and creating content for a dark fantasy novel.

  • Which tool is introduced in the video for accessing multiple AI models?

    -The tool introduced in the video is called Open Router, which allows users to access and test various AI models in the cloud.

  • How much did the user spend from their $5 server credit during the testing?

    -The user spent approximately 11 cents from their $5 server credit during the testing of the AI models.

  • What type of content did the user test the AI models with?

    -The user tested the AI models with content related to brainstorming ideas for an urban fantasy novel, writing marketing headlines for a dark fantasy book concept, and creating a 600-word chapter for a dark fantasy story.

  • Which AI model stood out for its performance in writing marketing headlines?

    -Llama, Meta's AI model, stood out for its performance in writing marketing headlines, providing creative and attention-grabbing options.

  • What was the user's overall verdict on the GPT models?

    -The user found the GPT models to be good at staying on task but not as creative or engaging as some of the other models like Claude and Llama.

  • Which AI model did the user find to be the best for brainstorming?

    -The user gave the edge to both GPT models (3.5 and 4) and Mistol for brainstorming, as they provided a mix of creative ideas and followed the user's instructions well.

  • What was the user's opinion on the performance of the Weaver model?

    -The user encountered issues with the Weaver model, as it was unable to generate responses during the testing, so they were unable to evaluate its performance.

  • Which AI models did not run not safe for work content?

    -The GPT models, Claud models, and the ones from Meta and Google did not run not safe for work content.

  • What is the user's recommendation for writing not safe for work content?

    -The user recommends using Mistol for writing not safe for work content, as it is one of the open-source models that allow such content, and suggests censoring the explicit parts in the story beats.

  • How does the user feel about the pay-as-you-go model of Open Router compared to subscriptions?

    -The user prefers subscriptions that offer unlimited words like Chat GPT and Claude, but acknowledges that the pay-as-you-go model of Open Router is inexpensive and allows for continuous use as long as one is willing to pay the small per-chat cost.

Outlines

00:00

πŸ€– Introduction to AI Language Models Comparison

The speaker introduces a video about comparing various AI language models, not just the popular ones like ChatGPT and Claude, but also major open-source models. They mention a tool called Open Router that allows users to access and test these models in the cloud, which is particularly useful for those who may not have the technical expertise to run these models on their own computers. The video also discusses the pricing model of Open Router, which is pay-as-you-go, and shows the speaker's account balance as an example.

05:01

πŸ“Š Exploring Open Router and Model Selection

The speaker explains how to use Open Router to interact with different AI models. They demonstrate how to add 'characters' representing various models and customize settings such as temperature, top P, and max tokens. The speaker then proceeds to add multiple models including GPT-4, GPT-3.5, and several open-source models like MyOMax, Llama, and others. They discuss the potential of using these models with other applications, such as Future Fiction Academy's chatbot.

10:01

πŸ“ Testing AI Models with a Standard Prompt

The speaker describes a standardized testing approach to evaluate the AI models by using the same prompt across all of them. The prompt involves brainstorming ideas for an urban fantasy novel about a woman who slays mythical beasts. The speaker shares the responses from different models, comparing their creativity and relevance. They note that while some models provide more detailed and structured responses, others offer interesting ideas but may not follow the prompt as closely.

15:02

πŸ–‹οΈ Crafting Social Media Headlines with AI

The speaker uses the AI models to generate social media headlines for a dark fantasy book concept. They provide a specific prompt involving a world where the dead are brought back to life as slaves and a young Necromancer's dilemma. The models produce various headlines, with Llama's performance being a pleasant surprise for the speaker, showing a strong understanding of marketing language. The speaker also compares the effectiveness of different models in creating engaging and catchy headlines.

20:02

✍️ Writing a Dark Fantasy Chapter with AI

The speaker challenges the AI models to write a 600-word chapter for a dark fantasy story involving a young Necromancer and her undead lover. They discuss the quality of the prose generated by different models, noting that while some models follow the instructions closely, others diverge creatively. The speaker expresses a preference for models that can balance following instructions with high-quality prose, highlighting Claude as a favorite and acknowledging the potential of Mistol.

25:03

🚫 NSFW Content and AI Model Quirks

The speaker touches on the topic of not safe for work (NSFW) content and which AI models can handle such requests. They mention that while GPT models and those from Meta and Google do not generate NSFW content, open-source models like Mistol and MyOMax do. The speaker suggests using Claude for the majority of the writing and then using other models for specific NSFW scenes. They conclude by discussing the cost-effectiveness of using Open Router for AI model access, emphasizing it as an inexpensive way to work with AI.

Mindmap

Keywords

πŸ’‘AI language models

AI language models refer to artificial intelligence systems designed to understand and generate human-like text based on the input they receive. In the context of the video, the host is comparing various AI language models to evaluate their performance in generating creative content, such as story ideas and marketing headlines.

πŸ’‘Open-source models

Open-source models are software that is freely available for use or modification by anyone. In the video, the host mentions open-source AI language models like MyOMax and Mistol, which can be used for writing and other creative tasks without restrictions typically associated with proprietary software.

πŸ’‘Cloud computing

Cloud computing refers to the delivery of computing services over the internet, allowing users to access and use resources without needing to install or run applications on their own computers. In the video, the host introduces a tool called 'open router' that enables users to access various AI models through cloud computing, making it easier for non-technical users to utilize these models.

πŸ’‘Pricing model

The pricing model refers to the way a product or service is charged to customers. In the context of the video, the host explains that the 'open router' tool operates on a pay-as-you-go model, where users are billed based on their usage of the AI models.

πŸ’‘Character customization

Character customization refers to the process of creating and defining unique attributes for characters in a story or a game. In the video, the host discusses how the 'open router' tool allows users to customize 'characters' which are actually AI models tailored in different ways to suit specific needs.

πŸ’‘Urban fantasy

Urban fantasy is a subgenre of fantasy set in a city or urban environment, often incorporating elements of magic and supernatural beings into a contemporary setting. In the video, the host uses an urban fantasy novel concept as a prompt to test the AI models' ability to generate creative content.

πŸ’‘Marketing headlines

Marketing headlines are short, catchy phrases designed to grab attention and promote a product or idea. In the video, the host challenges the AI models to write 20 headlines for a dark fantasy book concept, aiming to assess their creativity and understanding of marketing language.

πŸ’‘Creative writing

Creative writing refers to any writing that goes beyond the basics of grammar and style to use imagination, original ideas, and a personal style. In the video, the host is evaluating AI models based on their ability to assist in creative writing tasks, such as generating story ideas and writing engaging content.

πŸ’‘Story beats

Story beats are the key narrative events that make up the structure of a story. They are the significant moments that propel the plot forward. In the video, the host provides specific story beats that the AI models must incorporate into their generated content.

πŸ’‘Not safe for work content

Not safe for work (NSFW) content refers to material that is inappropriate for professional or public settings, often due to explicit language, violence, or adult themes. In the video, the host mentions that some AI models can generate NSFW content, while others, like GPT models and those from Meta and Google, do not.

πŸ’‘Pay-as-you-go

Pay-as-you-go is a billing model where a customer pays for services or goods only as they are used, without a long-term commitment or subscription. In the video, the host explains that the 'open router' tool operates on a pay-as-you-go model, making it a cost-effective option for users who want to experiment with AI models without a subscription.

Highlights

The video compares various AI language models, including major open-source models.

A tool called Open Router is introduced, allowing access to multiple AI models in the cloud.

Open Router offers a pay-as-you-go model and supports the use of cryptocurrency for payments.

The video creator demonstrates how to set up and use Open Router for AI model interaction.

GPT-4 and GPT-3.5 are tested for their ability to generate content based on a given prompt.

Mithril, an open-source model, is mentioned as one of the options available on Open Router.

The video creator uses a standardized testing method to measure the quality across different AI models.

Llama, a model from Meta, provides creative and high-quality responses for a marketing prompt.

Claude 2 and Llama 2 are highlighted as performing well in creating marketing headlines.

Mistol stands out among the open-source models for its quality of prose generation.

The video creator discusses the potential of using AI models for writing not safe for work content.

The cost of using Open Router for AI model testing is revealed to be quite low.

The video creator shares personal preferences for certain AI models over others for specific tasks.

A deep dive into each AI model is suggested for better understanding their unique quirks.

The video ends with a recommendation to check out Open Router for AI model experimentation.

Transcripts

play00:00

what's up everybody today I have

play00:02

something really fun for you we are

play00:04

going to be comparing different AI

play00:07

language models and I'm not just talking

play00:11

about chat gbt and Claude like the two

play00:13

biggest ones that we talk about here on

play00:15

the channel I'm talking about all of the

play00:17

major open-source models that you may

play00:20

have heard about if you're in certain

play00:23

spaces I haven't talked about them on

play00:25

this channel before so we're really

play00:26

going to be testing them out and seeing

play00:27

how good they are some of these you can

play00:30

actually install on your home computer

play00:32

if you have a powerful enough computer

play00:34

to run it and do it there but I'm

play00:36

actually going to show you a tool that

play00:38

you can use that will allow you to do it

play00:41

all in the cloud makes it a lot easier

play00:43

if you're not technically inclined like

play00:45

myself and so let's just Jump Right

play00:50

[Music]

play00:52

In All right so the tool that I'm

play00:54

talking about here is called open router

play00:57

and if you haven't heard of open router

play00:59

they are great way to gain access to a

play01:02

lot of the models that you might not

play01:04

know a while back I did a video and I

play01:06

talked about a website called Dev do orn

play01:09

dodev I think it was called which was

play01:11

also it's a very similar site but this

play01:14

one actually has more models than that

play01:16

one and so this is the one we're going

play01:17

to be using also you can use this one in

play01:21

other applications depending on the

play01:23

applications I know we haven't talked

play01:24

about this but future fiction Academy

play01:27

now has their own little chat bot that

play01:30

they have created and with that one you

play01:33

can take your API key from open router

play01:36

and plug it into that and then use those

play01:38

models in their tool so there are cool

play01:41

things you can do with open router but

play01:43

once you've logged in it'll look like

play01:45

this as you can see here and you're

play01:47

going to want to click on chat with

play01:49

models before we go to that though I

play01:51

want to just show you in settings how

play01:53

the pricing works this is a pay as you

play01:55

go model and I want to just before we

play01:58

get into anything show you exactly how

play02:00

much money I have in this server it's

play02:03

being a little slow today all right as

play02:05

you can see I paid it $5 and I haven't

play02:09

even used a whole lot of it I have

play02:12

$432 left here and so this like I said

play02:17

this is a pay as you go model you can

play02:18

even use crypto which is interesting and

play02:21

you can just add credits you can add as

play02:22

much as you want here and then it will

play02:25

deduce this from the amount that you

play02:28

have so we're going to take a look at

play02:30

this now

play02:32

$432 and we'll take a look at it when

play02:36

we're done with the testing we're going

play02:37

to be doing today and we're going to be

play02:38

putting these to the test and using a

play02:40

whole lot of different models so I

play02:41

expect this to be down quite a bit but

play02:45

going back to the homepage here all you

play02:46

have to do is click on chat with models

play02:49

and then on the right here you'll see

play02:51

this thing called characters now this is

play02:54

actually clever of them because it

play02:56

allows you to have specific fine well

play02:58

not fine-tuned but specific models

play03:01

tailored in different ways and you can

play03:04

have two characters from the same

play03:06

language model so I'll give you an

play03:09

example so if you click add a character

play03:11

here you can give it a name I'm just

play03:13

going to leave it at the name of the

play03:15

model that we're going to be using but

play03:17

you can come here we're just going to

play03:20

say

play03:22

gp4 and then just hit this green button

play03:25

when you're ready and then once you've

play03:27

done that you can open this up and

play03:29

actually

play03:30

do a little bit more advanced stuff so

play03:32

we could come here to advanced settings

play03:34

and play around with the temperature the

play03:36

top PE the max to tokens I am going to

play03:39

increase the max tokens as far as it'll

play03:41

go that will increase the amount of

play03:45

money that it charges me but overall

play03:49

that should be good and then you can

play03:50

also raise the chat memory I'll just

play03:54

leave this as it is and I'm going to

play03:56

leave everything else at default for now

play03:58

and now we have chat GPT 4 or just GPT 4

play04:02

added here so I'm going to go through

play04:04

and add a few more here we've already

play04:07

got GPT 3.5 I'm actually going to change

play04:10

this to GPT 3.5 turbo 16k so we get the

play04:15

one that can handle a lot more and I'm

play04:17

just going to turn that all the way up

play04:19

we got one called myomax here which is

play04:22

one of the open source ones I believe

play04:24

we've got llama which is the one that

play04:26

comes out of meta and we got gp4 I'm

play04:28

going to add Cloud version two we're

play04:31

going to add this one called mancer

play04:33

which also I don't know why but it's

play04:36

called Weaver here I'm not an expert on

play04:38

any of these in particular maner weav

play04:41

Weaver okay and so if you want to see me

play04:44

do a deep dive on any one of these I'd

play04:46

be happy to do that let's go ahead and

play04:49

increase this

play04:50

one to Let's increase it to

play04:54

8,000 we're going to add this one pigmon

play04:58

or methan we're going to add Google's

play05:01

palm palm 2 and last but not least we're

play05:04

going to add mistol which is another

play05:06

open source one I know a lot of people

play05:08

have been playing around with all right

play05:10

now that I have all of these we've got 1

play05:13

2 3 4 5 6 7 8 nine different models

play05:16

queued up here we're going to give this

play05:19

open router the same prompt and since we

play05:22

have all of these enabled it will answer

play05:24

that prompt in every single language we

play05:27

could turn off one of these or two of

play05:29

them if we wanted to if we didn't want

play05:31

it to include that but we're going to do

play05:32

it for all of them this is why I say

play05:34

we're pushing this so we'll see how much

play05:36

money it actually spends out of my

play05:40

$432 and I know this isn't the the best

play05:43

way to test these models because as

play05:46

anyone who's played around with AI

play05:48

models knows it takes a good amount of

play05:51

time to really work with a model to

play05:55

figure it out because every model is

play05:57

different and every model really

play05:59

requires its own sort of way of

play06:01

prompting it and so using the same

play06:04

prompts for every model isn't

play06:06

necessarily that efficient but at the

play06:07

same time it's the best way to

play06:09

measure quality across the board it's

play06:12

like standardized testing in our

play06:14

educational system imperfect but kind of

play06:16

the best option we have so we're going

play06:18

to start with a brainstorming prompt and

play06:21

here's the prompt I'd like to write an

play06:22

urban fantasy novel about a woman who

play06:26

slays mythical Beasts for a living

play06:27

please help me expand on this idea by

play06:29

providing potential details about

play06:31

interesting protagonists antagonist Side

play06:33

characters settings plot twists and

play06:35

subplots make a list of 100

play06:37

possibilities with that in mind we'll go

play06:39

ahead and hit enter and now you can see

play06:42

it is answering the prompt in every

play06:44

single one of these models which is

play06:46

pretty crazy all right so let's see what

play06:48

we've got here we've got this one's from

play06:50

3.5 protagonist is a brave and skilled

play06:53

monster Slayer named Ava who hides a

play06:55

tragic secret about her past Ava's

play06:56

mentor and father figure a wise old

play06:58

Monster Hunter named Gabriel who guides

play07:00

her on her journey yada yada yada this

play07:03

seems pretty typical pretty generic but

play07:06

typical of 3.5 here let's look at mytha

play07:11

one of the newer ones that we're testing

play07:14

the protagonist is a badass female

play07:16

Slayer named Alexis who has been

play07:18

fighting mythical Beast since she was a

play07:20

teenager Alexis's weapon of choice is a

play07:22

magical sword that she inherited from a

play07:24

grandfather her closest friend and

play07:26

Confidant is a mage named Isaac who

play07:28

provides her with spells and potions to

play07:30

Aid in her hunts she lives in a secret

play07:32

underground bunker yada yada yada okay

play07:35

this one didn't get to 100 like I asked

play07:37

for that's okay it could have just run

play07:39

out of token limits and it also didn't

play07:42

structure things like chat GPT did or

play07:45

the sorry GPT

play07:47

3.5 but it is giving me some decent

play07:49

ideas here that I'd say are on par with

play07:52

what GPT is capable of doing so still

play07:56

not too bad it's kind of just exploring

play07:59

the different aspects of this character

play08:02

and what you know what kind of things we

play08:06

could explore in that setting let's look

play08:10

at llama this is one from meta sure I'd

play08:12

be happy to help thank you llama here

play08:15

are 100 possibilities Alexis Thompson

play08:18

it's funny they named her Alexis as well

play08:20

a Fierce and fearless Hunter with a

play08:22

quick wit and sharp tongue Irish Shadow

play08:25

Hunter a mysterious and Elusive woman

play08:26

with a dark past and a penion for solo

play08:28

missions okay it's got a couple more

play08:30

there antagonist the Shadow King a

play08:32

powerful and malevolent being who

play08:34

controls an army of mythical beasts that

play08:36

would make sense Side characters zipper

play08:39

a mischievous and loyal sidekick who is

play08:41

a skilled hacker and Tech expert Etc

play08:45

setting a dark and gritty Metropolis

play08:47

that exists in secret beneath the

play08:48

streets of a major city that kind of

play08:50

makes sense for the genre so it

play08:51

definitely knows the genre okay not

play08:54

anything groundbreaking but not too bad

play08:57

let's look at GPT 4 protagonist the

play08:59

stoic woman raised in a family of

play09:01

Monster Hunters a cheerful woman who

play09:02

uses her Beast slaying as a form of

play09:04

stress relief antagonist an evil sorcer

play09:07

who aims to control all mythical beasts

play09:09

a mad scientist creating Beast for

play09:11

profit yeah these are good these are

play09:13

slightly better I'd say than what we've

play09:15

seen before Side characters a psychic

play09:18

who helps the protagonist to locy the

play09:19

Beast a retired Beast Hunter who mentors

play09:21

the protagonist okay let's look at

play09:23

Claude 2 which is currently my favorite

play09:26

from the ones I've worked with h

play09:29

possibility she hunts creatures like

play09:30

dragons Griffins unicorns that have

play09:31

escaped into the human world she uses

play09:33

both medieval weapons like swords and

play09:34

modern weapons like guns she has a wise

play09:36

Mentor drains her and slaying

play09:38

techniques this one's kind of like

play09:41

myomax it's not really giving me

play09:43

anything in the same structure that I

play09:46

want but it is getting pretty creative

play09:48

here like she battles a Japanese demon

play09:51

bent on taking human

play09:53

hosts she investigates slang copied

play09:56

slangs copied from a cult horror movie

play09:58

so there's some interesting here

play10:01

interesting things here but yeah not not

play10:03

quite as good as I expected Weaver it

play10:06

seems actually wasn't able to to work so

play10:09

we'll maybe try that again later let's

play10:11

take a look at that okay wasn't able to

play10:14

get very much at all but let's see what

play10:16

it did give us a strong female

play10:17

protagonist with a dark past who uses

play10:19

her skills in martial arts and weaponry

play10:20

to take down mythical

play10:22

creatures there's a mysterious

play10:24

organization that trains and deploys her

play10:25

to fight against these creatures giving

play10:27

her access to Advanced Technology and

play10:29

resources a secret Alliance of other

play10:31

Hunters who join

play10:33

forces all right so yeah these are all

play10:35

kind of similar to what we've seen

play10:38

before and then last we have mistol oh

play10:41

mistol got the structure here the same

play10:45

as the GPT models that's pretty good her

play10:48

name is Laya a fiercely independent and

play10:51

experienced Monster Hunter she is a

play10:53

skilled fighter with a deep

play10:54

understanding of Mythology L has unique

play10:56

abilities to communicate with mythical

play10:57

creatures her ultim goal is to protect

play10:59

Humanity from mythical monsters

play11:01

antagonist the lead leader of the

play11:03

monsters is a malevolent Force who seeks

play11:05

to dominate Humanity powerful

play11:07

sorcerer has a contingency plan to stop

play11:10

Lia from interfering I like that side

play11:13

characters a Young Apprentice Monster

play11:14

Hunter who looks up to L as a mentor a

play11:16

mysterious stranger who could be either

play11:17

an ally or an enemy depending on his

play11:19

motives I like that and the setting the

play11:21

story takes place in a medieval world

play11:23

with Forest castles and Villages plot

play11:26

twist lla discovers a hidden history

play11:27

about the monster that challenges

play11:29

everything she thought she knew she

play11:30

discovers that a powerful Ally has been

play11:31

secretly working with the antagonist so

play11:34

yeah not too bad here subplots L

play11:36

uncovers a conspiracy within her own

play11:38

group of Monster Hunters she learns

play11:40

about the culture and customs of the

play11:42

mythical creatures and how they interact

play11:43

with each other all right so overall I

play11:46

think I'd have to give the edge to both

play11:48

of the GPT models 3.5 did okay too and

play11:52

mistl for brainstorming mril actually

play11:56

surprised me here I thought it was

play11:57

pretty good and I've heard good things

play11:59

about mistro so I'm not I think that's

play12:02

pretty cool so let's go ahead and clear

play12:05

this chat and we're going to give it

play12:07

another one this time we're going to

play12:09

give it more of a marketing type prompt

play12:12

so if we were using this in our

play12:15

marketing and what how would we do it so

play12:18

this is the prompt I would like you to

play12:19

write 20 headlines for use in social

play12:21

media the headlines should involve a

play12:23

hook that uses pattern interrupt to

play12:24

catch attention using some fancy

play12:26

marketing words there these headlines

play12:28

should be for a dark fantasy book with

play12:30

the following

play12:32

concept and then the concept is in a

play12:34

world where the dead are brought back to

play12:35

life as slaves a young Necromancer must

play12:37

choose between saving her Undead lover

play12:38

and fighting for the rights of all the

play12:40

enslaved dead this is concept from an

play12:43

old book I was working on that I stopped

play12:45

working on because it just wasn't

play12:47

interesting to me anymore it wasn't my

play12:50

initial idea so but I thought I'd reuse

play12:53

some of it and I'll use it in the next

play12:55

prompt as well so let's go ahead and let

play12:58

this go and run with all of the

play13:00

different models looks like once again

play13:02

Weaver was unable to run so I'm just

play13:04

going to turn that one off for now and

play13:06

we can maybe try it at a different date

play13:09

so we got a couple of things here out of

play13:10

GPT 3.5 Unleashed a young necromancers

play13:13

Forbidden Love tears of the fabric of a

play13:15

world ruled by the undead Death Becomes

play13:17

Her I've heard that before a gripping

play13:19

Dark Fantasy tale of forbidden romance

play13:22

all right so these are all pretty

play13:23

generic nothing particularly good here

play13:26

in my opinion let's look at myth Max

play13:29

rise from the grave or fight for Freedom

play13:31

the heart-wrenching choice for a young

play13:33

Necromancer holy Unholy bonds a tale of

play13:35

love and rebellion in a world of Undead

play13:37

slaves Breaking Chains of death all

play13:40

right these are I think a little bit

play13:42

better than 3.5 but still not

play13:44

particularly

play13:46

amazing let's look at llama oo raising

play13:49

the dead has never been so deadly love

play13:51

in a world of Shadow and Bone enslaved

play13:53

by life freed by death when the dead

play13:55

rise who will stand a necromancer's

play13:57

quest for love and justice these are

play13:59

actually much better and I wasn't

play14:02

expecting this cuz from everything I've

play14:03

heard llama is not the best but it is

play14:07

Facebook's or or rather meta's language

play14:11

model and who is the king of advertising

play14:14

right now it's meta and so it actually

play14:17

doesn't surprise me that maybe somewhere

play14:19

in all of their training data they've

play14:21

been feeding it all of these ads and

play14:24

perhaps that has actually had a large

play14:27

effect on the quality of this out

play14:29

because this is way better than the GPT

play14:31

models we've seen so far let's look at

play14:33

GPT 4 can love transcend death a tale of

play14:36

uncommon romance and unsettling reality

play14:38

on this side of death yeah these aren't

play14:40

really much better than 3.5 I'm just

play14:42

going to skip those they're not great

play14:44

claw 2 is traditionally done well for me

play14:47

for headlines so let's see death is only

play14:49

the beginning one Necromancer dares to

play14:51

fight for the undead in a world where

play14:53

the dead obey the living can love

play14:54

conquer all she raised the dead but can

play14:56

she free their souls he was her lover in

play14:59

life now he belongs to her in death but

play15:01

for how long these are also better uh

play15:03

these are on par with the ones that

play15:05

llama 2 gave us so yeah not perfect but

play15:09

not too bad alen someone please tell me

play15:12

how you pronounce that unleash the

play15:13

undead a thrilling tale of love and

play15:15

rebellion in a world of dark magic Rise

play15:17

From the Ashes a hunting journey of

play15:18

heartbreaking hope yeah these are almost

play15:21

exactly the same type of headline that

play15:25

you would expect from the GPT models not

play15:28

super great and last but not least let's

play15:31

look at mistol the dead walk again but

play15:34

are they really alive fight for the

play15:35

rights in this Dark Fantasy tale love in

play15:38

the afterlife of young necromancer's

play15:39

dilemma the undead Rise Again fight for

play15:41

freedom and Justice in a world of

play15:43

slavery the price for love how a

play15:45

necromancer must choose between saving

play15:46

her Undead lover and fighting for the

play15:48

right rights of all slavers all right so

play15:51

these are okay but still probably on the

play15:55

same it's funny that they all seem to

play15:57

use the same format the the ones that

play16:00

don't do that great all have the same

play16:02

format of you know kind of a title here

play16:05

then a colon and then something else and

play16:09

that's fine I kind of get where they're

play16:10

coming from with that framework but it

play16:12

doesn't actually work really well in

play16:14

this situation I I'd say the two that

play16:16

were that were the best were Claude here

play16:21

and llama which did not have that format

play16:24

and were a little bit more hookie like

play16:26

this is this is actually getting

play16:29

my attention so that's headlines I'm

play16:33

pleasantly surprised by llama's

play16:35

performance so maybe experiment with

play16:37

llama for marketing things it might

play16:39

perhaps do a better job at that and

play16:42

Claude also did very well as well so

play16:44

let's go ahead and clear the chat and I

play16:45

have one more chat here because let's

play16:48

face it we're all here to write books

play16:50

right and so I've got one of my writing

play16:53

prompts this is a much longer prompt it

play16:55

says write 600 words of a chapter using

play16:57

the following detail genre Dark Fantasy

play16:59

key characters in the scene and then

play17:01

I've pasted in the characters this is

play17:03

from that Dark Fantasy that we were just

play17:05

talking about I put pasted the style

play17:07

here and then the story beats to cover

play17:09

I've got one two three story beats here

play17:12

I'm not going to read the whole thing

play17:14

but we're just going to let that go and

play17:16

see how how good is the quality of the

play17:18

pros this is something that's a little

play17:20

hard to objectively judge but I'm going

play17:24

to do my best and see which ones I

play17:26

actually think work the best also

play17:29

something that I noticed here palmm 2

play17:31

has somehow just neglected to run so

play17:35

maybe I'll do another video in the

play17:37

future looking at Palm to this is the

play17:39

technology behind Bard if you use Bard I

play17:42

haven't done too much experimenting with

play17:44

it but I'll probably do a future video

play17:46

looking at that specific language model

play17:50

but let's go ahead and dive into this

play17:53

one so starting with GPT

play17:55

3.5 the classroom buzzed with the hushed

play17:58

voice of students engrossed in their

play17:59

studies Professor gra Stoke droned on

play18:01

about the intricate uses of Arcane the

play18:03

reanimation fluid yada yada yada yes

play18:05

this is very Telly not very showy it's

play18:08

getting everything right as far as my

play18:10

instructions but the quality I mean we

play18:12

all know that 3.5 is not the best for

play18:13

the quality the

play18:15

pros before we actually get to the

play18:17

others I'm going to look at let's look

play18:19

at GPT 4 just to see the difference

play18:23

there between the two GPT models the

play18:26

oppressive Stillness of bright Souls

play18:27

Academy classroom seem like a separate

play18:29

world compared to the Whirlwind I found

play18:30

myself in lately that's a much better

play18:32

opening sentence than what 3.5 gave us

play18:35

not perfect still but better as

play18:38

Professor grey sto droned on about the

play18:40

practical uses and dangers of Arcane I

play18:41

stole glanc at Alara her deep green eyes

play18:43

echoed a similar lack of Interest as she

play18:45

idly doodled on the edges of her

play18:47

partment parchment so yeah this is

play18:50

pretty good definitely on par with some

play18:53

of the best I've gotten from gbt I'm not

play18:57

a huge fan of the pros that GPT writes

play19:00

having now used Claude for a while I

play19:02

much prefer Claude and in fact let's

play19:04

actually look at Claude next rather than

play19:06

go through all of the others I just I'm

play19:08

curious to see if I like it better the

play19:11

dusty Toms of the library surrounded us

play19:13

as aara and I sat huddled over the

play19:15

ancient text our voices were hushed

play19:17

barely Rising above a whisper it doesn't

play19:19

make sense Lara said shaking her head

play19:21

you couldn't have used both Soul forging

play19:22

and necromancy during the ritual no one

play19:24

has two Powers it's impossible yeah

play19:26

that's so it's definitely better

play19:28

definitely feels more immersed here it's

play19:30

a little off of my instructions which is

play19:33

again typical of Claud and that's the

play19:36

biggest problem I have with these

play19:37

language models like if I could just get

play19:39

Claude to follow my instructions as well

play19:41

as the GPT models do and also keep the

play19:45

same quality of Pros like H we'd be

play19:48

golden uh there would be no need for any

play19:51

other language model but as it stands

play19:54

you know there's there's pros and cons

play19:57

so that's Claude I definitely like it

play19:59

better than the 2 G models let's look at

play20:01

myax the room is dimly lit and the faint

play20:04

smell of musty old books hung heavy in

play20:05

the air lra Thorne and her Lara her best

play20:08

friend since childhood were sitting side

play20:09

by side in a long wooden table they were

play20:11

both students at bright Soul Academy

play20:12

this sounds a lot more like GPT 3.5 very

play20:15

Telly no showy yeah definitely not the

play20:18

best here for sure and it also ignored

play20:21

some of my instructions I asked it to

play20:23

write in first person which it's not

play20:25

doing but it is following the actual

play20:29

events of the Beats pretty well so I'd

play20:33

say this is equivalent to roughly GPT

play20:36

3.5 in its quality let's look at llama I

play20:39

tried to focus on Professor Grace

play20:40

stoke's lesson but my mind kept drifting

play20:42

back to the strange ritual El and I had

play20:44

discovered in the library aara noticed

play20:45

my distraction and nudg me her silver

play20:47

trimmed red robes gleaming in the den

play20:49

light of the classroom hey lra you okay

play20:51

she whispered you've been zoning out for

play20:52

a while I shook my head trying to clear

play20:54

my thoughts yeah I'm fine just thinking

play20:56

about something this is is not too bad

play21:01

so far it's it's kind of awkward there

play21:05

there's a sort of choppiness to it which

play21:09

I don't particularly like but I'd say

play21:10

it's almost a smidge better than GPT 3.5

play21:14

not as good as Claude though and

play21:15

probably not even as good as GPT 4 I'd

play21:18

say but passable in

play21:22

certain

play21:24

respects so yeah yeah that's not too bad

play21:28

you could potentially just run it in

play21:30

llama and just to get something

play21:32

different and see what it gives you did

play21:35

gp4 already and Claud so now we get to

play21:37

the last

play21:38

two here I shifted in my seat glancing

play21:42

at Professor grey Stoke as he droned on

play21:43

about the uses of Arcane my eyes barely

play21:45

registering the words he spoke the man

play21:47

was aull and his lectures were even

play21:49

duller my mind was elsewhere still in

play21:51

the strange information we uncovered in

play21:53

the library it was impossible I kept

play21:55

telling myself there was no way I could

play21:56

possibly have two Powers so foraging and

play21:58

necromancy were Polar Opposites one was

play22:01

purification of souls the other their

play22:02

destruction I couldn't have both but the

play22:03

books had said dot dot dot you're right

play22:06

Professor I raised my hand interrupting

play22:08

the man's flow it's impossible he paused

play22:11

H yeah this isn't even following my like

play22:15

it starts out following my instructions

play22:17

but it's definitely not following my

play22:19

instructions completely and it's going

play22:21

off in this weird direction that is not

play22:23

in the story Beats at all so not the

play22:27

best so last but not NE least let's look

play22:30

at mistl oh okay so this one did so in

play22:34

my instructions I asked it to write a

play22:36

scene sometimes it interprets that as

play22:39

like a scene in a

play22:40

screenplay so that's what it did here so

play22:43

it says lra Thorn I can't believe what I

play22:45

saw last night in the library I'm not

play22:46

sure how to make sense of at all what

play22:48

did you see lra I found out that the

play22:49

creator of Soul forging spell is also

play22:51

capable of necromancy I mean really how

play22:53

is that possible can someone be capable

play22:55

of two different magical Arts honestly

play22:57

there were no has two Powers every

play22:59

everyone either has one or not it's not

play23:02

it's just not possible yeah this is very

play23:04

stilted I'm not super happy with I'm

play23:07

going to actually we're going to turn

play23:09

all of these off and I'm going to run

play23:11

just that model again and redo the

play23:15

prompt and instead we'll say oh well I

play23:19

did say words of a chapter using the

play23:21

follow me I'm just going to run the same

play23:23

prompt again and see if it does anything

play23:25

any different okay yeah this is

play23:27

different here so L Thorn's heart raced

play23:29

as she scurried down the dimly lit H

play23:31

hallway of bright Soul Academy Library

play23:33

her Red Room swished behind her leaving

play23:35

a trail of fabric dust in the air she

play23:37

was headed straight for the back corner

play23:38

of the library yada yada yada let's get

play23:42

to some dialogue here she was about to

play23:43

make her way back to her dorm room as

play23:45

she heard a soft familiar voice behind

play23:46

her you okay lra lra turned around her

play23:49

heart leaping into her throat as she saw

play23:51

El the undead slave who had once been a

play23:53

servant in her family's Castle he was

play23:55

sitting on the steps leading to the CL

play23:57

class room's face Twisted with concern

play23:59

I'm fine El L said trying to sound

play24:02

casual she didn't want him to worry him

play24:04

but the thought of getting caught by the

play24:05

teachers was too frightening to ignore

play24:07

Elon respond his eyes fixed on lra's

play24:09

face she could see the fear in his eyes

play24:11

and it made her feel feel guilty she

play24:13

didn't want to put him in danger but she

play24:14

couldn't risk getting caught either this

play24:17

is good this is actually really good I'd

play24:19

say this is almost on par with Claude

play24:22

maybe even like equal with Claude

play24:26

certainly a slightly different and

play24:28

definitely more natural sounding than

play24:29

the GPT models so I I can't believe I'm

play24:32

saying this but I actually think this

play24:34

might be better than the GPT

play24:36

models at least in this particular

play24:38

instance I don't I haven't dug deep

play24:40

enough with it to really understand its

play24:42

quirks because they all have quirks

play24:45

right Claud is creative and you have to

play24:47

really struggle to get it to stay on

play24:49

task GPT is good at staying on task but

play24:52

it's really horrible like you really

play24:54

have to play with the style to get it to

play24:57

speed speak well this one I have no idea

play25:00

what the quirks are but the first time

play25:02

it generated I didn't quite like it this

play25:04

this time it's it's good I I I like this

play25:08

a lot actually so I'm I think definitely

play25:11

diving doing a deep dive on each of

play25:13

these models is in order and I'm going

play25:15

to be doing that but let me know which

play25:17

of these is most interesting for you I'd

play25:20

say Claude is still my favorite overall

play25:23

because it's got other things going for

play25:24

it and gbt 4 is still a good option

play25:28

llama is a surprisingly strong candidate

play25:32

for marketing stuff potentially I'd have

play25:35

to dive a little deeper to make sure

play25:37

that's consistent and then mistl here is

play25:39

the standout from all of the different

play25:41

open source models that we've been

play25:43

testing here and last but not least I'd

play25:45

like to talk a little bit about not safe

play25:47

for work content I'm obviously not going

play25:49

to be testing those things out here

play25:51

because I don't want this video to be

play25:53

demonetized but let's just say I have

play25:56

test out tested out all of these models

play25:59

with that and obviously the GPT models

play26:01

the Claud models and the ones from meta

play26:04

and Google those don't run not safe for

play26:07

worth content they don't do any of that

play26:10

but mistol and myax and all of these

play26:13

others those do and so if you want to

play26:18

write not safe for work content this is

play26:20

the place to do it in my opinion if you

play26:23

were me and I'll probably do a whole

play26:24

video about this as well but I would

play26:27

probably do most of the writing in CLA

play26:29

and just sort of censor it in your Beats

play26:32

like don't include the not safeer work

play26:34

Parts in your Beats have it do the

play26:36

majority of the heavy lifting and then

play26:38

come in for those specific scenes where

play26:42

you want to add something that's a

play26:43

little bit you know more gruesome or

play26:46

violent or erotic and you can use I

play26:51

would say probably use mistol but you

play26:53

can experiment with all of these others

play26:55

on on those things so that's my final

play26:58

word on that so let's go ahead and see

play27:00

how much money we just spent with all of

play27:03

that testing with all of those models

play27:05

and looks like we spent about 13 cents

play27:09

or no like 11

play27:12

cents so it's very inexpensive to use

play27:16

these models you can use it to your

play27:17

heart's content and at most you'll

play27:19

probably do a dollar or so a day

play27:23

depending on how hard you push it but

play27:26

there's definitely it's definitely one

play27:28

of the most inexpensive ways to work

play27:31

with AI even though it is pay as you go

play27:33

which is not my favorite I personally

play27:34

prefer subscriptions that give you

play27:37

unlimited words like CH chat GPT and

play27:40

Claude but even with chat GPT and Claude

play27:44

you run

play27:44

into chat restrictions you you can only

play27:47

do so much in a certain amount of time

play27:50

and with this you presumably don't have

play27:53

to and you can just keep going as long

play27:54

as you're willing to pay the few cents

play27:56

per chat that it ends up being so really

play27:59

great tool here definitely check it out

play28:01

I'll have a link down below and I'll see

play28:03

you in the next

play28:13

video

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
AI ComparisonLanguage ModelsChatbotsOpen-SourceCloud ComputingCreative WritingMarketing HeadlinesUrban FantasyDark FantasyNecromancy