Midjourney Version 6 - IS AMAZING!!!

Olivio Sarikas
21 Dec 202313:43

Summary

TLDRThe video discusses the new version 6 release of Midjourney AI and demonstrates how it has improved image generation capabilities. The host shares excitement about using Midjourney almost daily for expressive and stock-like images. Some key improvements in version 6 highlighted: - More accuracy for longer prompts - Increased coherence and knowledge, allowing more complex scenes - Less need for 'chunky' prompting conventions used in v5 - Ability to generate readable text Examples demonstrate v6 improvements: - Simpsons x Beauty and the Beast mashup retains styles well - Santa Claus image renders tongue realistically - 'Hello World' text renders clearly several times - Dragon battling warrior scene has excellent composition - Cat reading book image has cohesive style and pose - Sci-fi cockpit has functional, coherent elements - Giant robot over city street has effective scale - Old arcade scene captures lighting and mood well - Warrior against monster has strong color contrast Stock photos generated: - Food photo with appetizing realism - Office worker with skyline background looks authentic While giant woman walking through village is not perfect, it shows improved capabilities for complex prompts. Key takeaways: - V6 has significantly improved image coherence, accuracy, and complexity handling - Text rendering works fairly well now - Stock photos are generated very realistically - Will need to refine prompting approach compared to v5 - Still an alpha but available to try now and provide feedback Overall, v6 demonstrates major strides for Midjourney in generating accurate, high-quality images for a wide range of prompts and use cases.

Takeaways

  • Midjourney version 6 brings major improvements in image generation quality and accuracy
  • It has better coherence, composition, and artistic direction in complex images
  • Text rendering ability is improved in v6
  • Longer prompts work better now
  • V6 images look more realistic, like actual photos and stock images
  • Reprompting strategies from v5 may not work as well in v6
  • Describe feature will be upgraded to v6 later
  • Additional features like panum, very wide etc are coming soon
  • V6 handles prompts for complex scenes like spaceship interiors better
  • Overal v6 allows creating images closer to the intended prompt

Q & A

  • What are some of the major changes in Midjourney v6?

    -Better image quality, more accurate prompt following, improved text rendering, better results for long prompts, more realistic image generation.

  • How is artistic direction and composition better in v6 images?

    -V6 makes smarter decisions about lighting, contrast, saturation etc that enhance the overall artistic quality of images.

  • Why do old v5 reprompting strategies not work as well now?

    -V6 has been significantly upgraded so previous tricks like adding 'award winning photo realistic' don't improve quality much now.

  • What additional features are planned for future v6 updates?

    -Panum, very wide, upgraded describe v6 etc. Some announced features like text drawing are already partly implemented.

  • How does v6 better handle complex scenes like spaceship interiors?

    -It can now render complex functional scenes better even when few example images exist, thanks to improved model knowledge.

  • What does the improved text rendering ability allow?

    -V6 can now accurately generate written text like signs and labels within an image based on prompt.

  • Why do v6 images look more like real stock photos?

    -Enhanced coherence and realism in rendering various elements like food, office scenes results in more authentic looking images.

  • How can describe feature help in future?

    -Describe v6 will allow generating better prompts automatically that are tuned to capabilities of v6 model.

  • Why is relearning prompting strategies needed for v6?

    -The significant upgrades mean v6 works quite differently from v5 so previous prompting tricks don't apply well.

  • How does v6 allow getting images closer to prompt?

    -Increased accuracy, model knowledge and improved handling of artistic direction results in images that match prompts better.

Outlines

00:00

Introduction and Overview

Introduces that a new version of Midjourney is out and discusses some of the overall changes and improvements. Shares example images and prompts to showcase the capabilities. Also provides a personal update on the video creator's travels in Thailand.

05:02

Image Examples and Prompts

Shows specific image examples created with Midjourney v6, including a Beauty and the Beast / Simpsons mashup, a cartoon Santa Claus, and text rendered as images. Discusses how v6 handles complex scenes better like dragons, landscapes, and sci-fi spaceships. Also tests anime cat drawings, abandoned arcades, giant robots, and more.

10:02

Stock Photo Examples

Tests Midjourney v6's ability to create realistic stock images using food photography and office scenes as examples. Shows how it can generate high quality, authentic looking images that could be used in ad campaigns and commercial purposes.

Creating Gigantic Figures

Attempts to create images of gigantic figures integrated into landscapes, which is still challenging. Shares an example prompt and resulting silhouette of a 20-story tall woman walking through a village.

Announcement and Alpha Test Details

Summarizes details from the Midjourney v6 announcement, including improved accuracy for long prompts, better coherence and knowledge, and not needing certain prompting phrases used in v5. Mentions new features and capabilities that are still in development.

Mindmap

Keywords

💡midjourney

Midjourney is an AI image generation system that creates images from text prompts. The video discusses improvements and new features in version 6 of midjourney. It shows examples of images created with version 6 and analyzes how the quality and capabilities have advanced.

💡prompting

Prompting refers to the text prompts or instructions that are input to midjourney to tell it what kind of image to generate. The video examines how prompting strategies may need to change for version 6, with less dependence on certain prompting conventions used in version 5.

💡coherence

Coherence refers to how logically coherent, consistent, and meaningful an AI-generated image is. The video praises version 6 for improved coherence in complex scenes with multiple elements interacting logically.

💡artistic direction

Artistic direction refers to visual design choices that enhance the aesthetics, emotion, focus etc. of an image. The video analyzes strong artistic direction in several version 6 images related to lighting, contrast, saturation etc.

💡composition

Composition describes how elements are arranged within an image. The video frequently compliments version 6 images for excellent and meaningful compositions that align with the desired scene.

💡depth of field

Depth of field refers to the area of an image that is in focus. The video notes clever uses of depth of field blur in version 6 images to guide the viewer's attention.

💡silhouette

A silhouette refers to the outline or general shape of something, with interior details in shadow. The video points out the use of lighting and contrast to create strong silhouettes that help subjects stand out.

💡stock images

Stock images are reusable generic images that can be used across different contexts. The video examines version 6's ability to generate believable stock photos for areas lacking good actual stock images.

💡describe

The describe feature summarizes an AI-generated image into a text description. The video notes that a version 6 update to describe will help refine prompts to achieve a user's desired type of image.

💡capabilities

Capabilities refer to the spectrum of visual concepts midjourney can depict and the quality with which it can render them. The video discusses expanded capabilities in version 6 to handle more complex scenes and elements.

Highlights

Midjourney version 6 brings amazing new changes for more expressive and stock-like images.

New version has better accuracy for longer prompts created with GPT-3.

The AI model has more coherence and knowledge to create complex scenes.

Version 6 is trained on more diverse data to generate a wider variety of images.

Significant difference from version 5, will need to relearn prompting techniques.

No longer needs text prompts like 'award winning photo realistic'.

Now has minor text drawing capabilities to create text in images.

Additional styles like weird, subtle, remix, describe now work with version 6.

Describe version 6 coming soon to generate better prompts from images.

New version creates high quality, believable stock photos.

Can easily create authentic-looking office scenes.

Still challenging for giant characters, but quality improving.

Artistic decisions improved - contrast, lighting, color harmony.

Better functionality, coherence in complex spaceship interiors.

Exciting advancements, relearning prompts needed to benefit.

Transcripts

play00:00

midney version 6 is out and brings some

play00:02

amazing new changes hello my friends how

play00:04

are you doing I'm actually using moury

play00:07

almost on a daily basis especially if I

play00:09

want to have very expressive images or

play00:12

stock likee images so these new changes

play00:15

are actually really good I'm going to

play00:17

show you some example images and prompts

play00:19

and talk about how they changed and why

play00:22

it is so important to have these changes

play00:24

also shout out to robar for helping me

play00:27

with this video also here's a little

play00:28

update for my Thailand St it's just very

play00:31

beautiful here I'm seeing fireworks

play00:33

these amazing dance performances that

play00:35

are in the shopping centers I have spent

play00:37

some time with the monk talking about

play00:39

Buddhist philosophy and saw this

play00:41

gigantic Buddha statue shout out to

play00:44

Jurgen thank you for organizing that day

play00:46

for me of course making some new friends

play00:48

and meeting the Thailand stable

play00:50

diffusion Community all of that is

play00:52

really nice let's get started now it's

play00:54

time to look at some images and the

play00:56

prompts here so this is a prompt from

play00:57

robar Beauty and the Beast and the style

play01:00

of The Simpsons it's creating that in a

play01:03

pretty good way it actually looks like a

play01:05

scene from a Simpson comic so that is

play01:09

very nice easily understood now of

play01:10

course you will see when you look a

play01:12

little bit closer there's a little bit

play01:13

of a problem with the interaction of the

play01:15

hands and the arms but the style itself

play01:17

is pretty nice here's another example

play01:20

image from robar Santa Claus Superstar

play01:22

showing tongue cartoon hyper realistic

play01:25

now there's a little bit of a version

play01:26

five prompting in here but the

play01:29

interesting thing is the tongue sticking

play01:31

out and this is something that was

play01:33

pretty hard to do in version five

play01:35

especially in a way that looks good I

play01:37

would say here it looks really good now

play01:40

here's an example that I created with

play01:42

the text hello world and in three out of

play01:45

four Images this text was actually

play01:48

created correctly and I also like here

play01:51

it's not just creating the text but also

play01:53

you can see there is some depth of field

play01:55

blur on the text itself so the effect

play01:59

here is pretty pretty amazing here you

play02:01

can see the prompt that is pretty simple

play02:03

a photo of the text hello world written

play02:06

with a marker on a sticky note and this

play02:09

is exactly what we got here's the next

play02:11

scene that I had to test out we have

play02:13

here a very large dragon and then a

play02:16

smaller character a warrior standing on

play02:18

the left side in the ruins of a castle

play02:21

and I have to say this turned out really

play02:25

well this is a first Roll image and the

play02:28

composition is very nice but also

play02:30

followed my prompt very very nicely so

play02:33

here you can see the prompt I'm using a

play02:34

Burning Castle ruin on the left is a man

play02:38

in Warrior armor looking small in the

play02:41

image he's looking up to a large red

play02:44

dragon who is looking down at him with

play02:47

anger and I have to say that this

play02:49

actually is completely there even the

play02:52

interaction with the character the whole

play02:54

style is there the fire is there the

play02:57

colors work very nice and a lot of this

play02:59

also has very nice artistic Direction in

play03:02

the sense of what is more or less

play03:05

saturated what is brighter and darker

play03:07

where we have high and low contrast so

play03:10

an example here would be that the

play03:12

warrior has as a background lighter

play03:14

color so you can actually see his

play03:16

silhouette and even the sword he's

play03:18

holding in his hand very good artistic

play03:21

decision here from M Journey as you

play03:23

probably know I love cats and I love

play03:25

lowf music so of course I love lowf

play03:29

music wallpapers here I wanted to have a

play03:31

humanoid cat sitting in the sunset

play03:34

reading a book this kind of cozy

play03:36

scenario and this turned out really

play03:39

amazing now this is also a first rle and

play03:43

I'm really really happy with everything

play03:45

here the art style the composition the

play03:49

humanoid structure of the cat how the

play03:51

cat is actually reading and holding the

play03:53

book all of that is really beautiful

play03:56

here we have the prompt for that Co

play03:58

drawing in anime style of a cat with a

play04:01

human body reading a book in Sunset

play04:04

light this is not using Nichi this is

play04:07

using mid Journey version six here's

play04:10

another scene that can be quite complex

play04:12

for Mid journey I have to say I think it

play04:14

has improved a lot now here we have the

play04:17

cockpit of a Sci-Fi spacecraft and this

play04:21

is interesting because of course this is

play04:23

not something where there is thousands

play04:25

of images online that actually show you

play04:27

how stuff like that looks so in the P

play04:29

mid Journey had a lot of problems of

play04:31

creating these Interiors of spaceships

play04:34

because there's just not many examples

play04:35

there so I feel like here we have a very

play04:38

nice structure very nice composition we

play04:41

have a lot of functionality in the

play04:43

different elements in here and also the

play04:45

character is sitting correctly on a

play04:48

chair that actually makes sense and he

play04:50

is sitting there actually doing work so

play04:53

all of that has a lot of coherence and

play04:55

all of that has a lot of functionality

play04:57

in there here we have the prompt in this

play04:59

case I have used chat GPT to create a

play05:01

longer prompt it reads the spaceship's

play05:04

command center is a hive of activity

play05:07

with holographic displays the captain

play05:09

overseas operations from a central chair

play05:12

surrounded by a panoramic view of space

play05:14

through a transparent viport crew

play05:17

members in futuristic uniforms operate

play05:19

control panels the air is charged with

play05:22

urgency as the crew coordinates their

play05:25

efforts and the soft hum of the

play05:27

spaceship systems underscores their

play05:29

focus next we have your giant MEC robots

play05:33

standing in a city street also very nice

play05:37

again I love the dynamic between the

play05:38

different elements how he's standing

play05:40

there how the light is working in the

play05:42

same way on him and on the surrounding

play05:45

buildings and then as a contrasting

play05:47

elements we have down there at the

play05:49

ground very small people that are

play05:51

running around in the streets again you

play05:54

can see here the very nice decision that

play05:57

the Silhouettes of the people on the

play05:59

ground have as a background lighter FX

play06:01

so that we can actually see them very

play06:04

clearly I love also not just the

play06:07

composition but also the camera angle

play06:09

looking up at that large robot and

play06:11

giving us very nice reference point on

play06:14

how big this robot actually is here we

play06:17

have the prompt very simple in this case

play06:19

movie scene of a giant robot standing in

play06:21

a city another thing I really like is

play06:24

linal space and here we have an old

play06:27

arcade I wanted to have some green light

play06:30

here we see some dirt on the ground some

play06:32

of the machines are still turned on so

play06:34

all of that is a pretty cool very

play06:36

interesting scene love the lighting love

play06:39

the atmosphere in this again here we

play06:41

have a little bit of a longer prompt

play06:42

created with Chad gbt it reads photo in

play06:45

Gray and green tones of the dimly lit

play06:47

space of an abandoned arcade the

play06:49

flickering glow of the malfunction

play06:51

neolites cast an eerie Ambience the once

play06:55

vibrant arcade machines stand silent and

play06:57

lifeless they greens Frozen in Time dust

play07:01

particles dance in the air caught in the

play07:04

soft beams of faded pastel colored

play07:06

lights the air is thick with a sense of

play07:09

nostalgia and abandonment as the ghostly

play07:12

Echoes of long-forgotten game sounds

play07:14

linger in The Emptiness and then I have

play07:17

to say again a lot of that is found here

play07:20

in this image with the dirt on the

play07:22

ground with the broken ceiling it has

play07:25

this kind of linal eeriness in there so

play07:28

that's pretty cool here's another

play07:30

example of a dynamic between a very

play07:32

large monster and a warrior standing in

play07:34

front of it again I really like the

play07:37

composition here with the cave with the

play07:39

water flowing down and again I can't

play07:43

stress this enough the nice decisions of

play07:45

the color here most of that is blue and

play07:48

orange and we have most of the light and

play07:50

the fire where the hero is standing and

play07:53

it is really bringing out the silhouette

play07:56

of the heroes standing there and the

play07:58

dynamic to the monster because also

play08:00

because of that light reflecting on the

play08:02

face we can see that the monster is

play08:04

looking down at the hero so there's so

play08:07

much artistic nice decision in here and

play08:10

this is really what moury is very very

play08:12

good at again we have a prompt from Chad

play08:15

GPT at the entrance of a dark cavernous

play08:19

abis a stoic Warrior faces a colossal

play08:22

shadowy Titan emerging from the depths

play08:25

the air is thick with tension as the

play08:27

warrior based in the e glow of the

play08:30

enchanted runes confronts the Monstrous

play08:33

entity the clash between the two Titans

play08:36

Echoes through the cavern each strike

play08:39

resonating with raw power as the warrior

play08:42

battles to prevent the malevolent force

play08:44

from escaping into the world above next

play08:47

I want to look with you at some stock

play08:49

images and I personally think that this

play08:51

is also a Hallmark of good AI image

play08:54

generation how well do these look as

play08:56

actual stock images that you could use

play08:59

for a campaign how much do you believe

play09:02

the actual image and here I have to say

play09:04

the results are really good the food

play09:06

looks really nice and tasty and like

play09:09

actual food in there the salad the

play09:11

tomatoes the cheese the meat even the

play09:14

sauce in between the salad and the bun

play09:17

looks like actual sauce all of that

play09:19

really good the composition is very nice

play09:21

with the depth of Fieldy have a nice

play09:23

focus on the food itself so all of that

play09:26

looks like an actual product photo

play09:29

really really good here we have the

play09:30

prompt you can see it's very simple

play09:32

stock photo of a juicy burger with

play09:34

tomatoes and salads here's another

play09:36

interesting scene and this is one of the

play09:39

things I love to test these are office

play09:42

pictures because you will find that if

play09:44

you go to stock Pines and what to have

play09:46

any kind of office or business related

play09:49

image they mostly look really terrible

play09:53

really staged really not authentic at

play09:56

all so here I want to have a guy who

play09:59

who's working late at the office with

play10:01

the illuminated City in the background

play10:04

and mney has delivered on the first role

play10:08

now this is the future of stock images

play10:11

because instead of searching for hours

play10:13

for good image you can just create it

play10:16

like that here we have my prompt man

play10:19

working late in office sitting in front

play10:22

of a computer at night few of the city

play10:25

skyscrapers behind him through the large

play10:28

glass w windows and look at how

play10:31

everything is in here and everything

play10:34

looks really good and here's the last

play10:36

image that I want to show you today this

play10:38

is kind of a challenge for you because I

play10:40

found that creating a gigantic woman

play10:43

walking through a village or a city is

play10:46

something that is still very difficult

play10:49

to create with mid Journey especially in

play10:52

a way where the woman is not a statue

play10:54

but actually a woman walking around now

play10:57

here it is a silhouette with no deta

play10:59

details and it looks a little bit more

play11:00

like a monster but still I have to say

play11:02

in this case MIT has done a really good

play11:04

job The Prompt here reads gigantic woman

play11:07

who is 20 stories tall walking through a

play11:10

village and yes I see that I have a

play11:12

little bit of a typo here in through

play11:14

let's have a bit of a look at the

play11:15

announcement and of course you can set

play11:17

it up in the settings to use version 6

play11:20

but you can also use minus minus V6 to

play11:23

call it inside of the prompt without

play11:25

changing it in the settings now they

play11:26

talk here a lot about more accuracy also

play11:29

for longer prompts which is very good

play11:31

when you create prompts with jat GPT and

play11:33

also it has more coherence and model

play11:35

knowledge in there which is also good

play11:37

because then it can create more complex

play11:39

scenes and just understand better what

play11:40

you mean and the model is trained on

play11:43

more different kind of things you want

play11:44

to create and the SC we're going to see

play11:46

that in a second now something they say

play11:49

here that's pretty interesting is that

play11:51

this is a significant difference to

play11:54

version five and they say you will need

play11:56

to relearn how to prompt so things like

play12:00

aart winning photo realistic 4K 8K they

play12:04

call it chunk and that is no longer

play12:07

needed for the images not quite sure

play12:09

100% you don't need that but it actually

play12:12

is creating really good results even

play12:15

without these words in there another

play12:17

thing to point out here is that it

play12:18

actually has a minor text drawing

play12:20

ability so it can create text and in my

play12:23

test actually created a text pretty good

play12:26

in most of the rendered images and right

play12:29

now this is an alpha test this is not

play12:31

the final version of version 6 but it is

play12:34

out in the sense that everybody can use

play12:35

it so you can see here that all of these

play12:38

different features like different ratios

play12:41

chos weird tille styz styz raw very

play12:44

subtile very strong remix blend describe

play12:47

and so on are also available for version

play12:51

six although for the describe version

play12:52

says just the version five is used here

play12:56

and then there is some other stuff

play12:57

that's not working yet that that's

play12:59

coming soon that is for example panum

play13:02

very Reach In Tune and then also

play13:05

describe version 6 describe version six

play13:06

will be very very useful because then

play13:09

you can actually have prompts as they

play13:11

are meant to be and you can just create

play13:13

prompts based on images tuned to the use

play13:15

in mid Journey personally I'm very

play13:18

Amazed by these changes because they

play13:20

help me out a lot in generating the

play13:22

images I need let me know in the

play13:24

comments what you think about that

play13:25

thanks for watching and leave a like if

play13:27

you enjoyed this video bye oh you're

play13:30

still here so uh This is the End screen

play13:32

there's other stuff you can watch like

play13:34

this or that really cool and yeah I hope

play13:37

I see you soon uh leave a like if you

play13:40

haven't yet and well um yeah

Rate This

5.0 / 5 (0 votes)

您需要『中文』的总结吗?