Fooocus Tutorial Part 3 - FaceSwap/Pyracanny/CPDS & More Explained

Jump Into AI
2 Jan 2024 · 09:24

Summary

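TLDR In this third part of the series, I dig deeper into the Input Image tab and share tricks I have learned, along with some quality-of-life tips you may not know about. I explain settings that confused me when I first started, and break down the ControlNets and their settings so you understand what each one does and which suits which purpose. From the history log to advanced use of the Input Image and variation tabs, I cover how to use these tools to refine and create images, including adjusting expressions and combining multiple images into new results. I also describe what distinguishes each ControlNet and how it affects the final image.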

Takeaways

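  • 📖 This is the third part of the Fooocus series, a deeper look at Input Image tab tricks and quality-of-life tips.
  • 🔍 The history log lets you review past generations and the settings used, so they are easy to reuse.
  • 📅 The history log shows only that day's generations by default; earlier ones can be found in the Fooocus outputs folder.
  • 🔄 "Load Parameters" quickly applies earlier settings to a new generation, but it does not restore the input images.
  • 🌈 The "Vary (Subtle)" setting adds slight variation to an image, such as a change of expression.
  • 🖼️ Image Prompt lets you combine up to four images to create a new one.
  • 🎨 ControlNet settings such as PyraCanny and CPDS help preserve certain features of the source image during generation.
  • 🔄 FaceSwap detects the face and hair in an image and applies them to the final generation.
  • ⚖️ "Weight" and "Stop At" set how strongly and for how long a ControlNet is applied, which allows fine adjustment of the final image.
  • 👁️ Combining different ControlNets with text prompts produces a wide range of results.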

Q & A

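  • What is the Input Image tab?

    -The Input Image tab lets you upload your own images. The AI uses their style, characters, composition, and colors to influence the final image.

  • What does the PyraCanny ControlNet do?

    -PyraCanny extracts a line-art version of the image that keeps its fine detail, then applies it alongside your text prompt, so the new image retains parts of the original structure.

  • How does FaceSwap work?

    -FaceSwap detects the face and hair in an image and applies them to the final generation, whether you use only a text prompt or a combination of images, ControlNets, and text prompts.

  • How do the Weight and Stop At settings affect a ControlNet?

    -Weight sets how strongly each ControlNet influences the final image. Stop At sets the share of generation steps it is applied to. Tuning both lets you balance the ControlNets against each other.

  • How does the number of steps affect image quality?

    -More steps may improve quality but take longer to generate. With Performance set to Speed, every image runs through 30 steps. Ideally, find a balance between steps, quality, and time.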

Outlines

00:00

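😊Input Image tab tips and tricks

This section covers tips and tricks for the Input Image tab: using the history log, copying and reviewing earlier generation settings, fine-tuning expressions, and combining multiple input images with ControlNets.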

05:02

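😃ControlNet settings explained in detail

This section explains the ControlNet options in the Input Image tab, including Image Prompt, PyraCanny, CPDS, and FaceSwap, and how the Weight and Stop At parameters change how strongly each one acts.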

Keywords

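💡History Log

The history log is where you can find all of your previous generations. The video uses it to review and reuse old settings and inputs. For example, if you made an image you like and want the same settings again, you can copy them to the clipboard from the history log and paste them into the prompt box to reproduce the earlier generation conditions.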
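The video notes that older generations live in dated subfolders of the outputs folder, each with its own log file. A minimal sketch of listing those logs follows. The file name `log.html`, the example path, and the helper name `find_daily_logs` are assumptions for illustration and are not part of Fooocus itself.

```python
from pathlib import Path


def find_daily_logs(outputs_dir):
    """Return (folder_name, log_path) pairs for each dated folder holding a log.html."""
    root = Path(outputs_dir)
    logs = []
    for folder in sorted(p for p in root.iterdir() if p.is_dir()):
        log_file = folder / "log.html"  # assumed file name
        if log_file.is_file():
            logs.append((folder.name, log_file))
    return logs


if __name__ == "__main__":
    # "Fooocus/outputs" is a placeholder; point it at your own install.
    outputs = Path("Fooocus/outputs")
    if outputs.is_dir():
        for day, log in find_daily_logs(outputs):
            print(day, "->", log)
    else:
        print("outputs folder not found:", outputs)
```

Opening any of the listed files in a browser shows that day's generations, as described in the video.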

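💡Input Image

Input Image is a key feature discussed in the video. It lets you upload your own pictures as part of the generation process. You can supply one or more images to influence the final result. For example, you can upload a photo of a person and ask the AI to change the expression or place the person in a different setting.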

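💡ControlNet

ControlNets are an advanced feature explained in the video. They include options such as PyraCanny and CPDS that give you precise control over generation. For example, PyraCanny captures the detail of an image and applies it alongside the text prompt, while CPDS uses a contrast-preserving decolorization method to influence the image.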

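💡Variation

Variation is a setting mentioned in the video that adds subtle changes to an image, such as a different facial expression. It makes it easy to generate similar images with a different mood or feature, for example going from a smile to a laugh.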

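💡Weight

Weight is a control parameter that sets how much each ControlNet influences the final image. In the video, adjusting the weight fine-tunes the strength of elements such as style and color. Raising a ControlNet's weight makes its features more prominent in the result.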

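💡Stop At

Stop At defines how far into the generation process a ControlNet is applied. A value of 0.5 means the ControlNet acts during the first 50% of the steps. This lets you keep some of the original features while leaving room for new elements.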
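The arithmetic from the video (30 steps at Stop At 0.5 gives 15 active steps) can be sketched as follows. This is a simplified model that assumes a constant weight while the ControlNet is active and zero influence afterwards. The function names are hypothetical, and the rounding Fooocus uses internally is not stated in the source.

```python
def controlnet_active_steps(total_steps, stop_at):
    """Number of leading sampling steps during which a ControlNet is applied."""
    if not 0.0 <= stop_at <= 1.0:
        raise ValueError("stop_at must be between 0 and 1")
    return round(total_steps * stop_at)


def influence_schedule(total_steps, stop_at, weight):
    """Per-step influence: `weight` while active, 0.0 afterwards (simplified)."""
    active = controlnet_active_steps(total_steps, stop_at)
    return [weight if step < active else 0.0 for step in range(total_steps)]


schedule = influence_schedule(total_steps=30, stop_at=0.5, weight=1.0)
print(controlnet_active_steps(30, 0.5))  # 15
print(schedule[14], schedule[15])        # 1.0 0.0
```

In this model, raising `stop_at` lengthens the run of non-zero entries, while raising `weight` makes each of those entries larger.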

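💡FaceSwap

FaceSwap detects the face and hair in an image and applies them to the final generation. In the video it is used to carry the facial features of one picture over to another while keeping or changing the background. It is useful for creating custom images of a specific person's face.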

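💡Image Prompt

Image Prompt lets you upload up to four images and blend their characteristics into the final generation. It works by drawing on the style, color, and composition of the uploaded images. For example, you can combine the styles of several images to create something entirely new.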
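One way to picture the four image slots and their per-slot settings is as a small data structure with a sanity check. This is only an illustrative sketch: the class, field names, type labels, and numeric values below are hypothetical and are not Fooocus's API or its defaults. The PyraCanny/CPDS warning reflects the video's advice to use one or the other.

```python
from dataclasses import dataclass

CONTROL_TYPES = {"ImagePrompt", "PyraCanny", "CPDS", "FaceSwap"}
MAX_SLOTS = 4  # the video says up to four images can be combined


@dataclass
class ImageSlot:
    path: str          # hypothetical path to the uploaded image
    control_type: str  # one of CONTROL_TYPES
    stop_at: float     # fraction of steps, 0..1
    weight: float      # per-step influence strength


def check_slots(slots):
    """Return a list of readable problems; an empty list means the setup looks fine."""
    problems = []
    if len(slots) > MAX_SLOTS:
        problems.append(f"at most {MAX_SLOTS} images can be combined")
    for slot in slots:
        if slot.control_type not in CONTROL_TYPES:
            problems.append(f"unknown control type: {slot.control_type}")
        if not 0.0 <= slot.stop_at <= 1.0:
            problems.append(f"stop_at out of range for {slot.path}")
    used = {slot.control_type for slot in slots}
    if {"PyraCanny", "CPDS"} <= used:
        problems.append("PyraCanny and CPDS both capture structure; pick one")
    return problems


# Illustrative values only, not Fooocus defaults.
slots = [
    ImageSlot("pose.png", "PyraCanny", stop_at=0.5, weight=1.0),
    ImageSlot("face.png", "FaceSwap", stop_at=0.9, weight=0.75),
]
print(check_slots(slots))  # []
```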

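💡PyraCanny

PyraCanny is a ControlNet that captures the lines and detail of an image and applies them during generation. In the video, it ensures the generated image keeps structural features of the original, which is useful when detail needs to stay consistent from one image to the next.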
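To give a rough feel for what a line-art or edge map is, here is a deliberately simplified sketch that marks pixels where brightness changes sharply. It is a plain gradient threshold, not the real Canny algorithm (which also smooths, thins, and links edges) and not Fooocus's PyraCanny preprocessor. The function name and threshold value are assumptions.

```python
def edge_map(gray, threshold):
    """Mark pixels whose local brightness change exceeds `threshold`.

    `gray` is a list of rows of brightness values. Border pixels stay 0.
    """
    height, width = len(gray), len(gray[0])
    edges = [[0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = gray[y][x + 1] - gray[y][x - 1]  # horizontal change
            gy = gray[y + 1][x] - gray[y - 1][x]  # vertical change
            if (gx * gx + gy * gy) ** 0.5 > threshold:
                edges[y][x] = 1
    return edges


# A tiny image: dark left half, bright right half.
image = [[0, 0, 0, 255, 255, 255] for _ in range(5)]
for row in edge_map(image, threshold=100):
    print(row)
```

The interior rows mark the two columns on either side of the dark-to-bright boundary, which is the kind of outline a structure-based ControlNet follows.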

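💡CPDS

CPDS (Contrast Preserving Decolorization Structure) is a custom ControlNet created by the Fooocus team. It combines depth control with a decolorization method that converts the image to black and white, then uses that structure as the control. Its method differs from PyraCanny's, but it serves the same purpose of controlling generation precisely. In the video it is used to influence the style and structure of the image.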

Highlights

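The history log lets you view and copy earlier generation settings

Vary (Subtle) can slightly change the expression in an image

An image prompt combined with a text prompt produces a blended image

PyraCanny captures image detail and applies it to a new image

CPDS extracts the image structure for use as a ControlNet

FaceSwap replaces only the face and hair

Weight sets a ControlNet's strength; Stop At sets how many steps it is applied to

Examples show how images differ under different settings

Advanced settings allow combining multiple ControlNets

ControlNets can keep elements of the source image while changing its style

FaceSwap can be combined with other ControlNets and text prompts

Stop At sets the percentage of steps a ControlNet stays active

Weight and Stop At together determine the influence at each step

The default settings usually work best

Combining ControlNets with text prompts creates new images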

Transcripts

play00:00

in this third installment of my series

play00:02

on Fooocus I want to dive deeper into the

play00:04

input image Tab and show you some tricks

play00:06

I've learned as well as some other

play00:08

quality of life tips that you might not

play00:09

know about I will explain some settings

play00:11

that even confused me when I first

play00:13

started to use them and break down the

play00:15

control Nets and their settings to help

play00:17

you fully understand what they are doing

play00:20

and which ones are better for what

play00:22

purpose if you haven't seen my previous

play00:24

videos feel free to start there if you

play00:25

like otherwise let's get started first

play00:28

thing I want to show you is something

play00:29

some of you may not know about and that

play00:31

is the history log click Advanced then

play00:34

click history log in here you can find

play00:36

all your previous generations you can

play00:38

use this if you created something in the

play00:40

past and wanted to know what settings

play00:42

and inputs you used if you want to reuse

play00:44

one of these settings all you have to do

play00:46

is click copy to clipboard then hit okay

play00:49

go back to the Fooocus window and paste

play00:51

that into the text prompt you will

play00:53

notice the generate button changed to

play00:55

load parameters this will change all the

play00:57

settings in Fooocus to the ones you used

play00:59

for that previous image one thing to

play01:01

note is that while it loads all the

play01:03

settings if one of these images was

play01:05

generated while using the input image

play01:07

settings it will not load in the

play01:10

previous images you were using you will

play01:12

have to re-add those manually the second

play01:15

tip about the history log is that you

play01:16

may have noticed that it only shows the

play01:19

images you created that day you can see

play01:21

all the images you have created if you

play01:23

go to your Fooocus outputs

play01:25

folder go to your Fooocus outputs folder

play01:28

then you can see where your Generations

play01:30

are separated into folders by date when

play01:33

you go into a folder you will see a log

play01:35

HTML file double click that and it will

play01:37

open that day's log file where you can

play01:39

see those

play01:41

Generations if you don't see a copy to

play01:43

clipboard in a past log file it was

play01:46

probably made before that option was

play01:48

implemented this is something added

play01:50

recently okay I'm going to reload my

play01:52

page to go back to default settings and

play01:54

we will now go into input image in the

play01:56

upscale or variation tab I mainly want

play01:58

to talk about the Vary (Subtle) setting

play02:00

this is a nice tool to add a little

play02:02

variation in your image one thing I use

play02:04

it for is changing the expression of an

play02:06

image this can also be done in the

play02:08

inpaint tool but it can come out a

play02:09

little more natural here sometimes I

play02:11

will load an image in here and select

play02:14

Vary (Subtle) for this I'm using all the

play02:16

default settings other than changing the

play02:18

resolution to 768

play02:20

x1344 and unchecking random with this

play02:23

image we will try to get a happy

play02:25

expression sometimes a few simple words

play02:27

will

play02:28

work and that definitely worked we got a

play02:31

little smile but if we want to go bigger

play02:33

maybe try to get her

play02:37

laughing and still a subtle expression

play02:40

so we can try going bigger adding mouth

play02:43

wide open lots of teeth showing and see

play02:45

what that gives us a little better but

play02:48

again this is a subtle change so it's

play02:50

not going to go

play02:52

crazy you can obviously go the other way

play02:54

and get an angry

play02:57

expression it's different with each

play02:59

image some pictures don't take much to

play03:01

get exaggerated emotions While others

play03:03

can take some real

play03:05

effort the rest of these settings I've

play03:07

covered before so I'm going to move on

play03:09

to image

play03:10

prompt here we can combine up to four

play03:13

Images together first thing to do is

play03:15

click the checkbox advanced so we can

play03:17

get these extra settings to show okay

play03:19

I'm going to do my best to thoroughly

play03:21

explain all these settings in here and

play03:22

hopefully it will make a bit more sense

play03:24

to you the first is image prompt this is

play03:27

very similar to Midjourney in that when

play03:29

you upload your own image and use that

play03:32

as a base it will take everything from

play03:34

the image the style character

play03:37

composition colors and use it all to

play03:39

influence the final image I'll grab an

play03:41

image and add a simple text

play03:44

prompt and we will see what we

play03:49

get using this prompt you can see it

play03:51

mixed a little bit of everything from

play03:53

this image with our prompt now how much

play03:56

it influences has to do with the stop at

play03:58

and weight settings but we will go over

play04:01

those a little

play04:02

later next we have

play04:04

PyraCanny this is a great setting if you are

play04:07

trying to get a character with a similar

play04:08

pose or keep details from one image to

play04:11

another what it does is is creates a

play04:13

line art picture that captures intricate

play04:15

details of your image then applies that

play04:18

to your text prompt and any other

play04:19

control Nets you

play04:21

have so let's try this out I will add

play04:25

this text prompt a woman in a business

play04:26

suit in an

play04:28

office

play04:31

using this text prompt I got a

play04:33

composition very similar to the image

play04:35

even the background shapes are included

play04:37

and then altered enough to match the

play04:38

scene so with canny you can keep an

play04:41

image and character similar to what you

play04:42

have and still change the entire style

play04:45

and come out with some great results

play04:46

it's a very useful

play04:48

tool CPDS stands for contrast preserving

play04:52

decolorization structure sounds complex

play04:54

but it is a custom control net by the

play04:56

Fooocus team that incorporates depth

play04:59

control combined with a decolorization

play05:01

method that converts the image to black

play05:03

and white and uses the structure of that

play05:05

as a control net to influence an image

play05:08

PyraCanny and CPDS are doing the same thing

play05:11

but using completely different methods

play05:13

you can successfully combine all of

play05:15

these control Nets but when it comes to

play05:17

these two it's best to only use one or

play05:19

the

play05:20

other face swap will detect only the

play05:23

face and hair from an image and apply it

play05:25

to the final generation either by using

play05:27

just a text prompt or a combination of

play05:29

images control Nets and text

play05:32

prompts let's add another image

play05:35

here and we will be adding this face to

play05:38

this image structure for this we will

play05:40

use PyraCanny and on this image we

play05:42

will select face swap without any text

play05:45

prompt the AI will take the structure of

play05:47

this image add this face and then make

play05:49

up its mind to fill in the

play05:51

blanks it will not add this face onto

play05:53

this exact image as the control net is

play05:55

only going by the overall structure but

play05:58

with a text prompt describing a subject

play06:00

like our first picture then we get a

play06:01

composition very close to this

play06:04

image but really you can make this go

play06:07

wherever you

play06:08

want with this text prompt a clown woman

play06:11

at a

play06:12

circus you can see we have the structure

play06:14

of the first image the face of the

play06:16

second and now our setting is a circus

play06:19

you can see it's trying to put clown

play06:21

makeup on her but the face swap settings

play06:23

are pretty high since we want to keep it

play06:25

close to this face but we can keep

play06:28

adding to the text prompt to get more

play06:30

changes to the face or even start to

play06:32

lower the face swap settings a little

play06:34

eventually pushing the text to start

play06:36

overtaking the control net we can also

play06:39

um add an image prompt

play06:42

here let's say we like the color style

play06:44

of this

play06:46

image and want it to start mixing in a

play06:48

bit since we already have two control

play06:50

Nets I will increase the weight for this

play06:52

to help it start showing without a text

play06:55

prompt once again the AI will do its own

play06:57

thing but you can see it's adding those

play06:59

bright colors

play07:00

in now that you better understand what

play07:03

each of these control Nets do let's talk

play07:05

about how these settings work and affect

play07:08

them here are the settings we can change

play07:11

these will directly affect whichever

play07:13

control net we have

play07:15

selected weight is how much we want the

play07:17

selected control net to influence the

play07:19

final image the higher the weight the

play07:21

more it will push that control Net's

play07:22

influence per step the stop at takes

play07:25

that control net and the weight and then

play07:27

applies it to the generation for however

play07:30

many steps we set it for with

play07:32

Performance Set to speed all images will

play07:35

go through 30 steps that is how many

play07:37

times the image will run through its

play07:38

algorithm until the final image is

play07:40

completed you see that every time you

play07:42

generate

play07:43

images think of the stop at setting as a

play07:46

percentage with the PyraCanny here at 0.5 it

play07:49

is 50% so with 30 steps total this

play07:52

control net will be applied for the

play07:54

first 15 Steps only basically it will

play07:57

stop at 50% this gives the image a

play08:00

chance to get its basic structure formed

play08:03

and then at 50% let the other control

play08:05

Nets and text prompts continue to

play08:07

influence the image in whatever way you

play08:09

are asking them to setting it higher

play08:12

will have the control net applied to

play08:13

more steps and keep the structure

play08:15

likeness towards the original image and

play08:17

will give the AI less ability to draw

play08:19

Outside the Lines the same goes for

play08:22

setting it lower that will give more

play08:24

freedom to the AI and not adhere to the

play08:26

structure so much so the weight is how

play08:29

strong the control net affects every

play08:31

step and the stop at is how many steps

play08:33

the control net will run for there is no

play08:35

magic number between the settings but

play08:37

going off the defaults that each control

play08:39

net starts with is best we can delve

play08:41

deeper but I feel like any further and

play08:43

it will be overwhelming but I hope this

play08:45

is starting to make sense before

play08:47

wrapping this up I want to show some

play08:49

visual side-by-side examples of

play08:51

different settings with stop at and

play08:55

weight I didn't run CPDS in this vid but

play08:58

as you can see it's similar to

play09:03

PyraCanny and here are a few featuring face

play09:07

swap and I think I will conclude this

play09:10

video here I don't ever ask for likes

play09:12

and subscribes I figure you will do that

play09:14

on your own but I will say thanks to

play09:17

those who do I appreciate it and it

play09:19

motivates me to keep creating I will see

play09:22

you in the next one
