Is Kandinsky-2 better than Stable Diffusion?

Nerdy Rodent
28 Apr 2023 · 27:59

Summary

TLDR: This video review looks at Kandinsky 2, an image generation tool that supports several modes, including text-to-image and image-to-image. Although Kandinsky 2 lacks some of the features available for Stable Diffusion, it offers styles and capabilities that may interest users. The video also compares Kandinsky 2's output with Stable Diffusion's, showing differences in style and image quality. The presenter also points to Google Colab as a way to run Kandinsky 2 without installing software on a local computer.

Takeaways

  • 🖼️ Kandinsky 2 is an image generation tool that supports text-to-image, image-to-image, image mixing, and other modes, and can produce a range of image styles.
  • 🇷🇺 The name Kandinsky 2 hints at the project's Russian origins, which may give some users pause because of the political context.
  • 📜 Kandinsky 2 uses the Apache 2 license, which is fairly permissive and allows free use and redistribution.
  • 📚 The repository includes several example Jupyter notebooks that show how to use the different Kandinsky 2 features.
  • 💽 A GPU with at least 10 GB of VRAM is recommended for generating 512x512-pixel images with Kandinsky 2.
  • ⚙️ For a local installation, Linux with a modern Nvidia GPU is the preferred setup.
  • 🔧 If you lack the hardware or do not want to install anything locally, Google Colab gives remote access to a Linux environment with an Nvidia GPU.
  • 📁 Installing Kandinsky 2 may involve creating a separate Python environment with a tool such as Anaconda, Miniconda, or venv.
  • 🔄 Kandinsky 2 offers image mixing, which combines several images into one.
  • 🎨 The Kandinsky 2.1 notebook interface provides sliders and text fields for adjusting generation parameters.
  • 📈 Compared with other models such as Stable Diffusion, Kandinsky 2 does well on complex prompts, handling both style and detail.

Q & A

  • What kind of AI tool is Kandinsky 2?

    -Kandinsky 2 is an AI tool focused on image generation.

  • What license does Kandinsky 2 use?

    -Kandinsky 2 uses the Apache 2 license, which is fairly permissive.

  • What features does Kandinsky 2 offer?

    -Kandinsky 2 offers text-to-image, image-to-image, image mixing, inpainting of selected image regions, and inserting text into an image.

  • How much video memory (VRAM) is recommended for Kandinsky 2?

    -At least 10 gigabytes of VRAM is recommended for working with Kandinsky 2.

  • Which operating system and hardware are preferable for Kandinsky 2?

    -For the best performance, ease of use, and compatibility, Linux with a modern Nvidia GPU is the best choice.

  • How can Kandinsky 2 be used without installing it on a local computer?

    -Google Colab can be used instead of a local installation; it provides free remote access to a Linux plus Nvidia combination.

  • Which command installs Kandinsky 2 on a local computer?

    -Kandinsky 2 can be installed locally with the `pip install` command given on the project's GitHub page.

  • Which additional libraries may be needed to run all of the Kandinsky 2 notebooks?

    -Running all of the notebooks may require extra packages such as Jupyter Notebook, ipywidgets, OpenCV, and protobuf.

  • What usage examples do the Kandinsky 2 notebooks contain?

    -The notebooks contain examples of the various features, such as generating images from text, transforming one image into another, inpainting image regions, and others.

  • What comparison tests were run between Kandinsky 2 and Stable Diffusion?

    -The video runs comparison tests that generate images in various styles and themes, such as cyberpunk, watercolor, surrealism, and realism, to see how well each tool performs.

  • What were the results of comparing Kandinsky 2 and Stable Diffusion?

    -In the comparison, Kandinsky 2 produced better results in many cases, especially in styles that demand detail and a specific look, such as cyberpunk, surrealism, and realism.

Outlines

00:00

🌟 Introduction to Kandinsky 2 and its capabilities

This section introduces the Kandinsky 2 image generation project. It covers features such as text-to-image, image mixing, and inpainting with masks. The repository is released under the Apache 2 license and offers two sets of models, for versions 2.0 and 2.1. System requirements are discussed, including enough VRAM to work with 512x512 images, and Linux with a modern Nvidia GPU is recommended for the best performance. Google Colab is also mentioned as a way to get remote access to a Linux system with an Nvidia GPU.

05:02

📚 Installing and setting up the Kandinsky 2 environment

This section describes setting up an environment for Kandinsky 2: creating a new environment with Anaconda, then installing the main library and extra components such as Jupyter Notebook, ipywidgets, and OpenCV. It also mentions the need to install protobuf version 3.20.0. The presenter then copies the example notebooks from the repository and discusses how he organizes files and directories.
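
The setup steps described in the video can be collected into a short script. This is only a sketch: it prints the commands rather than running them (`conda activate` would not persist from a subprocess anyway). The environment name `kandinsky2`, the conda-forge package name `opencv`, and the helper name `setup_commands` are assumptions. `PIP_INSTALL_FROM_GITHUB` is a placeholder for the single pip command shown on the project's GitHub page, which the video does not read out.

```python
import shlex

# Placeholder: copy the real one-line command from the project's GitHub page.
PIP_INSTALL_FROM_GITHUB = ["pip", "install", "<package spec from the GitHub page>"]


def setup_commands(env_name="kandinsky2"):
    """Return the setup steps from the video as argv lists, in order."""
    return [
        # The video does not say whether a Python version was pinned here;
        # add a python=... spec if your conda setup needs one.
        ["conda", "create", "--name", env_name],
        ["conda", "activate", env_name],
        PIP_INSTALL_FROM_GITHUB,
        # Extras needed by the example notebooks (package names assumed).
        ["conda", "install", "-c", "conda-forge", "notebook", "ipywidgets", "opencv"],
        # The presenter found one notebook needed this exact protobuf version.
        ["pip", "install", "protobuf==3.20.0"],
    ]


if __name__ == "__main__":
    for argv in setup_commands():
        print(shlex.join(argv))
```

Printing the commands keeps the sketch safe to run anywhere; paste them into a terminal once the placeholder has been replaced.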

10:04

🖼️ Trying out Kandinsky 2 features with examples

In this section the presenter demonstrates various Kandinsky 2 features, including image-to-image and inpainting with masks. He shows results from the version 2.0 and 2.1 notebooks and discusses the interface and capabilities of each. He also compares the results with other tools such as Stable Diffusion and gives his opinion on the quality of the images.

15:05

🎨 Testing styles and comparing with Stable Diffusion

The presenter runs a series of tests comparing Kandinsky 2 with Stable Diffusion across different styles and prompts. He judges the quality of images generated in styles such as cyberpunk, realistic, watercolor, and surreal. In most cases he prefers the results from Kandinsky 2.

20:06

🎭 Attempting anthropomorphic and stylized images

In this section the presenter tries more complex and stylized images, such as a happy anthropomorphic fire and a Roman soldier. He compares the results with Stable Diffusion and judges how well each program handles complex prompts and produces images in the requested style. He concludes that Kandinsky 2 does better at stylized images.

25:06

🥪 Testing image mixing

The presenter looks at the Kandinsky 2 image mixing feature, combining a Roman soldier with a lance and a sandwich. He shows how the program combines elements of the input images into a new image that blends the originals. Although he is not especially happy with the final mixing result, he stresses that the feature may appeal to other users.

Keywords

💡AI

Artificial intelligence (AI) is the field of computer science concerned with building systems that perform tasks normally requiring human intelligence. The video touches on the rapid pace of AI releases, particularly large language models and image generation.

💡Image

In the context of the video, an image is the output of a generation tool such as Kandinsky 2, which turns text descriptions or other images into a visual result. Image generation is the main way the video demonstrates what the tool can do.

💡Kandinsky 2

Kandinsky 2 is the image generation software reviewed in the video. It can create images in a variety of styles from text descriptions or other images, illustrating what current generative AI can do.

💡Text to image

Text to image is a technique that creates an image from a text description. In the video it is one of the main Kandinsky 2 features, and it shows how the model interprets a prompt and produces a matching scene.

💡Image to image

Image to image is a process in which a source image is transformed or modified by the model, guided by a text prompt or other images. In the video it is one of the Kandinsky 2 features used to demonstrate transforming and merging visual inputs.

💡Image mixing

Image mixing combines two or more images into one, usually to create new visual content. In the context of the video, Kandinsky 2 can mix images to produce new compositions.
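
One way to picture image mixing is as a weighted blend of embedding vectors, one per input. The sketch below shows only that idea with toy numbers. The video does not describe how Kandinsky 2 implements mixing internally, so the function name `mix_embeddings`, the toy vectors, and the normalised-weights convention are all assumptions.

```python
def mix_embeddings(embeddings, weights):
    """Blend equal-length vectors using weights normalised to sum to 1."""
    if not embeddings or len(embeddings) != len(weights):
        raise ValueError("need one weight per embedding")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    dim = len(embeddings[0])
    if any(len(e) != dim for e in embeddings):
        raise ValueError("embeddings must share one length")
    return [
        sum(w / total * e[i] for e, w in zip(embeddings, weights))
        for i in range(dim)
    ]


# Toy stand-ins for a "Roman soldier" and a "sandwich" embedding (hypothetical values).
soldier = [1.0, 0.0, 2.0]
sandwich = [0.0, 4.0, 2.0]
print(mix_embeddings([soldier, sandwich], [0.5, 0.5]))
```

Changing the weights shifts the blend toward one input, which is roughly how a mixing slider can be thought of.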

💡Inpainting

Inpainting is an image editing process in which the model fills in or restores parts of an image that have been removed or damaged. In the video it is a Kandinsky 2 feature used to demonstrate repainting a masked region of an image.
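
The notebook example in the video masks the bottom part of the picture and repaints it. Here is a minimal sketch of that masking idea, using plain nested lists instead of real images. The helper names are hypothetical, and the convention that 1 marks pixels to regenerate is an assumption that may be the opposite of what a given library expects.

```python
def bottom_mask(height, width, fraction=0.5):
    """Return a height x width mask with 1s in the bottom `fraction` of rows."""
    first_masked_row = height - int(height * fraction)
    return [
        [1 if row >= first_masked_row else 0 for _ in range(width)]
        for row in range(height)
    ]


def composite(original, generated, mask):
    """Take generated pixels where mask is 1 and original pixels elsewhere."""
    return [
        [g if m else o for o, g, m in zip(o_row, g_row, m_row)]
        for o_row, g_row, m_row in zip(original, generated, mask)
    ]


# "o" stands for an original pixel, "g" for a newly generated one.
original = [["o"] * 4 for _ in range(4)]
generated = [["g"] * 4 for _ in range(4)]
for row in composite(original, generated, bottom_mask(4, 4)):
    print("".join(row))
```

The top half of the printed grid stays original and the bottom half is replaced, mirroring the "mask at the bottom" example.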

💡Apache 2 license

The Apache 2 license is one of the most popular open-source software licenses and grants broad freedom to use, distribute, and modify the code. The video mentions that Kandinsky 2 is released under this license, which means it is open and available to use and extend.

💡VRAM

VRAM (video RAM) is the memory that graphics processors (GPUs) use to store and process graphics data. In the context of the video, VRAM matters for image generation software, since more VRAM makes it possible to work with more complex, higher-resolution images.
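
The figures quoted in the video (about 8.7 GB used by version 2.1 for a 512x512 image, and a suggested 10 GB or more) can be turned into a quick pre-flight check. The sketch below is a hypothetical helper, not part of Kandinsky 2, and the three verdict labels are my own. You would pass in your card's total memory, for example as reported by `nvidia-smi`.

```python
MEASURED_USE_GB = 8.7   # presenter's measurement: version 2.1, 512x512 image
SUGGESTED_GB = 10.0     # presenter's suggested minimum


def vram_verdict(total_gb):
    """Classify a GPU's total VRAM against the figures quoted in the video."""
    if total_gb >= SUGGESTED_GB:
        return "ok"
    if total_gb > MEASURED_USE_GB:
        return "tight"
    return "likely too small"


for card_gb in (8, 10, 12, 24):
    print(card_gb, "GB:", vram_verdict(card_gb))
```

The thresholds come from one person's machine, so treat the verdicts as a rough guide rather than a guarantee.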

💡Google Colab

Google Colab is a cloud platform from Google that provides machine learning and data analysis resources without installing software on a local computer. The video mentions it as an alternative to a local installation of Kandinsky 2, letting users run it in the browser on Google's compute resources.

💡Anaconda

Anaconda is a Python distribution that includes a Python runtime, a package manager, and many scientific libraries. In the video the presenter uses Anaconda to create an isolated environment in which Kandinsky 2 is installed and run, which keeps projects separate from each other.

Highlights

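Kandinsky 2 is an image generation tool that can produce fairly reasonable images, although it does not have all the features available for Stable Diffusion.

Kandinsky 2 has an Apache 2 license, which is fairly permissive, and it comes with a good number of features.

The tool supports text-to-image, image-to-image, image mixing, and inpainting.

Kandinsky 2 is described as a mix between DALL-E 2 and latent diffusion, and text-to-image is the feature the presenter likes most.

There are two sets of models, one for version 2.0 and another for version 2.1.

Version 2.1 is slightly lighter than 2.0; for a 512x512 image it used 8.7 GB of VRAM.

For the best performance and compatibility, Linux with a modern Nvidia GPU is recommended.

Google Colab offers a free way to use Linux with an Nvidia GPU remotely.

If you do not want to install locally or lack the required hardware, Google Colab is a good option.

Anaconda makes it easy to keep Python applications in separate environments.

Kandinsky 2 provides example Jupyter notebooks that help users start writing their own programs.

Version 2.1 has a slightly different interface that uses JupyterLab instead of Jupyter Notebook.

The image post-processing section of the interface lets users adjust settings before saving images.

Kandinsky 2 handles complex prompts well, such as a marble statue of a mouse wearing a plaid skirt and black combat boots.

In the comparison with Stable Diffusion, Kandinsky 2 did better across several styles and subjects.

The Kandinsky 2 image mixer lets users blend two or more images into a new result.

Although some of Kandinsky 2's mixed images may not be to everyone's taste, it offers capable image generation and editing features.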

Transcripts

play00:00

hello and welcome to more Nerdy Rodent

play00:02

geekery you might have noticed that

play00:05

loads of AI stuff seems to be released

play00:08

almost every day now with lots of focus

play00:12

around large language models and image

play00:15

generation

play00:16

Kandinsky 2 falls into the image

play00:20

generation camp and can produce some

play00:23

fairly reasonable images sure it hasn't

play00:26

got all the features of something like

play00:28

AUTOMATIC1111 for stable diffusion but

play00:31

sometimes it's just nice to try other

play00:33

stuff isn't it I mean variety is the

play00:36

spice of life and all that as you may be

play00:39

able to tell from the name this is from

play00:42

Russia so you may not want to go

play00:44

anywhere near it as a supporter of

play00:46

Ukraine I know I was in two minds about

play00:49

even covering this at all but at the end

play00:51

of the day this isn't a politics Channel

play00:54

and we're all just interested in the

play00:56

nerdy stuff right okay the main features

play01:00

include an Apache 2 license which is

play01:03

fairly permissive and it's got a whole

play01:05

bunch of features too if we scroll down

play01:07

here we can see some things we've got

play01:09

text to image we also have image to

play01:12

image there's some image mixing or image

play01:15

fuse as it says there and also some

play01:17

image inpainting as well here we scroll

play01:20

down we've got some text to image we've

play01:22

got an in-painting example there they've

play01:24

got the mask at the bottom and it's

play01:26

filled that in underneath like it says

play01:29

on the GitHub page here this is sort of

play01:31

a mix between DALL-E 2 and latent

play01:34

diffusion I really did get some DALL-E 2

play01:37

vibes when using text to image and that

play01:40

is the feature I like the most out of

play01:42

this repository the other features I

play01:45

personally classify as a bit meh but

play01:48

that is just my personal opinion you may

play01:51

love image mixing and this is indeed a

play01:53

pretty good implementation of it there

play01:56

are actually two sets of models one for

play01:58

version two and a another set for

play02:00

version 2.1 and if you do download

play02:03

everything that totals around 17 gig

play02:06

personally I would ignore 2.0 and just

play02:09

go with 2.1 VRAM-wise you'll want

play02:12

something at least capable of playing

play02:14

video games from 2023 so ideally 10

play02:18

gigabytes of VRAM or more 2.1 is

play02:21

actually slightly lighter than 2.0 and

play02:24

for a 512 by 512 image it used 8.7

play02:28

gigabytes of VRAM for me I have no idea

play02:31

if this will work on a Mac but it does

play02:33

seem unlikely I also don't have an AMD

play02:36

GPU so again I can't say if it will work

play02:39

with one of those either but maybe if

play02:42

you installed the correct version of

play02:43

PyTorch afterwards who knows do let me

play02:46

know as with practically everything AI

play02:49

these days your very best option for

play02:51

performance ease and compatibility is to

play02:54

use Linux along with a modern Nvidia GPU

play02:57

other good operating system options

play02:59

include Windows Subsystem for Linux also

play03:02

known as WSL or if you really love

play03:05

headaches then you could use a native

play03:07

Microsoft Windows installation on the

play03:10

other hand if you don't want to install

play03:12

anything locally or you don't have the

play03:14

required hardware then Google Colab may

play03:17

be the thing for you there's a link

play03:18

there you click that open and there you

play03:21

get your very own Linux plus Nvidia

play03:24

power combo to use remotely for free and

play03:28

it's very very easy to use as you can

play03:30

see there's just a few cells so you

play03:33

click play on the first one click play

play03:35

on the second one click play on the

play03:37

third one and there is where you type

play03:39

your prompt so in this example red cat 4K

play03:42

photo

play03:43

768x768 and there you get rather a nice

play03:47

picture of a red cat

play03:49

as another option there is also a link

play03:51

there to demo which will take you over

play03:53

to this Fusion Brain demo so that's

play03:56

another way that you can start playing

play03:58

with this model without having to

play03:59

install anything if like me you are

play04:02

installing this python app locally then

play04:04

all you really need to do is run that

play04:06

one command there on your computer and

play04:08

you've got it done however I do like to

play04:11

keep all of my python apps in separate

play04:13

environments and thanks to Anaconda that

play04:16

is made really really easy if you like

play04:19

separate environments too but don't want

play04:21

to use Anaconda then Miniconda or venv

play04:24

are great options too basically whatever

play04:27

you feel the most comfortable with using

play04:29

is what you should use we're all about

play04:31

Freedom of Choice here in my case I'm

play04:34

using Anaconda so all I need to do is

play04:37

open my anaconda prompt create a new

play04:40

Anaconda environment and then I can run

play04:43

that single pip install command in my

play04:45

managed environment the first command is

play04:47

conda create I've called mine Kandinsky

play04:50

2 because I'm great at names and I of

play04:53

course have already created it so I'm

play04:55

just going to say no there but you would

play04:57

say yes the next thing to do is activate

play04:59

that environment there it is conda

play05:02

activate Kandinsky 2 and it has changed

play05:05

from base to my new Kandinsky 2

play05:07

environment now I've got a nice new

play05:09

python environment to install things

play05:11

into let's actually install it there it

play05:13

is that one pip install command from the

play05:16

GitHub page just copied and pasted that

play05:18

from there and we'll let that run for a

play05:21

few seconds

play05:22

okay there you go you've got the main

play05:25

thing installed however this does come

play05:27

with a load of Jupyter notebooks as you

play05:30

can see there so what we're going to do

play05:31

as well as that is also install Jupyter

play05:35

Notebook there I've got conda install

play05:37

-c conda-forge notebook ipywidgets

play05:40

and also OpenCV again that will

play05:43

just take a few seconds to install

play05:46

final thing I found it needed on one of

play05:48

the notebooks I can't actually remember

play05:49

which one it was now was protobuf as

play05:52

well so a quick pip install of protobuf

play05:55

version

play05:56

3.20.0 and that will give you everything

play05:58

that you need to run all these notebooks

play06:01

right if we scroll down a little bit

play06:03

here we've got the how to use section

play06:05

like it says there check out our Jupyter

play06:08

notebooks with examples in the notebooks

play06:11

directory it also has some examples

play06:13

there that you can just copy and paste

play06:14

and start creating your own programs

play06:16

coding up stuff in your own environment

play06:19

to match whatever your personal

play06:20

requirements are I on the other hand am

play06:23

super lazy so instead I just started by

play06:26

cloning this repo and then making a copy

play06:29

of that notebooks directory on my

play06:32

computer I keep files organized in a

play06:34

directory structure that I find easy to

play06:36

use but this may not be a structure that

play06:39

you want to use so don't worry you don't

play06:41

need to copy it especially if your name

play06:44

isn't nerdy on your computer you can use

play06:46

whatever directory structure or lack

play06:48

thereof is most suited to you I know

play06:52

it's that whole Freedom of Choice thing

play06:54

again you do you Boo for me I keep my

play06:57

GitHub stuff here in this directory that

play07:00

I created myself and called GitHub it's

play07:03

also where I'm going to run the git

play07:05

command from and I'm pretty sure git

play07:07

comes installed as standard with pretty

play07:09

much every Linux distro nowadays but if

play07:12

for whatever reason you don't have git

play07:14

installed then you can just run conda

play07:17

install git now and that will do it for

play07:19

you next up is to run that git clone

play07:22

command and that will download all the

play07:24

files from the repository I of course

play07:27

have already done that so I won't be

play07:29

running it now

play07:31

and will instead move straight on to the

play07:33

next step which is changing directory

play07:35

into that newly created area so there I

play07:38

am in GitHub Kandinsky 2 now I've

play07:41

already copied it let's just show you

play07:43

the contents there I've already copied

play07:45

the notebooks directory to my notebooks

play07:48

but you can do the same if you want to

play07:50

now I'm going to say upfront I found

play07:52

notebooks a little bit janky and I'm

play07:55

going to show you the ones for 2.0 quite

play07:57

quickly that way you can decide for

play08:00

yourself if you do indeed want to use

play08:02

2.0 at all I find these notebooks a

play08:05

little bit janky so I'm just going to go

play08:06

through them really quickly and only for

play08:09

2.0 this way you can see exactly how

play08:12

they work and also decide for yourself

play08:14

if you want to use 2.0 at all

play08:17

right so first up I'm going to change

play08:19

into my notebooks directory which is

play08:22

just a copy of notebooks I'll show you

play08:24

the contents there I've just got all

play08:26

those notebooks and also a couple of

play08:28

example images to run those notebooks

play08:31

I'm just going to run the Jupyter

play08:33

command with the notebook option and a

play08:36

couple of seconds and there it is it'll

play08:37

start up automatically in my web browser

play08:39

and show me all those notebooks there

play08:42

we'll take a look at the image to image

play08:45

one first

play08:47

one thing to note is that the default

play08:49

cache directory is set to /tmp this

play08:53

is great however it does mean that any

play08:54

files you download will get wiped out

play08:57

when your temp directory gets cleaned

play08:59

thus what I like to do is change the

play09:01

cache there in all of these examples to

play09:03

a permanent storage location so you'll

play09:06

see that here I've got the cache

play09:08

directory and by default it isn't even

play09:11

there but like I say it's set to

play09:13

/tmp so there I have home nerdy GitHub

play09:16

Kandinsky 2 cache and that I have

play09:19

changed in all of these notebooks as

play09:21

with Google Colab you just have a

play09:23

number of cells and you click run but

play09:25

what I can do here is just do Cell and

play09:27

then Run All and that will run all these

play09:30

cells in order as you can see there we

play09:33

should have a photo of a bearded man

play09:36

come out at the end now this is doing

play09:39

image to image and it is image opening

play09:41

that image.png so let's show you that

play09:44

one that is there so that's the image to

play09:46

image it's doing you might be familiar

play09:49

with image to image if you've used

play09:51

stable diffusion at all but it's

play09:53

basically using that image as an input

play09:56

and also merging it with the text prompt

play09:58

all right let's have a look see what it

play10:01

has produced loads and loads of text

play10:03

it's fine you can ignore all that

play10:05

because down at the bottom there we've

play10:07

got a photo of a bearded man so there

play10:10

it's done the strength 0.85 so it's only

play10:13

changed it a little bit number of steps

play10:15

50. this is all very familiar stuff if

play10:17

you have indeed used stable diffusion so

play10:20

yeah image to image done got the photo of

play10:23

a bearded man like I say not my favorite

play10:25

function this nor is anything at 2.0 but

play10:29

there it is working

play10:30

okay moving swiftly on the next one is

play10:34

the inpainting notebook again here I'm

play10:36

just gonna run all the cells and

play10:38

obviously I've updated that cache

play10:40

directory as well in this case instead

play10:43

it's done a bit of inpainting so

play10:44

there's the mask it's basically the

play10:46

bottom bit of that and I've inpainted

play10:48

that with a bronze statue as the

play10:51

original image so it's cut that off

play10:53

change that made him into a bronze

play10:54

statue not too bad I do quite like the

play10:57

inpainting a little bit but still not

play10:59

quite as good as stable diffusion okay

play11:01

so that's 2.0 out of the way those are

play11:04

the two things you can do with those

play11:05

let's take a look at 2.1 now but with a

play11:09

slightly different interface instead you

play11:12

can of course install this in exactly

play11:14

the same environment so here I'm also in

play11:17

Kandinsky 2 but I'm going to git clone

play11:20

this repo there as you can see from my

play11:23

directory listing I have of course

play11:25

already run that command so I can just

play11:28

move straight onto the next step which

play11:31

is changing directory into there there I

play11:33

am change directory into the advanced

play11:36

notebooks now you want to install a few

play11:38

extra things here so once again you will

play11:40

run pip install -r requirements.txt

play11:43

and you'll have to wait a few seconds

play11:45

for that to run through

play11:47

at the end of that certainly for me

play11:49

anyway I also install the font Roboto

play11:53

pip package too because it uses that

play11:56

font later on now this uses Jupyter as

play11:59

well but rather than Jupyter Notebook

play12:01

there we are going to run JupyterLab

play12:04

instead

play12:06

as I did with the example Jupyter

play12:08

notebooks I've also changed my cache

play12:10

directory in here in this case it's in

play12:14

main.py so if you double click that

play12:16

you'll get that file open and there I

play12:19

have set my cache directory

play12:22

now this is an optional step but if like

play12:25

me you get errors with the font if you

play12:27

go into the UI directory then edit

play12:30

the __init__.py file there it is that's this

play12:33

one here and then you scroll down to

play12:35

where it says Roboto regular and there

play12:37

as you can see on the image font line I

play12:40

have instead given the absolute path to

play12:44

that Roboto regular ttf font file

play12:46

another thing you may like to do is like

play12:48

me have this dark theme so up here at

play12:51

the top settings theme it's JupyterLab

play12:55

light by default which absolutely blinds

play12:58

me I can't see using that at all so I

play13:01

use JupyterLab Dark

play13:03

if you do want to get that nice

play13:04

real-time updating GPU widget then you

play13:08

can uncomment that line and install that

play13:10

but personally I've already got one on

play13:12

my screen so I don't need to just run

play13:15

that cell and then eventually the user

play13:18

interface will appear now do note this

play13:21

will download a whole bunch of models to

play13:23

start with if you haven't so the first

play13:25

time this will take a couple of minutes

play13:27

before the interface appears

play13:30

and there it is as you can see now we've

play13:32

got a whole bunch of sliders and things

play13:34

that you can have menus with and you can

play13:36

make the prompts bigger so it's an ever

play13:38

so slightly better interface for playing

play13:41

with this you've also got this image

play13:42

post processing at the top if you expand

play13:45

that and then expand image saving if you

play13:48

want to you can save out all of your

play13:51

images now one problem with this however

play13:53

is that the directory has to exist

play13:55

already it won't create it for you so do

play13:59

make sure that you create that directory

play14:01

first before specifying it in save path
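
A sketch of that workaround: create the folder before pointing the save path at it. The same helper would also suit the permanent cache directory mentioned earlier in place of /tmp. The helper name `ensure_dir` and the example folder names are hypothetical.

```python
from pathlib import Path
import tempfile


def ensure_dir(path):
    """Create `path` (and any missing parents) if needed and return it as a Path."""
    directory = Path(path).expanduser()
    directory.mkdir(parents=True, exist_ok=True)
    return directory


# Demonstrated inside a throwaway folder; use a permanent location in practice.
with tempfile.TemporaryDirectory() as scratch:
    save_path = ensure_dir(Path(scratch) / "kandinsky2" / "outputs")
    print(save_path.is_dir())
```

Calling the helper on a folder that already exists is harmless, so it can run every time before a generation session.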

play14:03

all right so I'm just going to shrink

play14:05

that down for now because I'm absolutely

play14:08

fine with those we'll take off the save

play14:10

because I'm not going to save these and

play14:12

then we've just got the normal interface

play14:14

right I've also got stable diffusion

play14:16

running here as well so I can do a

play14:18

little comparison of this versus stable

play14:20

diffusion 1.5 and there we are hello

play14:23

good old stable diffusion alright so

play14:25

I'll be switching back and forth between

play14:27

these two tabs let's start off with

play14:29

something fairly easy make this prompt

play14:32

box a little bit bigger and there I've

play14:34

got a cyberpunk rodent is wearing a top

play14:36

hat whilst dancing on an oak table

play14:38

inside a South London pub in summer if I

play14:42

do a generate on that one and also on

play14:45

stable diffusion at exactly the same

play14:47

time as well we can see which produces

play14:49

the best image as you can see on this

play14:52

one by default it does create six images

play14:55

so we'll have a few to look at and there

play14:58

we have our wow that's actually really

play15:01

quite good obviously the hands are

play15:04

absolutely crazy but that really is a

play15:07

cyberpunk rodent wearing a top hat

play15:09

dancing on an oak table inside a South

play15:12

London pub that's cool let's have a look

play15:14

at some of the others there he is he's

play15:16

on the table there's some more tables in

play15:17

the background so that they're all quite

play15:20

similar they're a little bit different

play15:22

but they all they all have a certain

play15:24

sort of feel to them they all they're

play15:27

all the same but he's dancing on the

play15:28

table all right let's have a look at

play15:29

stable diffusion and there's stable

play15:31

diffusion and

play15:33

hmm

play15:34

I don't think that's quite as good

play15:37

to be honest

play15:38

I don't think that's quite as good

play15:40

because in all of these the rodent is

play15:42

actually dancing on the table he's got

play15:45

his steampunk hat going on this one he's

play15:47

got the top hat and of course you know

play15:49

the hands are a little bit manky but

play15:51

he's not he's not really on a table all

play15:53

right let's be fair to stable diffusion

play15:55

and we'll give it another couple of goes

play15:57

there maybe I should increase this batch

play15:59

count slightly let's let's pop that up

play16:01

to four and uh but once again you see

play16:03

it's not on the table it's not on the

play16:05

table so I'm going to give that one to

play16:07

Kandinsky purely because the rodent is

play16:10

on the table and that's what I asked for

play16:12

all right let's try again with another

play16:14

prompt let's put this one in this is

play16:16

this is also going to be a fun one so

play16:18

here we have a professional photo of a

play16:21

kitten painting a watercolor image of an

play16:24

open cat food tin on a white canvas

play16:27

supported by an old wooden easel and

play16:31

again I've popped the same thing into

play16:33

stable diffusion as well and see how

play16:36

both of these do let's take take a look

play16:39

at this first one in Kandinsky there it

play16:42

is it's it's okay it's not bad I mean

play16:45

the kitten isn't painting a watercolor

play16:48

image of the cat food tin but it is a

play16:51

painting it's got the canvas it's got

play16:53

the easel it's got the tin not too bad

play16:56

there the kitten itself seems to be the

play16:59

painting so no again no we've got a

play17:02

painting rather than cat food tin and

play17:04

some weird paint brushes at the top I

play17:06

mean they're quite good they are quite

play17:08

good that that one's not bad it's close

play17:11

it's close it's close all right it's

play17:14

they're quite good images but let's have

play17:15

a look at stable diffusion all right

play17:17

we've got four there and oh

play17:21

oh I think that one is probably the

play17:24

closest I mean the kitten isn't actually

play17:26

painting but it does have kind of a cat

play17:28

food tin that's kind of got a cat food

play17:31

tin as well and that's just a human

play17:32

painting a kitten I think I'm gonna give

play17:35

that one to stable diffusion personally

play17:38

although I do quite like some of the

play17:40

textures on that let's scroll up again

play17:43

and try another one this time I've got a

play17:46

moose head emerging from the birthday

play17:48

cake on a metal table in a 1960s style

play17:52

American kitchen okay let's have a look

play17:56

see how Kandinsky did

play17:58

all right that's that's nice and weird

play18:00

we seem to have a bit of a party moose

play18:02

going with a sort of party hat there

play18:04

it's not quite emerging from the cake

play18:06

but it still looks pretty cool oh not

play18:09

that one yeah that's a moose cake that's

play18:12

a moose cake I quite like the the sort

play18:14

of shades and some of the lighting

play18:16

on that as well it's it's got a certain

play18:18

sort of look to it that's that's kind of

play18:21

nice

play18:22

yeah okay anyway

play18:24

um yeah we've got got more moose there I

play18:27

think I think that's done quite well I

play18:28

think that's done quite well all right

play18:30

let's have a look at stable diffusion

play18:32

here is stable diffusion oh I don't know

play18:35

I don't know that one's quite good but

play18:37

it's made the moose and it's and that

play18:40

one's quite realistic and it's got the

play18:42

moose cake that one's pretty good

play18:44

that one is pretty good but it doesn't

play18:46

have that 1960s style American kitchen

play18:50

theme to it that's that's more modern

play18:52

and ah I'm I'm actually going to give

play18:55

that to Kandinsky I think because I like

play18:58

the 1960s style theme it's given to it

play19:01

it's got the moose I think I just

play19:03

prefer the sort of color and the

play19:05

lighting in that it's it's more it's

play19:07

more like what I was thinking about in

play19:09

the in the 1960s style

play19:11

let's try another one to see how well it

play19:14

handles complex prompts we'll pop this

play19:17

one in this time we have a full color

play19:19

highly detailed high resolution

play19:21

professional photo of a marble statue

play19:24

depicting a cute mouse wearing a red and

play19:26

white gingham dress and massive dark

play19:28

polished black leather army boots

play19:31

let's take a look at this do we have any

play19:34

marble statues wearing red and white

play19:36

gingham dresses and black army boots uh

play19:40

yes yes we do that is pretty much

play19:42

exactly what I asked for all right well

play19:44

done Kandinsky I don't think you can do

play19:46

any better than that that's pretty good

play19:48

okay the boots aren't quite so black

play19:50

there but they are there and I think

play19:53

those are amazing it's going to be

play19:55

difficult for stable diffusion to beat

play19:57

that all right let's have a look at

play19:58

stable diffusion

play19:59

oh it's it's close it's close but these

play20:04

are more like stuffed toys and there

play20:06

we've got two I only wanted one it's got

play20:08

the red and white gingham dress and the

play20:10

black army boots but that is not a

play20:13

marble statue I'm afraid I'm really

play20:15

sorry stable diffusion but I'm gonna

play20:17

have to give that to Kandinsky once

play20:20

again let's drop the batch count a

play20:22

little bit here so we don't have to make

play20:23

so many and we'll try doing a few

play20:25

different styles okay so here I'm just

play20:28

going to do a portrait of a woman in the

play20:30

style of a particular painter let's

play20:33

see how well it has handled that style

play20:36

oh that's that's quite nice I quite like

play20:39

that that's it's got quite a sort of

play20:42

painted texture to it it's definitely the

play20:44

style it's got the same colors

play20:46

those are pretty good I do quite like

play20:48

those I quite like those all right let's

play20:50

have a look at stable diffusion oh also

play20:54

done quite well whoops also done quite

play20:56

well but it's

play20:59

I don't know that's tricky actually

play21:02

that is very tricky that's a I think

play21:04

that's a better representation of the

play21:06

style but that has more of a sort of

play21:10

textured

play21:11

feel to it so I don't know I don't know

play21:15

that's tricky that is very tricky I like

play21:17

both of those equally I think they've

play21:19

both done very well

play21:21

that's captured more of the style

play21:23

perhaps accurately and that's done more

play21:25

of the painting so I'm just going to

play21:27

call that even between the two now

play21:29

that's a fairly easy style so let's try

play21:31

something slightly more difficult now I

play21:34

know I often have issues generating

play21:36

watercolor styles so let's try a

play21:38

watercolor art style painting of a moldy

play21:41

sandwich on a park bench let's have a

play21:45

look-see here oh

play21:47

that's not too bad actually that's not

play21:50

too bad it's got the sort of watercolor

play21:53

blotchiness to it oh

play21:56

I do quite like those that's that's the

play21:59

sandwich is the bench so it's it's a

play22:02

little bit uh it's a little bit off on

play22:04

the subject but as far as the style that

play22:06

one's close that's got a sandwich on

play22:08

the bench as far as the style I think

play22:09

that's done very well let's have a look

play22:12

at stable diffusion oh dear I can already

play22:14

see that's not quite as good all right

play22:16

uh no no is the answer to that I don't

play22:19

even really need to look any further

play22:21

that that's not watercolor but it's

play22:24

partly down to the model obviously but

play22:26

uh yes Kandinsky wins clearly there all

play22:29

right let's try another style we'll do

play22:31

that same thing again we'll do the

play22:32

sandwich on the bench but this time in a

play22:35

surreal art style now how surreal is

play22:39

this gonna be oh that's quite nice

play22:43

that's quite nice I I do quite like the

play22:46

colors in Kandinsky there's just

play22:47

something about the the slightly

play22:49

different feeling it's got to that's

play22:51

perfect that's a yeah that's that's a

play22:54

sandwich on a bench that's surreal

play22:56

okay so it seems like surreal also

play22:59

interprets as dripping but I like that

play23:01

one I like that one quite a lot all

play23:03

right let's have a look at stable

play23:04

diffusion is it um no sorry stable

play23:07

diffusion you're once again

play23:08

disappointing I mean it's nice it's got

play23:11

the sandwich on the bench but it's not I

play23:15

don't really class that as surreal no I'm

play23:17

going to give that style to Kandinsky

play23:19

once again

play23:20

let's let's change this up ever so

play23:22

slightly once more we're gonna do a

play23:25

realistic graffiti art style painting of

play23:27

a moldy sandwich but this time it's

play23:29

going to be on a park bench in New York

play23:32

at midnight

play23:34

how well has it done this time for our

play23:36

graffiti art style painting of a moldy

play23:38

sandwich on a park bench in New York at

play23:40

midnight

play23:42

um

play23:43

actually slightly stunned by that

play23:45

because that's brilliant

play23:46

that is really cool

play23:48

all right that's

play23:50

that one's not quite as good that's not

play23:51

quite as good but the style

play23:53

the style and the feeling of it I love

play23:55

those that's fantastic

play23:58

that is fantastic it's that little

play24:00

drippy bits down there oh that's nice

play24:03

that is a really nice style all right

play24:05

let's have a look at stable diffusion

play24:08

okay all right

play24:10

um yeah not as impressed sorry I'm gonna

play24:13

give that realistic graffiti art style

play24:15

at night see these stable diffusion ones

play24:18

aren't even at night that's that's

play24:19

daytime that's that's completely

play24:21

Kandinsky

play24:22

let's let's go for something really

play24:24

really difficult now then this this

play24:27

comes out quite interestingly so anime

play24:29

cartoon style illustration of some

play24:32

anthropomorphic happy fire with cel

play24:35

shading and detailed accurate nerdy and

play24:38

handsome and high quality and amazing

play24:39

which is of course trending on GitHub so

play24:43

I'm hoping for a bit of an anime cartoon

play24:45

style theme to this let's see how these

play24:48

two compare

play24:49

anthropomorphic fire how does it oh

play24:52

that's quite nice that's quite nice it's

play24:54

it's some nerdy anthropomorphic fire

play24:57

yeah that's nice I quite like it got a

play24:59

little bit of a grin going on there some

play25:02

some cheeky nerd fire yeah I like like

play25:04

the cheeky nerd fire all right let's

play25:06

have a look at stable diffusion oh I'm

play25:09

gonna be disappointed again aren't I

play25:10

it's given me loads and loads of faces

play25:12

and then I mean they're kind of fiery a

play25:15

little bit but

play25:17

um that's not really what I wanted okay

play25:19

I'm sorry I'm gonna give that to

play25:21

Kandinsky once again okay so we've done

play25:24

loads of styles now we've done

play25:25

watercolor and impressionist and surreal

play25:27

and realistic and anime let's try

play25:29

something slightly realistic how can we

play25:32

go on the realistic here I've got a high

play25:34

quality high resolution detailed

play25:36

professional digital photo of a Roman

play25:39

soldier he's middle-aged he has dark

play25:42

hair and there is also a spear let's

play25:45

generate some of those and see what

play25:46

happens so the realism test how good is

play25:49

it it's not bad the eyes are a bit wonky

play25:53

but then eyes are a bit wonky in stable

play25:55

diffusion as well it's it's not too bad

play25:58

it got you know some some battle stuff

play26:00

going on there that's that's not too bad

play26:03

it's okay it's okay all right let's have

play26:06

a look at stable diffusion oh we haven't

play26:08

got faces there we've got actual

play26:10

characters so let's let's see there

play26:13

um

play26:15

hmm

play26:16

that's tricky

play26:18

I mean obviously they're a little bit

play26:20

wonky uh that's quite realistic that's

play26:23

quite I mean I prefer those again for

play26:25

the style for the for the aesthetic of

play26:28

the output I prefer Kandinsky the image

play26:31

mixer much the same thing again we'll

play26:34

just run this cell to have the interface

play26:36

appear after a few seconds okay there it

play26:40

is for the image mixer again you've got

play26:41

the same options if you want to do the

play26:43

saving and including generation text at

play26:46

the bottom let's just close that down

play26:48

again so this is mixing images so you

play26:50

can mix one or more images there you can

play26:52

upload one upload another one and keep

play26:54

adding images into the mix by default it

play26:57

has two so let's just use these defaults

play27:00

and we'll select our little soldier guy

play27:02

got a soldier guy there and then let's

play27:05

also select one of those delicious

play27:08

sandwiches so we'll select that upload

play27:11

there pick a delicious sandwich so now

play27:13

we're doing a mix of our Roman soldier

play27:16

with a spear and one of the sandwiches

play27:19

mix two images what's it going to do I

play27:22

don't know let's find out

play27:24

and there you have it a mix between a

play27:27

Roman soldier and a sandwich it's sort

play27:29

of taken the the bench turned that into

play27:32

a chair although that's a bench as well

play27:34

and then it's got the sandwich so it's

play27:36

done quite a good job at mixing but as

play27:38

with many of these mixing programs I

play27:41

don't really like the final output

play27:43

however one thing you may like is this

play27:45

next video
