AI Building in Creative Mode

Emergent Garden
26 May 2024 14:14

Summary

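TLDR This video is a Minecraft episode in which the creator builds even more things with AI in creative mode. It opens with a large pyramid in a desert biome, generated by code that the language model Claude Opus wrote. Several models are then compared on a Roman-style building, a desert castle, redstone circuits, and other builds. GPT-4, which the narrator calls probably the most expensive model on the market and possibly the best, builds a working Nether portal on its first try after the other models failed. The video shows how much potential AI agents have for building in Minecraft, and further exploration is promised.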

Takeaways

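  • 🏰 The video shows AI agents constructing a variety of structures in Minecraft's creative mode.
  • 🤖 The pyramid-building bot is controlled by the language model Claude Opus, which writes code that generates the structure.
  • 🕒 Large builds take a long time: the pyramid took several Minecraft days to complete.
  • 🛠️ The order of block placement matters a great deal for large builds; building each row back and forth in alternating direction is more efficient than starting every row from the same side.
  • 💎 The AI can follow specific requests, such as placing a diamond block on top of the pyramid.
  • 📈 The video compares several AI models: Claude Opus, GPT-4o, Gemini, and Llama 3.
  • 🔍 Llama 3 is an open-source model developed by Meta (formerly Facebook); with suitable hardware you can run it on your own computer.
  • 🏠 Each model chooses from a limited palette of blocks, and the builds show how well it can understand and carry out a complex structure.
  • The models attempt redstone circuits, starting with a simple circuit and gradually increasing the complexity.
  • 💥 Toward the end the video takes a destructive turn: the bots build explosives, although a "Minecraft nuke" is still a long way off.
  • 🌐 The last challenge is building a Nether portal, which is tricky but possible in theory.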

Q & A

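  • What game is played in the video?

    -The video is played in Minecraft.

  • Who is Andy, and what controls it?

    -Andy is the bot that builds the pyramid. It is controlled by the language model Claude Opus.

  • Which language models are used in the video?

    -The pyramid is built by Claude Opus. The video also tests GPT-4o, Gemini, Llama 3, and, at the end, GPT-4.

  • What is built in the video?

    -The main build is a large pyramid in a desert biome. Later challenges include a Roman-style building, a desert castle, redstone contraptions, and a Nether portal.

  • What is distinctive about the language models shown in the video?

    -They write code that generates the structures, rather than placing blocks by hand.

  • How long did the pyramid take to build?

    -It took several Minecraft days.

  • Which build is the most technically impressive?

    -The video does not name one. The narrator says the pyramid is not the most technically impressive build ever but looks awesome, and calls GPT-4's Nether portal impressive.

  • What weaknesses do the language models show?

    -Llama 3 in particular sometimes drops characters and writes a wrong block name in place of "sandstone", which can break the code it writes.

  • How were the bots asked to place the blocks of the pyramid?

    -Row by row, back and forth, reversing direction on each row.

  • Which of the models performs best?

    -Among the models shown, GPT and Claude Opus perform comparatively well.

  • How long was the build expected to take?

    -The video does not mention an expected build time, so no specific figure can be given.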

Outlines

00:00

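🏗️ An AI building project in Minecraft

The video is a Minecraft episode in which AI agents build various structures in creative mode. A large pyramid is under construction in a desert biome, and the language model Claude Opus wrote the code that generates the structure. The build succeeded after several failed attempts, and the full time-lapse is available on the creator's Patreon. For efficiency, the rows are laid back and forth, reversing direction on each row.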
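The back-and-forth row order described above can be sketched in a few lines of Python. This is a minimal illustration, not the code Claude Opus wrote for the video: the function name `pyramid_positions`, the `(x, y, z)` coordinate convention, and the two-block shrink per layer are all assumptions made for the example.

```python
from typing import Iterator, Tuple

Block = Tuple[int, int, int]  # (x, y, z), with y as height (assumed convention)


def pyramid_positions(base: int) -> Iterator[Block]:
    """Yield block positions for a solid square pyramid.

    Each layer is filled row by row, and every other row is walked in
    the opposite direction, so the builder never has to travel back to
    the far side before starting the next row.
    """
    y = 0
    offset = 0   # how far the current layer is inset from the base
    size = base  # side length of the current layer
    while size > 0:
        for row in range(size):
            z = offset + row
            xs = range(offset, offset + size)
            if row % 2 == 1:
                xs = reversed(xs)  # odd rows run right to left
            for x in xs:
                yield (x, y, z)
        y += 1
        offset += 1
        size -= 2  # each layer shrinks by one block on every side


positions = list(pyramid_positions(5))
```

Within a layer, every position is directly adjacent to the one before it, which is the efficiency gain the narrator points out for large builds.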

05:07

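🤖 Meeting the models and the building challenges

The video introduces the models (Claude Opus, GPT-4o, Gemini, and Llama 3), and each attempts its own version of the builds. Each model has its own character, so the style and quality of the results vary. Llama 3 is open source and, by the narrator's estimate, about as good as GPT-3.5, but it behaves erratically. At one point the models appear to cooperate: GPT finishes removing its own scaffolding and goes to help Claude remove theirs.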

10:08

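💥 Redstone circuits and a destructive turn

The models try to build redstone circuits, but the details are finicky and the results are far from perfect. GPT gets the first circuit right on its first try, while the other models struggle more. The video then takes a destructive turn with explosives. Finally the models tackle the hard challenge of building a Nether portal, and not all of them succeed. GPT-4 produces a surprising result: it reaches the Nether, then immediately gets confused and leaves.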

Keywords

💡Minecraft

Minecraft is a sandbox video game in which players place blocks to build and explore their own worlds. In this video, AI agents use Minecraft's creative mode to construct buildings.

💡AI

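AI is short for artificial intelligence: technology that can learn, make decisions, and act in ways that resemble human behavior. In the video, AI models carry out the building in Minecraft.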

💡Creative mode

Creative mode in Minecraft lets you build with unlimited resources. In the video, the AI uses those unlimited resources to build a pyramid.

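💡Shaders

In computer graphics, shaders are programs that control how surfaces are lit and rendered, which can make a scene look more realistic. Shaders are turned on in the video to enhance Minecraft's visuals.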

💡Desert biome

Minecraft has many biomes, and the desert biome is one of them, representing a desert-like environment. The AI builds in a desert biome in the video.

💡Code

Code is a set of instructions that a computer executes. The video shows the AI generating code that builds structures in Minecraft.

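💡Parameters

Parameters are the learned values inside a model, and their number is one rough indicator of a model's capacity. The video mentions parameter counts, such as the 70-billion-parameter version of Llama 3.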

💡Redstone

Redstone is essentially the electrical system in Minecraft. In the video, the AI models are challenged to build redstone contraptions and operate them.
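The third redstone challenge in the video is an OR gate: the lamp is on if the left lever or the right lever is on, or both. The intended behavior can be written down as a truth table. The sketch below models only the logic, not any redstone wiring, and the names `or_gate_lamp` and `truth_table` are hypothetical.

```python
from typing import List, Tuple


def or_gate_lamp(left_lever: bool, right_lever: bool) -> bool:
    """Return True when the lamp should be lit for the OR-gate challenge."""
    return left_lever or right_lever


def truth_table() -> List[Tuple[bool, bool, bool]]:
    """List (left, right, lamp) for every combination of lever states."""
    return [
        (left, right, or_gate_lamp(left, right))
        for left in (False, True)
        for right in (False, True)
    ]


table = truth_table()
```

A build passes the challenge only if the lamp matches this table for all four lever combinations.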

💡Nether portal

A Nether portal is the gateway to the Nether, another dimension in Minecraft. In the video, the AI models try to build a Nether portal and travel through it.
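The portal rule stated in the video is an obsidian outline whose inner opening is at least two blocks wide and three blocks high, lit with fire at its base. It can be sketched as a small coordinate generator. This is an illustrative sketch, not code any model wrote in the video: `portal_frame` and the flat `(x, y)` layout are assumptions, and the corner blocks are included for simplicity.

```python
from typing import List, Tuple

MIN_INNER_WIDTH = 2   # from the video: inner width of at least two
MIN_INNER_HEIGHT = 3  # from the video: inner height of at least three


def portal_frame(inner_width: int = 2, inner_height: int = 3) -> List[Tuple[int, int]]:
    """Return (x, y) obsidian positions for an upright portal frame.

    x runs along the ground and y is height, so the frame stands
    vertically rather than lying on its side.
    """
    if inner_width < MIN_INNER_WIDTH or inner_height < MIN_INNER_HEIGHT:
        raise ValueError("inner opening must be at least 2 wide and 3 high")
    outer_w = inner_width + 2
    outer_h = inner_height + 2
    frame = []
    for y in range(outer_h):
        for x in range(outer_w):
            # Keep only the border cells; the interior stays empty.
            if x in (0, outer_w - 1) or y in (0, outer_h - 1):
                frame.append((x, y))
    return frame


frame = portal_frame()
```

With the default arguments this yields the 14 obsidian positions of a 4 by 5 frame. The fire goes in an inner cell on the bottom row, for example `(1, 1)`.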

💡GPT

GPT stands for Generative Pre-trained Transformer, a family of AI models used in natural language processing. In the video, GPT models take on the Minecraft building challenges.

Highlights

Building in creative mode with AI in Minecraft using shaders and a desert biome.

Construction of a large pyramid by a bot controlled by Claude Opus, a language model.

Efficient row construction method by reversing order for each row.

Claude Opus' ability to follow specific construction requests.

Introduction of the models: Claude Opus, GPT-4o, Gemini, and Llama 3.

Llama 3 is an open-source model from Meta with 70 billion parameters.

Replicate API used for running Llama 3, though not free.

Llama 3's issues with character omissions and incorrect block names.

Bots' ability to place blocks through other blocks without a clear line of sight.

GPT-4 Omni, OpenAI's new text model, is fast, cheap, and has a casual personality.

Building a Roman-style building with columns as a tricky challenge.

GPT and Claude's close attempts at building a redstone contraption.

Llama 3's failure to build the redstone contraption correctly.

Challenge of building a desert castle with specific features.

GPT's successful construction of a desert castle with towers and torches.

Claude's creative castle with torches but lacking doors and staircases.

Llama 3's weak performance, even with a simplified prompt.

Exploration of redstone contraptions and electrical systems in Minecraft.

GPT's first-try success on the single-lamp redstone challenge.

GPT-4's Nether portal, which looked wrong at first but worked on its first try.

The potential and future exploration of AI in Minecraft construction.

Transcripts

play00:00

hello welcome to another episode of

play00:03

Minecraft today we'll be building even

play00:05

more stuff with AI in creative mode you

play00:08

can see I've got some fancy shaders on

play00:10

we're now in a desert biome and we have

play00:12

a little construction project going on

play00:14

here this is Andy and what better thing

play00:17

to build with infinite resources than a

play00:19

big ass

play00:21

pyramid Andy is controlled by Claude

play00:23

Opus a language model that wrote this

play00:25

piece of code to generate the structure

play00:30

there were some earlier failed attempts

play00:32

but this one turned out pretty good and

play00:34

took several Minecraft days to build the

play00:36

full time-lapse of this build is on my

play00:39

patreon also notice that they're

play00:41

building each row back and forth in

play00:43

reversing order which is much more

play00:45

efficient than having to start all the

play00:46

way over on the same side for each row

play00:49

the order of block placement matters

play00:51

especially for big builds like this and

play00:53

Claude Opus was able to follow my

play00:55

request to build in this way

play01:00

and it placed a diamond block on top

play01:02

ain't that

play01:03

pretty uh this isn't the most

play01:05

technically impressive build ever but I

play01:07

think it looks

play01:11

[Music]

play01:13

awesome okay let's meet our models for

play01:16

this video of course we have Claude Opus

play01:19

a new model for GPT and the same Gemini

play01:21

as before and then there's the new guy

play01:23

Llama. Llama 3 is the open source model

play01:26

from Facebook, I mean Meta, and this is

play01:29

the biggest, 70 billion parameter model

play01:31

and because it's open source you can run

play01:33

this model or the smaller model on your

play01:35

own computer if you have the

play01:38

hardware hey

play01:40

Gemini this instance of llama is running

play01:43

through the replicate API which is not

play01:45

free but much more convenient for me a

play01:49

llama is okay I think it's about as good

play01:51

as GPT 3.5 this house took a couple

play01:54

tries and it's very simple llama is also

play01:57

buggy if that's the word for it like it

play01:59

will sometimes just leave out characters

play02:01

and write and stone instead of sandstone

play02:03

which can really ruin the code that it

play02:05

writes also yes the Bots can place

play02:07

blocks through other blocks they don't

play02:09

need a clear line of sight this is

play02:11

another way that Mineflayer cheats they

play02:14

do need to be close enough though okay

play02:16

so even though we're in creative mode

play02:18

I'm still giving them a palette of blocks

play02:20

to choose from everything you see in

play02:21

this top section rather than the

play02:23

hundreds of available blocks in the game

play02:25

this keeps them from getting confused

play02:26

and doesn't eat up their context space

play02:29

so let's get started with GPT-4 Omni

play02:33

no it's not their weird flirtatious

play02:34

Voice Chat thing that totally isn't

play02:36

Scarlett Johansson well hello there cutie

play02:40

this is just their new text model which

play02:42

is super fast and cheap but supposedly

play02:44

about as good as GPT 4 and for the

play02:46

record I like its personality it's a

play02:48

little more casual less wordy it uses

play02:50

emojis I like it let's start with

play02:52

something tricky a Roman style building

play02:55

with columns

play03:15

that's pretty good it's missing a roof

play03:17

but that's fine and yes the Bots are

play03:19

still using scaffolding I have not added

play03:21

the flying ability yet and I actually

play03:24

think the scaffolding problem is a

play03:25

really interesting one to solve a decent

play03:28

solution is to just use a unique block

play03:30

like dirt in a desert and then simply

play03:32

have the agent remove all dirt when it's

play03:34

done building that works pretty good but

play03:37

not

play03:39

perfectly okay now

play03:53

Claude a little simple but they are

play03:55

technically columns good job and now for

play03:58

llama llama was a little more

play04:08

ambitious and it doesn't space out the

play04:10

columns so it ends up building a giant

play04:14

[Music]

play04:15

Cube and then it stopped right in the

play04:17

middle of the build because I forgot to

play04:19

turn off this 10-minute timer that keeps

play04:21

code from running indefinitely I'll turn

play04:23

that off for the rest of the video but I

play04:24

think we can see where it was going not

play04:26

that impressive let's move on to Gemini

play04:29

I promise I'll be nicer to Gemini but

play04:31

this is the same model as before Gemini

play04:33

1.0 so it can't build anything and it

play04:36

can hardly follow instructions does

play04:38

anyone know how to get API access to

play04:40

Gemini 1.5 not through their AI Studio

play04:44

through an API key I think I need to be

play04:46

whitelisted by someone at Google so if

play04:49

you are someone who can get me access

play04:51

send me a DM on Twitter or Discord I'd

play04:53

love to give Gemini a fair Shake but

play04:55

Google does not make it easy for now

play04:57

Gemini is on cactus collecting duty

play05:06

[Music]

play05:09

all right let's be more ambitious I want

play05:11

a desert castle with walls doors torches

play05:15

towers crenellations and staircases I'm

play05:18

giving each model a few tries here this

play05:20

is a big one

play05:23

[Music]

play05:37

[Music]

play05:44

[Music]

play05:52

[Music]

play05:59

it started building a second castle

play06:01

overlapping with the first one so I

play06:02

stopped it but I'd say it did pretty

play06:04

good it's Grand if a bit plain there are

play06:08

Towers a couple torches but no doors no

play06:10

crenellations no staircases and a lot of

play06:13

scaffolding which I had it remove here's

play06:16

a question how do you remove scaffolding

play06:18

that requires scaffolding to get to

play06:21

right now it builds up to each block

play06:23

then removes what it just built only to

play06:25

build it again for the next block it's

play06:27

very inefficient but it will eventually

play06:29

so they remove all

play06:31

dirt while it's doing that let's give

play06:33

Claude a shot

play06:42

[Music]

play06:55

[Music]

play07:13

[Music]

play07:23

[Applause]

play07:25

[Music]

play07:27

look GPT finished removing its scaffolding

play07:29

and came to help Claude remove theirs

play07:32

that is really cute I really like

play07:34

Claude's castle the torches are a nice

play07:36

touch the towers are lopsided and again

play07:39

no doors or staircases but still very

play07:41

good

play07:43

job and here's GPT's castle without

play07:45

scaffolding yes there are still a few

play07:47

Sandstone scaffolding blocks that I

play07:49

think it placed by

play07:51

[Music]

play07:53

accident now I tried getting the same

play07:55

prompt to work with llama but it crashed

play07:57

about 15 times it kept writing a broken

play07:59

code block so I had to simplify the

play08:01

prompt and this is what it gave

play08:04

me it's just another box and it took

play08:06

forever to

play08:21

[Music]

play08:26

build well okay it did end up adding

play08:28

Towers so there's that it's okay

play08:32

definitely the weakest of the three it

play08:34

did make this monstrosity earlier when I

play08:36

wasn't recording which I kind of love a

play08:39

box floating over a layer of torches and

play08:41

this is the closest we've got to actual

play08:43

crenellations now onto a new kind of

play08:46

challenge redstone contraptions if you

play08:48

don't know Redstone is basically the

play08:50

electrical system in Minecraft and it

play08:52

can get complicated we'll try these

play08:54

three simple builds a redstone lamp

play08:57

controlled by a lever five lamps

play08:59

controlled by the same lever

play09:00

which requires some redstone dust or

play09:02

clever positioning and an OR gate where

play09:05

the lamp is only on if the right lever

play09:07

or the left lever is on or both all

play09:10

pretty simple but nuanced don't expect

play09:12

too much from these

play09:18

models right off the bat GPT does it

play09:20

perfectly on the first try and even

play09:22

turns it

play09:25

on eventually Claude got it too and

play09:28

happened to build almost exactly the

play09:29

same thing GPT

play09:32

built and llama uh well never quite

play09:36

figured it

play09:42

out a little confused but it's got the

play09:46

spirit for five lamps controlled by one

play09:49

lever GPT for some reason runs over here

play09:52

to build it but almost gets

play09:54

it the Redstone needs to be one block up

play09:57

and it would have worked see Redstone is

play09:59

kind of tricky llama didn't place a

play10:01

lever or Redstone so it failed and

play10:03

Claude didn't quite get it

play10:05

either I tried to tell them more

play10:08

specifically how to build the lamps to

play10:09

put the redstone on top and both GPT and

play10:12

Claude got really close if it had just

play10:15

placed the lever on the side it would have

play10:17

worked so

play10:24

[Music]

play10:26

close for the OR gate GPT wasn't able to

play10:29

wire up the

play10:32

[Music]

play10:33

Redstone llama actually got really close

play10:36

it just needed to put the lamp in the

play10:39

center and Claude technically got it if

play10:42

we count the Redstone wire as the

play10:46

output eventually I got a little tired

play10:48

of these finicky Contraptions and

play10:50

decided to take things in a more

play10:52

destructive Direction

play11:00

Claude even built a fuse for its

play11:09

bomb GPT definitely had some fun blowing

play11:12

stuff

play11:20

up pretty impressive but we have a long

play11:23

way to go before we build the Minecraft

play11:26

nuke

play11:32

[Music]

play11:38

okay one last challenge the nether

play11:41

portal it needs to be a square outline

play11:43

of obsidian blocks with fire lit at its

play11:45

base to activate it this is tricky but

play11:48

in theory possible hey Gemini good

play11:54

try GPT got close but it used the wrong

play11:57

Dimensions it needs at least an inner

play11:59

width of two and height

play12:01

three Claude keeps stopping early for

play12:03

some reason and then builds it sideways

play12:06

which is again the wrong

play12:12

Dimensions eventually it did get the

play12:15

right shape but failed to activate it by

play12:17

[Music]

play12:22

itself after multiple tries and hints

play12:25

all models failed to produce the nether

play12:27

portal even Gemini

play12:30

as a last-ditch effort I tried out GPT

play12:32

4 not four Omni not four turbo just four

play12:36

this is I think the most expensive model

play12:38

on the market right now but possibly the

play12:40

best and it's totally wrong I don't even

play12:43

know what it's doing why is it oh I

play12:47

think I see what it's doing it might be

play12:49

the right shape I wonder if

play12:55

[Music]

play13:02

[Music]

play13:10

this is the first time an agent has been

play13:12

to the nether I've never brought them

play13:14

here so it's pretty incredible that it

play13:15

got here by itself sort of and that was

play13:18

GPT 4's first try it did immediately get

play13:22

confused and accidentally left as soon

play13:23

as it arrived but whatever it's still

play13:26

impressive I'm lucky I recorded it

play13:31

I think you'll agree that there is

play13:32

enormous potential here and I am not

play13:35

done exploring all the things you can

play13:36

build and do with these

play13:39

agents I'll see you later

play13:46

[Music]

play13:55

[Music]

