OpenAI o1 VS Sonnet 3.5 in Coding Physics Games - AI Showdown

Eduards "Do It Yourself" Lab
13 Sept 202419:50

Summary

TLDRIn diesem Video vergleicht der Sprecher zwei künstliche Intelligenz-Modelle, das Sonet 35 und das neue OpenAI Model, indem er sie dazu herausfordert, einen Auto-Parkplatz-Simulator mit Physik zu entwickeln. Während das Sonet 35 bei früheren Versuchen scheiterte, gelingt es dem OpenAI Modell in einem einzigen Versuch, was seine überlegene Leistung zeigt. Der Sprecher demonstriert, wie das OpenAI Modell auch komplexere Aufgaben wie die Entwicklung eines 3D-Parkplatz-Simulators annimmt, wobei es einige Herausforderungen und Fehler macht, die auf seine Grenzen hindeuten.

Takeaways

  • 😀 Der Videoinhalt dreht sich um die Gegenüberstellung zweier KI-Modelle, um deren Leistung bei der Entwicklung eines Parkplatzsimulators mit Physik zu testen.
  • 🤖 Das Modell Sonet 35 aus Anthropic wird mit dem neuen OpenAI-Modell O1 verglichen, wobei O1 in zwei Versionen existiert: O1 Preview (größer, langsamer, teurer, besser) und O1 Mini (schneller, kleiner, günstiger).
  • 🚗 Die Herausforderung besteht darin, ein Spiel im GTA-Stil mit realistischen Physik- und Radverhalten zu entwickeln, was Sonet 35 bisher nicht schaffte.
  • 📊 OpenAI-Modelle zeigen eine signifikante Leistungssteigerung in mathematischen Problemlösungen im Vergleich zu früheren Modellen, wie GP4.
  • 💡 O1-Modelle sind speziell darauf trainiert, länger zu denken und ihre Überlegungen vor dem Lösen von Problemen zu verbergen, um dann eine Zusammenfassung zu geben.
  • 🔧 O1 Preview schaffte es, den Parkplatzsimulator in einem einzigen Versuch zu entwickeln, was Sonet 35 nicht konnte.
  • 🎯 Durch die Iteration mit dem Code von O1 Preview konnte WebSim das Spiel weiter verbessern, indem es zusätzliche Funktionen wie Parkplätze, eine Geschwindigkeitsanzeige und eine Punktzahl hinzufügen konnte.
  • 🛠 Die Anforderungen für die KI-Modelle sind hoch, da sie nicht nur Code schreiben, sondern auch verstehen müssen, wie externe Bibliotheken und Physik-Engines funktionieren.
  • 🔄 Es zeigt sich, dass selbst hoch entwickelte KI-Modelle wie O1 Preview nicht alle Aufgaben perfekt lösen können und es zu Iterationen und Fehlern kommt.
  • 🔮 Die Zukunft der KI-Entwicklung scheint darauf ausgerichtet, intelligentere Modelle für schwierige Probleme zu nutzen und dann effizientere Modelle für weitere Verfeinerungen einzusetzen.

Q & A

  • Welche beiden Modelle werden im Video verglichen?

    -Im Video werden das Sonet 35 und das neue OpenAI Model verglichen.

  • Was ist das Hauptziel des Vergleichs zwischen den Modellen?

    -Das Hauptziel ist zu sehen, wie gut beide Modelle einen Parkplatzsimulator mit Physik schreiben können.

  • Warum wurde das Sonet 35 Model vorher nicht erfolgreich bei der Entwicklung eines Parkplatzsimulators?

    -Das Sonet 35 Model scheiterte wiederholt, weil es die komplexen Anforderungen des Simulators, wie realistische Physik und Raddrehungen, nicht korrekt implementieren konnte.

  • Was ist ein Beispiel für die verbesserte Leistung des OpenAI Model 01?

    -Das OpenAI Model 01 konnte 83% der mathematischen Probleme korrekt lösen, im Gegensatz zum vorherigen Modell, das nur 13% richtig beantworten konnte.

  • Wie unterscheidet sich das OpenAI Model 01 von früheren Modellen?

    -Das OpenAI Model 01 wurde speziell trainiert, um länger zu denken, uncensored, und dann seine Überlegungen zu summarieren und sie dem Benutzer zu zeigen, bevor es die Probleme löst.

  • Welche Einschränkungen gibt es bei der Verwendung des OpenAI Model 01?

    -Es gibt eine begrenzte Anzahl an Aufrufen pro Woche, nämlich 30 für das größere Modell (01 preview) und 50 für das kleinere Modell (01 mini).

  • Was versucht der Uploader mit dem OpenAI Model 01 zu erreichen, das er mit Sonet 35 nicht konnte?

    -Der Uploader versucht, mit dem OpenAI Model 01 einen Parkplatzsimulator zu entwickeln, der realistische Physik und Raddrehungen umsetzt, was er mit Sonet 35 nicht schaffen konnte.

  • Wie wurde die Leistung des OpenAI Model 01 in der Entwicklung des Parkplatzsimulators bewertet?

    -Das OpenAI Model 01 schaffte es, einen funktionierenden Parkplatzsimulator in einem einzigen Versuch zu entwickeln, was eindrucksvoll ist, da es ohne Fehler und ohne Korrekturen funktionierte.

  • Was zeigte der Uploader, indem er das Ergebnis des OpenAI Model 01 an das WebSim Model weitergab?

    -Der Uploader zeigte, dass das Ergebnis eines intelligenteren Modells (OpenAI Model 01) von einem weniger leistungsfähigen Modell (WebSim) korrekt verwendet und verbessert werden kann.

  • Was versucht der Uploader als nächstes, um die Grenzen des OpenAI Model 01 zu testen?

    -Der Uploader versucht, das OpenAI Model 01 zu einem noch schwierigeren Test herauszufordern, indem er es auffordert, einen 3D-Parkplatzsimulator mit realistischer Physik zu entwickeln.

Outlines

00:00

🚀 Vergleich von AI-Modellen für ein Parksimulationsspiel

Dieser Absatz stellt einen Vergleich zwischen dem AI-Modell Sonet 3.5 und einem neuen OpenAI-Modell vor, um zu sehen, wie gut sie ein Parksimulationsspiel mit Physik entwickeln können. Der Sprecher beschreibt, wie er bereits mit Sonet 3.5 experimentiert hat, das jedoch wiederholt scheiterte. Das OpenAI-Modell wird als ein reasoning Model vorgestellt, das in zwei Versionen verfügbar ist: '01 preview', das größer und langsamer ist, und '01 mini', das schneller und kostengünstiger ist. Die Tests zeigen, dass das OpenAI-Modell in der Lage ist, das Spiel von Anfang an korrekt zu entwickeln, während Sonet 3.5 immer wieder scheitert.

05:01

🔍 Herausforderungen bei der Entwicklung eines Parksimulationsspiels

In diesem Absatz wird erläutert, warum die Entwicklung eines Parksimulationsspiels mit Physik so schwierig ist. Es wird darauf hingewiesen, dass selbst kleine Fehler in der Physikimplementierung das Spiel unbrauchbar machen können. Der Sprecher beschreibt, wie er versucht hat, das Modell dazu zu bringen, ein solches Spiel zu entwickeln, und wie dies mit Sonet 3.5 nicht funktionierte. Es wird auch erwähnt, dass das OpenAI-Modell in der Lage war, ein Spiel zu entwickeln, das den Anforderungen entspricht, aber noch nicht perfekt ist.

10:02

🚗 Verbesserung des Spiels mit weiterführenden Anpassungen

Der Sprecher zeigt, wie er das von OpenAI entwickelte Spiel weiter verbessern lässt, indem er das weniger leistungsfähige Modell WebSim dazu verwendet, das Spiel zu erweitern und zu verbessern. Es werden Features wie ein Tachometer, Parkplatzzeichen, eine Punktzahl und Gebäude hinzugefügt. Es wird gezeigt, wie durch die Iterationen und Anpassungen das Spiel immer besser wird, und wie die verschiedenen Modelle zusammenarbeiten können, um ein besseres Endprodukt zu erstellen.

15:03

🛠 Test und Iteration mit dem GP4-Modell

In diesem letzten Absatz wird eine Live-Test-Situation mit dem GP4-Modell beschrieben. Der Sprecher will ein 3D-Parksimulationsspiel entwickeln und stellt hohe Anforderungen an die Realismus und Interaktivität des Spiels. Es wird gezeigt, wie das Modell versucht, die Anforderungen zu erfüllen, aber auch, wie es bei der Umsetzung in die Praxis Schwierigkeiten hat. Es wird betont, dass, obwohl das Modell schneller und fähig ist, komplexere Aufgaben zu bewältigen, es immer noch nicht auf der Stufe eines erfahrenen menschlichen Entwicklers ist und weiterhin an einigen Herausforderungen scheitert.

Mindmap

Keywords

💡Sonet 35

Sonet 35 ist ein Modell von Anthropic, das in der künstlichen Intelligenz für die Entwicklung von Programmen und Anwendungen eingesetzt wird. Im Video wird es mit dem neuen OpenAI-Modell verglichen, um zu sehen, wie gut es in der Entwicklung eines Parkplatz-Simulators mit Physik ist. Es scheitert jedoch wiederholt, was die Herausforderungen bei der Entwicklung solcher Anwendungen verdeutlicht.

💡OpenAI-Modell

Das OpenAI-Modell, speziell das AI1-Modell, wird im Video als eine der beiden Modelle vorgestellt, die in einem Vergleichsexperiment verwendet werden. Es besteht aus zwei Versionen: '01 preview', das größer, langsamer und teurer ist, und '01 mini', das schneller, kleiner und kostengünstiger ist. Das Modell wird für seine Fähigkeit gelobt, komplexere Probleme zu lösen, wie im Beispiel der mathematischen Probleme aus der Internationalen Mathematik-Olympiade.

💡Parkplatz-Simulator

Ein Parkplatz-Simulator ist ein virtuelles oder computerbasiertes Trainingswerkzeug, das verwendet wird, um das Parken von Fahrzeugen zu simulieren. Im Video wird der Simulator als Herausforderung für die künstliche Intelligenz modelliert, um zu zeigen, wie gut die verschiedenen Modelle in der Programmierung von physikalisch realistischem Verhalten sind.

💡Physik-Engine

Ein Physik-Engine ist ein Software-Tool, das verwendet wird, um die Gesetze der Physik in Videospielen oder Simulationen zu simulieren. Im Video wird es erwähnt, dass der Simulator realistische Physik haben sollte, was auf die Komplexität der Aufgabe hinweist, die die künstliche Intelligenz bewältigen muss, um das richtige Verhalten der Fahrzeuge zu programmieren.

💡Rechtschreibprüfung

Die Rechtschreibprüfung ist ein Prozess, bei dem die Schreibweise von Wörtern überprüft wird, um sicherzustellen, dass sie korrekt sind. Im Kontext des Videos wird dies als Teil der Herausforderung angesehen, die von der künstlichen Intelligenz bei der Codegenerierung bewältigt werden muss, um sicherzustellen, dass die Syntax und die verwendeten Befehle korrekt sind.

💡Code-Generierung

Die Code-Generierung ist der Prozess, bei dem eine künstliche Intelligenz oder ein Programm Code schreibt, um eine bestimmte Aufgabe auszuführen. Im Video wird dies als zentrale Fähigkeit der künstlichen Intelligenz modelliert, die in der Lage sein muss, den Parkplatz-Simulator korrekt zu programmieren.

💡Canvas

Canvas ist ein HTML-Element, das verwendet wird, um Grafiken und Animationen zu erstellen. Im Video wird es erwähnt, dass es für die Entwicklung des Parkplatz-Simulators verwendet wird, um die Darstellung des Fahrzeugs und seiner Bewegungen auf dem Bildschirm zu ermöglichen.

💡JavaScript

JavaScript ist eine Programmiersprache, die häufig für die Entwicklung interaktiver Funktionen in Websites verwendet wird. Im Video wird JavaScript als eine der Technologien genannt, die für die Entwicklung des Parkplatz-Simulators verwendet wird, um die Logik und die Interaktionen des Spiels zu programmieren.

💡CSS

CSS steht für Cascading Style Sheets und wird verwendet, um das Erscheinungsbild von Websites zu gestalten. Im Video wird CSS erwähnt, um die visuellen Aspekte des Parkplatz-Simulators zu beschreiben, wie zum Beispiel die Darstellung der Fahrzeuge und der Umgebung.

💡Interaktivität

Interaktivität bezieht sich auf die Fähigkeit, mit einem System oder einer Anwendung zu kommunizieren und diese zu steuern. Im Video ist Interaktivität ein wichtiger Aspekt des Parkplatz-Simulators, da es dem Benutzer ermöglicht, das Fahrzeug zu steuern und die Auswirkungen seiner Steuerung auf das Parken zu sehen.

Highlights

Comparing Sonet 3.5 and OpenAI's new model for coding a car parking simulator with physics.

Sonet 3.5's repeated failures in creating a physics-based car parking simulator.

Introduction of OpenAI's new reasoning model, featuring two versions: '01 preview' and '01 mini'.

Limited access to the new models with 30 and 50 calls per week respectively.

The new model's ability to solve 83% of mathematical problems in the International Mathematics Olympiad, a 600% improvement over previous models.

The model's training to think longer and uncensored, then summarize its thinking for problem-solving.

Testing the new model by asking it to build an educational parking game with realistic physics.

CLA's initial failure to create the game, despite understanding the prompt.

Web's attempt with the same prompt, resulting in errors and eventual partial success.

The complexity of integrating physics engines for top-down games and the model's challenges in handling it.

OpenAI's '01 preview' model's successful creation of a basic, working car parking simulator in one attempt.

The impressive single-shot success of '01 preview' compared to Sonet 3.5's repeated failures.

Demonstration of using the output from '01 preview' as a starting point for further improvements with a less capable model.

The potential of using smarter models for complex tasks and cheaper, faster models for routine improvements.

Live testing with the GP4 model to push its limits by asking for a 3D parking game with physics.

The model's struggle with the complexity of creating a 3D game, showing it's not yet at human-level development.

The model's iterative process in attempting to correct the wheel positioning in the 3D game.

Final thoughts on the model's capabilities, its room for improvement, and future potential.

Transcripts

play00:01

hello today in this video we're going to

play00:04

pit two models one against the other to

play00:07

see how they perform one model is Sonet

play00:10

35 that you can use in Claude or in

play00:12

websi another model is a new open ai1

play00:16

model and the way we're going to do it

play00:19

is by asking them to write a car parking

play00:22

simulator with

play00:24

physics and this is something that Sonet

play00:26

35 repeatedly failed to do for me before

play00:30

so it's going to be interesting why this

play00:32

going to be interesting well o one is

play00:35

new open AI reasoning model it's

play00:38

actually two models in chpt you will see

play00:41

01 preview which is larger slower more

play00:44

expensive better model and there is 01

play00:46

mini which is faster smaller cheaper you

play00:49

will only get 30 calls to o1 preview per

play00:52

week and 50 calls to One Mini per week

play00:56

so kind of limited for now now what does

play01:00

mean that it's a reasoning model well

play01:02

one example Opia gives in their blog is

play01:05

this number their previous best model

play01:07

gp4 could only solve

play01:10

13% of mathematical problems correctly

play01:13

in international mathematics Olympiad

play01:15

while 01 can score 83 This is 6X

play01:19

Improvement not 100x Improvement it's

play01:22

still 600% Improvement so not as high as

play01:25

hype but pretty high and very very

play01:28

impressive how does it do it well it was

play01:31

specifically trained to think for longer

play01:34

uncensored in a way that open a hides so

play01:37

it thinks but you do not see it then it

play01:40

summarizes and sensors its thinking

play01:42

shows it to you and proceeds to solve

play01:45

your problem and answer your questions

play01:47

this together allowed it this kind of 6X

play01:51

Improvement which is again very

play01:53

impressive but I wanted to test it for

play01:55

myself how can I myself build this

play01:59

Improvement

play02:00

and what is the easiest way to do this

play02:02

to answer like is it better the easiest

play02:05

way is by comparison so I want to

play02:08

compare it today to a previous best

play02:10

model I know of which is Sonet 35 from

play02:12

anthropic in Cloe and webin now however

play02:15

good the model is however impressive it

play02:17

is it did very impressive things for me

play02:19

before surprised me I asked it once to

play02:22

write for me educational game of how to

play02:24

park top down game with physics with

play02:27

rotating Wheels with acceleration with

play02:29

brakes

play02:30

and with Trails for car wheels to show

play02:33

where everything is when you drive kind

play02:36

of a visualization for you to learn how

play02:38

to park correctly and this is what I

play02:40

asked before I asked it to ride this

play02:42

kind of game and it was failing time and

play02:44

time and time again well today we will

play02:47

try and see what 01 can do for this

play02:49

problem so this is a fight and this is

play02:52

round one let's

play02:55

go okay so here is a chat I had with CLA

play03:00

I copy pasted my prompt it lost the

play03:02

formatting I'm asking here for HTML CSS

play03:06

and JavaScript game in GTA top down

play03:09

style it should be parking game car

play03:12

should have four wheels they should

play03:13

rotate they should be realistic physics

play03:16

and friction and it should be parking

play03:19

game it leaves Trails uh behind it

play03:22

Wheels like Trails of Wheels okay not

play03:25

super prompt there just something I

play03:27

brought

play03:28

quickly here is what LO understood and

play03:31

try to do so it said yes I will create

play03:33

for you this kind of HTML CSS JavaScript

play03:35

game with rotating Wheels realistic

play03:38

physics and whe

play03:39

trails and WR like everything else I can

play03:42

see that it understood correctly what I

play03:44

wanted and it wrote

play03:46

this a gray Square nothing happens you

play03:49

cannot write nothing happens no errors

play03:52

nothing so it fails blot failed from one

play03:55

shot we can take a look at code as far

play03:59

as code goes it didn't use any kind of

play04:00

third party libraries there is some CSS

play04:02

and there is Javascript it uses canvas

play04:05

it wrote All on its own it just not

play04:08

working considering there were not even

play04:10

errors it's even hard to say what's

play04:13

wrong okay next let's see what happened

play04:15

in web it uses the same model we can see

play04:18

that I have here the same prompt just

play04:22

formatted here with lines and from first

play04:25

try also failed it showed an error and

play04:27

when I've tried to fix the error it did

play04:32

this exactly like I wanted

play04:36

right uh I did ask it to change some

play04:38

things and got

play04:41

this now at least it doesn't fall down

play04:44

partially problem is that I asked it for

play04:46

physics game and i' used physics engine

play04:48

made not for topown games but for Sid

play04:50

scroller

play04:51

games but we can also here see why this

play04:54

prompt is problematic it includes using

play04:57

physics engines a third party dependence

play05:00

and physics engines are notoriously very

play05:04

lacky in a sense that even small

play05:06

mistakes in how physics Works makes them

play05:08

go crazy that's what you see here so

play05:12

asking model to write this kind of code

play05:14

it needs to write a code it needs to

play05:15

understand how to use library and it

play05:17

needs to understand what I asked in

play05:18

context that I want a top- down parking

play05:22

game how do you use physics engine

play05:25

correctly to make it work and also

play05:27

render correctly this is a very it's not

play05:29

easy CU we are asking a textual model it

play05:34

never seen things how can it reason

play05:37

about physics and how to use physics

play05:39

engine for it to correctly simulate what

play05:42

what I'm asking this is this is hard

play05:44

even with iteration with son 3 in vbim

play05:47

or in clo this thing fails for me I've

play05:50

tried multiple and multiple times before

play05:53

I wanted to make this to help my wife

play05:55

illustrate how parking works when and

play05:57

how you should reason about where and

play05:59

where your car is and how different

play06:01

parts trajectories go when you're

play06:03

parking and I couldn't do it and I don't

play06:06

don't have time to sit down and do it on

play06:08

my own it could take takes it's not

play06:11

actually easy to do this from scratch to

play06:14

work correctly so Sonet and Claud and

play06:18

web fail for me with this kind of PR

play06:20

let's take a look at what openio wanted

play06:23

here is the same PR here is chat with uh

play06:28

ch1 preview the slower more expensive

play06:32

smarter model it thought for 24 seconds

play06:36

it didn't show whole reasoning it just

play06:37

showed summary of it and we here see

play06:40

that it says crafting the simulation I'm

play06:43

creating GTA 2 style top down car game

play06:46

it will be HTML and

play06:48

JavaScript it will Design on

play06:52

canvas will be

play06:54

controls mapping out the game I'm

play06:57

setting up HTML the game Conners will

play06:59

serve as rendering area setting up

play07:02

setting up environment I'm pulling

play07:03

together the game conas driving car

play07:05

class focusing on speed acceleration

play07:07

steering position updates and rendering

play07:10

we can see that it's reasoning through

play07:12

in different ways through what it's

play07:13

going to be doing then it would have

play07:16

enchancing car physics I'm working

play07:18

through realistic car physics including

play07:20

wheel rotation it's like selft talk you

play07:22

know like it's self-affirmation what

play07:24

humans do in the sense that what is it

play07:27

that I'm doing now oh yeah that's what

play07:28

I'm doing okay let's do

play07:30

it so interesting what's interesting

play07:33

that clae also does that we can see here

play07:36

some of its reasoning being put out but

play07:39

there is also hidden part of the

play07:41

reasoning that they do not show we know

play07:43

from proms that they have thinking parts

play07:45

and they are not showing them here

play07:47

they're hiding them so CLA does this too

play07:50

and this does this too does it perform

play07:52

better well it generated this kind of

play07:54

code actually big one usually chipt was

play07:57

not generating such a big one code from

play07:59

what I seen o1 preview and o1 mini can

play08:03

output very large answers something like

play08:06

eight or more like 8 to 16 times larger

play08:09

than before this is not 8 to 16 times

play08:12

larger than before but it's an large

play08:14

answer and we can say that it gives us

play08:17

explanation like there is HTML convas is

play08:19

of this size there are colors there is

play08:22

car code it explains everything about

play08:24

the code you can read it and learn on

play08:26

some level of what the code it wrote

play08:28

does

play08:30

it has features realistic physics how to

play08:32

use use with this arrows and we have

play08:35

code what we can do is we can copy that

play08:37

code we can go to code pen past that

play08:40

code let's save so that we have empty

play08:45

one save again and this is what we

play08:49

got it's a little bit

play08:52

unfinished but it's

play08:54

working Wheels rotate going I can go

play08:58

back I can go forward and it is drawing

play09:01

a line only issue is that this line

play09:03

seems to be the middle of the rectangle

play09:05

of the car it's a little bit not what I

play09:07

want I probably would draw actually

play09:10

Trails of corners of the car's body I

play09:13

think this is the most important thing

play09:15

for this kind of game but um it did it

play09:19

it did it from first try single prompt

play09:22

one shot no correction it just

play09:26

works this is impressive

play09:30

so that's it I just compared Sonet to 01

play09:33

model 01 did in one shot something I was

play09:36

trying to achieve with web seam or

play09:39

CLA it's not

play09:41

perfect then one more interesting thing

play09:44

happened I've gave this code to web seam

play09:47

to see what it's going to do let's take

play09:49

a look at

play09:51

that so I think it was here here I gave

play09:57

web Sim the same code and it worked we

play10:01

have the same car that 01 generated but

play10:05

webam added other things we can see

play10:07

speedometer we can see parking spots we

play10:10

can see score we can see buildings and

play10:13

if we go into the building it's going to

play10:15

become

play10:16

red we can go and try to

play10:26

park oh we got 100

play10:29

score and four parking spots remaining

play10:33

we can go to the next

play10:36

one I also wanted a little bit more work

play10:40

around the

play10:42

trails now we have four

play10:46

trails and we have also some

play10:50

settings so we can change acceleration

play10:54

and some other things now it's right

play10:57

faster we we we we we we oh my

play11:02

God so what happened here and we can go

play11:05

and see the prompts so we can see that

play11:07

first prompt was just code that o1

play11:10

generated and then I said that make it

play11:13

draw four Trails one for each will leave

play11:16

other things as this add settings

play11:18

control minimal and maximal speed

play11:20

friction control and so on make left

play11:22

right arrows rotate the wheel and so on

play11:24

so on so I I iterated little bit and I

play11:26

got even better version so what's

play11:28

happening here is is I've gave result of

play11:32

smarter model presumably seems like o1

play11:35

is smarter than son 35 I gave thinking

play11:38

from smarter model to a less capable

play11:40

model and it was capable of using it

play11:43

correctly not breaking it and improving

play11:45

Aon it which is interesting and

play11:47

impressive use case you can use smarter

play11:49

models to get you started or for hard

play11:51

problems and for other things use

play11:52

cheaper faster models it's an

play11:54

interesting future we're going into so

play11:57

this is another thing I wanted to show

play11:59

now all of what I showed you I've did

play12:02

yesterday evening now I want to do a

play12:03

little bit of live testing with you the

play12:06

gp4 model and here I want to create so

play12:10

we using preview I want a new

play12:13

chat so what I'm thinking I still

play12:16

slightly dislike this I think it's not

play12:19

realistic enough in some

play12:22

ways it doesn't really behave like a

play12:25

physical

play12:26

car and it got me thinking while o1 is

play12:30

impressive it moves the button further

play12:33

it can do more it's not that sky is the

play12:36

limit ceiling has risen but like but

play12:40

it's lower than the sky and this is what

play12:43

I want to show you because I was playing

play12:45

with it a little bit and it does fail it

play12:48

just it does more complex things and

play12:50

fails less it still

play12:51

fails so what I was thinking let's ask

play12:54

it to make a 3D game let's make it even

play12:57

harder let's try to push it Until It

play12:59

Breaks A little bit let's make a harder

play13:03

game I want

play13:06

HTML JavaScript and

play13:09

CSS okay it should be in 3D with physics

play13:17

it should

play13:20

be parking

play13:23

game there is a car four wheels front

play13:28

wheels

play13:29

State wheels are round there is friction

play13:36

inertia and everything else to make it

play13:40

realistic purpose of the game is to

play13:45

allow people to learn how to park how to

play13:49

think and reason about when and why and

play13:53

how you need to rotate the whe

play13:57

will what are

play13:59

trails and trajectories of wheels and

play14:04

car

play14:06

Corners so people can learn in the game

play14:10

and apply that knowledge to real

play14:14

parking let's ask it this what's

play14:17

interesting here and I I do suspect it's

play14:19

going to fail what's interesting here is

play14:21

that I'm now asking it to use third

play14:23

party Library game engine and it's in 3D

play14:26

and it should be realistic and there is

play14:28

purpose in teaching people how to park

play14:30

let's see what it's going to

play14:32

do it's very challenging Asar in some

play14:36

ways so it's thinking it's

play14:38

crafting so crafting parking game 3D in

play14:43

HTML JavaScript CSS setting up 3D

play14:45

elements crafting a game creating a

play14:47

realistic 3D parking HTML game it will

play14:51

use 3GS

play14:52

library and Canon G for physics note due

play14:56

to the complexity of full-fledged game

play14:57

the following code provides a basic

play14:59

framework you can expand upon it to add

play15:01

more features

play15:03

okay

play15:05

okay so what I dislike so far is that it

play15:07

writes it as a bunch of different

play15:12

segments I want to ask it can you write

play15:15

it as one large HTML code block I can

play15:20

copy and paste at

play15:25

once so on the reasoning side I think

play15:28

it's good choice three jazz is most

play15:30

popular Library Canon Jazz I do not know

play15:33

I have not much experience with 3D

play15:36

physics but it seems more or less

play15:39

correct it creates the car there is

play15:41

geometry there are wheels and there is a

play15:44

lot of opportunity for failure it is

play15:46

very hard task so now I asked it it to

play15:48

do a one block it thought for 7 seconds

play15:52

and wrote for me a block that actually

play15:54

looks

play15:55

smaller is it we can see it's probably

play15:59

not as good let's create a new fan see

play16:02

what this small block

play16:06

did okay it's

play16:09

actually the only thing it's not

play16:12

writing can

play16:15

I but it's almost

play16:17

there um

play16:21

okay I wonder if we could add all of

play16:24

that let's try to

play16:26

check uh

play16:29

if we search for this it is here twice

play16:36

okay okay let's give it feedback let's

play16:38

see if it can

play16:43

iterate oh I I see that wheels are in

play16:45

different directions it cannot analyze

play16:48

images as well so it's not multimodel

play16:50

model just yet I didn't check in maybe

play16:53

it is multimodel Data just didn't expose

play16:55

the capability oh what's interesting it

play16:58

hand it still continue to

play16:59

work it didn't finish uh so let's give

play17:03

it feedback it seems like wheels are not

play17:08

correctly

play17:10

positioned they

play17:12

go parallel to the

play17:15

ground ground so when I press arrows car

play17:21

does uh right anywhere anywhere can you

play17:26

ruminate

play17:28

and why and how to

play17:31

fix so I don't want to make this video

play17:34

too long after this iteration we're

play17:36

going to stop for this last part I just

play17:38

wanted to demonstrate that it's not

play17:40

magic it is better in some ways it's

play17:43

considerably better but it's not human

play17:47

level developer yet on other hand it's

play17:50

fast it is faster than human developer

play17:53

doing these kind of things so it's again

play17:57

noticing issue the user has been point

play17:59

of the wheels is parel to the ground

play18:00

causing the car to stay motionless when

play18:01

arrows are

play18:03

pressed hang out the options analyzing

play18:06

code

play18:07

setup ensuring correct wheel setup

play18:10

navigating orientation

play18:12

differences oh it's thinking it was

play18:15

taking for a while and it failed so yeah

play18:18

no magic there let's

play18:25

see okay this time it thought slightly

play18:27

less

play18:29

and it's writing new code well let's

play18:32

keep some fingers crossed maybe it's

play18:34

going to

play18:38

work as you can see it writes more code

play18:42

I wonder how many lines that is I

play18:44

remember they were speaking about

play18:45

something like 32,000 16,000 64,000

play18:48

tokens this is a lot it wrote A big one

play18:51

it explains what it try to

play18:53

fix it even wants to run oh boy this is

play18:57

going to be interesting in ating

play18:58

workflows it speaks about how to test

play19:01

that you will not run to server open it

play19:03

I wonder if it's going to be able to

play19:04

manipulate a browser

play19:06

eventually okay so we here have no new

play19:09

code let's give it a try

play19:12

sa and I think it completely failed and

play19:16

it failed so not magic it's better and

play19:19

impressive but still will require

play19:22

iteration there will be type of problems

play19:25

for which it's going to be failing this

play19:27

is all I wanted to show today it's

play19:29

better it's going to be interesting to

play19:31

see where it goes this was round one in

play19:34

it it did win against son 35 I will be

play19:36

playing with it more during next weeks

play19:38

and I will make another video uh if I

play19:41

learn anything interesting so if you

play19:43

like this kind of things push subscribe

play19:45

down there and see you next time

Rate This

5.0 / 5 (0 votes)

الوسوم ذات الصلة
KI-ModellParkplatz-SimulatorPhysik-SimulationProgrammierungWeb-EntwicklungJavaScriptCanvas3D-GameMachine LearningTechnologie-Test
هل تحتاج إلى تلخيص باللغة الإنجليزية؟