Sora Release Date Deep Dive

Theoretically Media
14 Mar 202410:26

Summary

TLDRこのスクリプトは、話題のAI技術であるSoraについて詳しく語っています。OpenAIのCTOであるMir Moraは、Wall Street JournalのインタビューでSoraのリリースについて触れ、新しいビデオのサンプルも公開されました。Soraのトレーニングデータに関しても話がされていますが、詳細は明らかになりませんでした。また、Soraのリリース時期についても話されていますが、技術的な問題や安全面の懸念があるため、具体的なスケジュールはまだわかっていません。一方で、中国のAIスタートアップが急速に発展しており、競合他社が積極的に投資を受けています。また、他にもAI生成のSouth Parkエピソードや、Fable Studiosが開発するThistle Gulchという西部劇のプロジェクトなど、AI技術の進歩とその応用が紹介されています。

Takeaways

  • 🚀 Soraは、非常に高い期待と注目を集めており、OpenAIのCTOであるMir Moraは、Soraのリリースの可能性のあるタイムフレームについて語りました。
  • 🤔 Mora氏はWall Street Journalのインタビューで、Soraのトレーニングデータに関する問いに対して、詳細には答えず、曖昧な回答をしました。
  • 📺 Soraの新しいビデオが公開され、Pixar風のアニメーションの牛が中国の店内を破壊する様子などが含まれています。
  • 📹 Soraで生成されたビデオは、720pまたは標準定義で約20秒の長さで、数分で生成されると報告されました。
  • 🔍 Mora氏は、Shutterstockからデータを使用してSoraをトレーニングしたと語りましたが、詳細については明言しませんでした。
  • 📅 Soraのリリースについては、Mora氏は今年内、もしくは数ヶ月以内にリリースされる可能性があると述べましたが、そのスケジュールには懸念があります。
  • 💬 YouTubeのMares BrownleyがSoraのプロジェクトリードと話した際には、リリースのタイムフレームはなく、近い将来はないと答えられました。
  • 💰 中国のAIスタートアップに資金が注ぎ込まれており、Soraと競うためにピクシバースやAIスフィアが1400万ドルを獲得しました。
  • 🎵 Mora氏はSoraが最終的に音声を持つと述べましたが、チームはまだ検討段階だと答えています。
  • 🤖 Soraは、GPTファミリーに似たようなモデルで、扩散Transformerの研究に基づいています。
  • 🎨 チームは、アーティストがSoraのようなツールを使って何を作り上げるかに興味があり、プロンプトによる映画制作よりも、新しいユニークな方法で遊ぶことに注目しています。
  • 📽 AIによって生成された「South Park」のエピソードが話題となり、その背後にあるのはFable Studiosです。彼らはシミュレーション技術を用いて、新しいプロジェクトを進めています。

Q & A

  • SoraというAI技術がどのような期待をされていますか?

    -Soraは非常に高い期待をされていますが、その期待は現実的でない場合もあるとされています。

  • OpenAIのCTOであるMir MoraはSoraのリリースについてどのような情報を提供しましたか?

    -Mir MoraはWall Street Journalのインタビューで、Soraのリリースについての興味深い詳細を提供しましたが、その中には疑問も残ります。

  • Soraのトレーニングデータには何が含まれていますか?

    -Mir MoraはSoraが公開されているデータでトレーニングされたと述べましたが、YouTubeやFacebookの動画が含まれているかどうかについては詳しくは語らなかったです。

  • Soraの生成動画の品質と生成にかかる時間について教えてください。

    -Soraの生成動画は720pまたは標準定義で、約20秒の長さで、数分で生成されると報告されていますが、一般的には20秒の動画を生成するには30分から1時間半かかるとされています。

  • Soraのリリースの見通しについてMir Moraはどのように述べましたか?

    -Mir Moraは今年内、おそらく数ヶ月以内にSoraがリリースされる可能性があると述べましたが、その見通しには懐疑的な見方も指摘されています。

  • Soraのプロジェクトリードがリリースの見通しについてどのように述べましたか?

    -Soraのプロジェクトリードは、YouTubeのMares Brownleyのポッドキャストで、リリースのタイムフレームはなく、すぐにはリリースされないと述べています。

  • Soraの音声機能についてどのような情報が提供されていますか?

    -Mir MoraはSoraが最終的に音声機能を持つと述べましたが、プロジェクトリードはその実現に向けて検討しているとのことです。

  • Soraのトレーニングに使用されたデータソースについて教えてください。

    -SoraはShutterstockのデータでトレーニングされたとMir Moraが述べ、OpenAIはShutterstockと関係を持っています。

  • Soraのプロジェクトリードが最も興味を持っていることは何ですか?

    -プロジェクトリードは、アーティストがSoraのようなツールを使って何を作り上げるかに最も興味を持っています。

  • AI生成のSouth Parkエピソードについて教えてください。

    -AI生成のSouth Parkエピソードは、The Simulation社(Fable Studios)によって作成され、技術デモとしてリリースされました。

  • Fable StudiosのThistle Gulchプロジェクトとは何ですか?

    -Thistle Gulchは、西部劇のテイストが強いプロジェクトで、小西部町で発生した殺人事件を巡って警部が捜査を進める物語です。

  • Fable Studiosが提供するSageとは何ですか?

    -Sageは、Fable Studiosが提供するオープンソースのAIエージェントで、物語やキャラクターの対話を深くカスタマイズすることができます。

Outlines

00:00

🤖 Soraの登場と期待:オープンAIのCTOが話す

SoraというAI技術が注目されており、オープンAIのCTOであるMir Moraは、Wall Street JournalのインタビューでSoraのリリースについて触れています。新しいSoraのビデオも公開されており、その生成速度についても話されていますが、生成にはまだ時間がかかることが予想されます。また、Soraのトレーニングデータについても議論されており、Shutterstockからデータを使用していることが明らかになっています。MoraはSoraのリリース時期について今年内である可能性を示唆していますが、プロジェクトリーダーたちの過去のコメントと照らし合わせると、その見通しは曖昧です。

05:01

🚀 AI技術の競争と進化:Soraの音声化とその可能性

中国のAIスタートアップがSoraと競争するために資金を集めており、Soraは最終的に音声機能を持つ予定です。プロジェクトリーダーは、Soraのトレーニングデータについても話しており、GPTファミリーに似ているとされています。Soraはアーティストがどのように使用するかに興味を持ち、新しい創造的な方法でプレイすることを期待しています。また、AI生成のSouth Parkエピソードが話題になり、その背後にある技術はFable Studiosによって開発されています。彼らはシミュレーション技術を使って、新しいプロジェクトを進めています。

10:02

🎬 AIとエンターテイメント:Thistle Gulchの物語

Fable Studiosは、AI技術を使って物語を進化させています。彼らのプロジェクトThistle Gulchは、西部劇の風味を持ちながらも、AIエージェントが物語を進めています。シミュレーション技術を使って、キャラクターに背storiesを与え、観客がインタラクティブな選択肢を提供できます。Fableはまた、Sagaをオープンソースとしてリリースし、AIの意思決定プロセスや会話生成をカスタマイズできる強力なPython APIを提供しています。Thistle Gulchのベータ版への登録も開始されており、興味のある人は参加することができます。

Mindmap

Keywords

💡Sora

Soraは、OpenAIが開発しているAI技術で、映像生成に特化しています。この技術は、高度なアニメーションやビデオを短時間で生成することが可能です。ビデオのテーマは、Soraの期待、技術の進歩、そして将来の可能性に焦点を当てています。

💡OpenAI

OpenAIは、人工知能技術を研究開発する組織で、Soraの開発にかかわっています。彼らは、AIの進歩とその社会的影響に貢献しています。ビデオでは、OpenAIのCTOであるMir Mora氏のインタビューが取り上げられており、Soraに関する情報を提供しています。

💡AI生成コンテンツ

AI生成コンテンツとは、人工知能によって生成されるコンテンツのことを指します。ビデオでは、Soraが生成する映像や、AIによって生成されたSouth Parkエピソードが例として挙げられています。これにより、創造性とエンターテイメント性の新しい次元が開かれています。

💡データの訓練

データの訓練とは、AIが学習するためのプロセスで、大量のデータを用いてモデルをトレーニングします。Soraは、Shutterstockなどの公開またはライセンスされたデータで訓練されたとされています。ビデオでは、訓練データの詳細について言及されていますが、具体的な情報は明かされていません。

💡映像生成モデル

映像生成モデルは、AIが映像コンテンツを生成するアルゴリズムです。Soraは、これらのモデルを用いて、短時間で高品質の映像を生成することができます。ビデオでは、Soraの映像生成モデルの進歩と、それが芸術家やクリエイターに与える可能性が議論されています。

💡テクニカルデモンストレーション

テクニカルデモンストレーションとは、技術的な製品やサービスを実際に動作させて、その機能や性能を示すプロセスです。ビデオでは、Soraのデモンストレーションとして、新しい映像が提示されており、その生成速度や品質について説明されています。

💡インタラクティブフィクション

インタラクティブフィクションは、読者が選択肢に応じて物語が進展するタイプのエンターテイメントです。ビデオでは、Fable Studiosが開発しているThistle Gulchというプロジェクトで、AIエージェントが背storiesを持って物語を進めることが説明されています。これは、新しいタイプのエンターテイメントとして注目されています。

💡オープンソース

オープンソースとは、ソフトウェアのソースコードが公開され、誰もが自由に使用・改変できることを指します。ビデオでは、Fable Studiosが開発したSagaをオープンソースとして公開し、AIの決定プロセスや会話生成をカスタマイズできるようにしていることが紹介されています。

💡AIエージェント

AIエージェントとは、人工知能を用いて動作するプログラムです。ビデオでは、Thistle Gulchのシェリフとして登場するAIエージェントが、物語を進めるだけでなく、自らのバックストーリーを持つことが説明されています。これにより、より豊かな物語体験が可能になります。

💡Westworld

Westworldは、AIが主体的な役割を果たすSFドラマです。ビデオでは、Thistle GulchのAIエージェントが持つ背storiesと、その独自の選択肢がWestworldと比べられる点で触れられています。また、実際のロボットがChat GPTに接続される可能性についても言及されており、Westworldのような世界が近づいているとの予想を示唆しています。

💡Hyper

Hyperは、無料で利用できる映像生成ツールのひとつです。ビデオでは、Soraを待つ必要はなく、今すぐ作成を始めるべきであり、Hyperのような既存のツールを活用することが勧められています。これにより、クリエイターは新しい技術を待つことなく、今すぐ作品制作を始めることができます。

Highlights

Sora, an AI technology, has been hyped with potentially unrealistic expectations.

OpenAI's CTO, Mir Mora, discussed a potential timeframe for Sora's release in an interview with the Wall Street Journal.

New Sora videos showcased, including a Pixar-styled animated bull and a robot interaction with a female reporter.

Sora's training data was a point of contention, with Mora stating it was trained on publicly available or licensed data.

Moradi mentioned that Sora was trained on data from Shutterstock, which OpenAI has a relationship with.

Sora's release timeframe is uncertain, with Mora suggesting 'this year, possibly within a few months', contrasting with tech leads' 'no timeframe, not anytime soon'.

An arms race in AI funding is occurring, with Chinese AI startups receiving significant investments to compete with Sora.

Sora's potential capabilities include sound, as mentioned by Mora, and is a focus of the development team.

The development team referred to Sora using a version number, hinting at possible future iterations.

Sora's training methodology resembles the GPT family, based on research that began with a NYU computer science professor.

The team is interested in how artists will use Sora, focusing on new and unique applications rather than pre-defined prompts.

The speaker encourages creators to start making content now instead of waiting for Sora.

AI-generated South Park episodes were created by The Simulation, a company also known as Fable Studios.

The Simulation is working on a virtual city project called Sim Francisco and a Western-themed project called Thistle Gulch.

Thistle Gulch features interactive AI characters with backstories, driven by a generative AI system.

Fable has released Saga, the AI system used in Thistle Gulch, as open source.

The speaker signed up for the beta waitlist of Thistle Gulch and encourages viewers to do the same.

Figure One recently showcased how their robot can be integrated with chat GPT, hinting at future advancements in AI.

Transcripts

play00:00

hey everyone so Sora has obviously been

play00:02

hyped pretty hard I'd say to some pretty

play00:04

unrealistic expectations the latest is

play00:07

an interview with open ai's CTO Mir Mora

play00:10

in which she discusses a potential time

play00:12

frame for sora's release I have some

play00:15

doubts about that we're going to talk

play00:16

about that plus I have some other

play00:18

interesting sore details that I have not

play00:20

seen covered anywhere else also remember

play00:22

those AI generated South Park episodes

play00:24

from a few months back well that

play00:26

technology is starting to roll out and

play00:27

I'm going to show you how you can get

play00:29

access to it all right let's dive in

play00:31

kicking off with Sora in a recent

play00:33

interview with the Wall Street Journal

play00:35

open ai's CTO Mir moradi fielded some

play00:38

questions about Sora and provided some

play00:40

pretty interesting details but things

play00:42

get really interesting when you contrast

play00:44

Mora's Wall Street Journal answers with

play00:47

an interview from the Sora tech leads

play00:49

conducted just 5 days beforehand we'll

play00:52

give it the smell test in just a minute

play00:53

but first let's break down morat's Wall

play00:55

Street Journal interview which does get

play00:57

a little spicy uh which makes sense

play00:59

considering that you know open AI is

play01:01

currently being sued by the New York

play01:03

Times over copyright infringement first

play01:05

and probably most exciting is the fact

play01:06

that we got to see some new Sora videos

play01:09

we got this sort of Pixar styled

play01:11

animated bull in a china shop um yeah it

play01:13

looks pretty good it was pointed out

play01:15

that the bull should probably be causing

play01:17

more destruction in said china shop we

play01:20

got the prompt of a female video

play01:21

producer on a sidewalk in New York City

play01:23

holding a high-end Cinema Camera

play01:25

suddenly a robot Yanks the camera out of

play01:27

her hand and this was the result in

play01:29

which the female reporter kind of does

play01:31

this weird dance move and then morphs

play01:33

into the robot and holds the world's

play01:35

weirdest looking cinema camera that I've

play01:37

ever seen I do want to shoot on that

play01:39

thing though it's such a weird camera

play01:40

it's got like two lenses on the front

play01:42

and one on the side we also had some

play01:44

pretty convincing footage of two women

play01:46

giving an interview there was some

play01:48

issues of course with uh the one woman's

play01:50

fingers but yeah again that's to be

play01:52

expected other than that it actually

play01:53

looks pretty good and we also got to see

play01:55

bootleg Ariel with a a mermaid reviews a

play01:58

smartphone prompt interestingly the

play02:00

reporter states that these videos were

play02:02

720p or standard definition and were

play02:04

about 20 seconds in length morate

play02:07

reported that it took a few minutes to

play02:09

generate which does seem fast

play02:11

considering most reports indicate that

play02:13

it takes anywhere between 30 minutes to

play02:15

an hour and a half to generate uh 20

play02:18

second videos in Sora I can only presume

play02:20

that the now famous 1081 minute examples

play02:23

on the Sora website probably took a lot

play02:25

longer than that now it's possible that

play02:27

open AI have optimized since those

play02:29

report s came in uh we're going to be

play02:31

hearing a little bit more about that

play02:33

from the tech team in just a little bit

play02:35

now the real kind of clipped and

play02:37

bookmarked moment from this interview is

play02:39

when Mora is asked about the training

play02:41

data used for Sora she states that Sora

play02:44

was trained on publicly available or

play02:46

license data but pressed on if any of

play02:49

that data could possibly include say

play02:51

YouTube videos or Facebook videos morate

play02:54

says she doesn't know and she can't be

play02:56

sure but um I'm I'm not sure I'm not

play02:59

confident about about it she eventually

play03:00

just completely stonewalls the question

play03:03

saying I'm I'm just not going to go into

play03:05

the details of of the data that was that

play03:08

was used she did off camera and after

play03:11

the interview and likely after

play03:12

Consulting with like the army of lawyers

play03:14

that are probably just standing right

play03:15

off screen that Sora was trained on data

play03:18

from Shutterstock which open AI does

play03:21

have a relationship with this one felt a

play03:23

bit weird to withhold on considering

play03:25

it's been well public knowledge since at

play03:27

least July 11th of 2023 in terms of

play03:30

sora's actual release Mora does state

play03:33

that you know given the amount of power

play03:35

in compute uh the amount of you know

play03:37

expenses in terms of that compute that

play03:40

they don't know what Sora will look like

play03:43

when it is eventually released

play03:45

personally and I am speculating here I

play03:47

take that to mean that we will not be

play03:49

generating 1080p one minute long videos

play03:53

with Sora but probably something more in

play03:55

the 4 to 10 second range very much akin

play03:58

to the video generators that we

play04:00

currently have access to but the hot

play04:03

ticket question on everyone's mind is

play04:05

when does sore release to which maradi

play04:07

replied this year possibly within a few

play04:10

months I have some doubts at least on

play04:13

the in a few months time frame mostly

play04:15

because just 5 days before this

play04:17

YouTube's own mares brownley had an

play04:20

interview with open AI Bill peoples Tim

play04:23

Brooks and aled ey raash the project

play04:25

leads of Sora on his podcast they were

play04:29

asked the same question when does Sor

play04:31

released and their answer was we have no

play04:33

time frame not anytime soon now look

play04:36

anyone that has worked anywhere knows

play04:38

that sometimes in a meeting the boss is

play04:40

just going to say we're doing a thing

play04:41

and all of you guys who actually do the

play04:43

thing are like we are what where did

play04:45

that come from we have all been there

play04:47

now is that what is actually happening

play04:48

here well I don't know but it does seem

play04:50

a little suspect that in the span of 5

play04:52

days we've gone from we have no time

play04:54

frame to this is happening in a few

play04:56

months especially when you factor in all

play04:58

of the hurdles both from a technical

play05:00

standpoint and frankly from a safety

play05:02

guard rail standpoint that need to be

play05:04

overcome before release that said there

play05:06

is definitely an arms race going on

play05:08

right now with funding pouring into

play05:10

Chinese AI startups to compete with Sora

play05:13

just this week both pixiverse and AI

play05:16

sphere have both received $14 million

play05:19

that's like $14 million each to catch up

play05:22

with open AI marate also stated in the

play05:24

Wall Street Journal interview that Sora

play05:26

will eventually have sound whereas the

play05:29

team actually stated it was something

play05:30

that they were thinking about that's not

play05:32

a gotcha or anything like that but there

play05:34

was an interesting little tidbit in Bill

play05:36

people's answer about sound wherein he

play05:39

said it's hard to give exact timelines

play05:41

with these kinds of things uh for sort

play05:44

one we were really focused on pushing

play05:47

the capabilities of video generation

play05:48

models is it telling that he referred to

play05:50

it as Sora 1 it's unsurprising that he

play05:53

refers to it by a version number you

play05:55

know considering we're sitting here

play05:56

waiting for Chad GPD 5 but I did find it

play05:59

fascinating that he referred to it in

play06:01

the past tense which does beg the

play06:04

question are they already working on

play06:05

Sora 2 and what does that look like in

play06:07

Marquees interview which is linked down

play06:09

below the team is also asked about what

play06:12

data Sor was trained on and similarly

play06:15

they kind of give you know a Stonewall

play06:16

answer there but they do state that Sora

play06:18

was trained quote like Del but it more

play06:21

resembles the GPT family that likely

play06:24

backs up the speculation and reverse

play06:27

engineering research papers that have

play06:29

been released on Sora that state that

play06:31

it's actually built on a diffusion

play06:34

Transformer research on that began with

play06:36

signing she a NYU computer science

play06:40

Professor back in July of 2022

play06:43

interestingly his Mente at that time

play06:46

Bill peoples ultimately through both

play06:47

interviews both Mora and the team state

play06:50

that they're most interested in seeing

play06:52

what artists eventually end up doing

play06:54

with a tool like Sora they don't seem to

play06:56

be very interested in like prompt a

play06:58

movie but rather seeing what happens

play07:00

when you and I start playing with it in

play07:03

new and unique ways that even they

play07:05

couldn't foresee as I've been saying

play07:07

over the last few videos if you want to

play07:09

make something don't wait for Sora just

play07:11

start making it now I mean for one I

play07:13

just don't think that Sora is going to

play07:15

do what you think it's going to do at

play07:17

least at launch and look given what

play07:19

we've seen of sora's output including my

play07:21

personal favorite the shark at the beach

play07:24

video I mean look at this thing it's

play07:25

amazing it is amazing it's still going

play07:27

to be weird morphe and and it's not

play07:30

going to be doing consistent characters

play07:31

or locations so honestly use the tools

play07:34

that we have now there is the totally

play07:35

free Hyper that I covered just a few

play07:37

videos ago the way I look at it is that

play07:39

like it's 1958 and you have a Super 8 mm

play07:43

film camera but you don't want to make

play07:45

your film until you have like a Sony fx3

play07:48

in your hands it just doesn't make sense

play07:50

moving on remember those AI generated

play07:52

South Park episodes that made the rounds

play07:53

a few months back well they were created

play07:55

by a company called the simulation uh

play07:58

which is also Fable Studios it gets a

play07:59

little confusing it's a whole rabbit

play08:01

hole but I have been following them with

play08:03

great interest since the South Park

play08:05

episodes were released as more or less a

play08:07

tech demo to you know get some interest

play08:09

in Fable Studios slthe simulation uh

play08:13

which I think it definitely did uh since

play08:15

then they've been working on a number of

play08:17

other projects including Sim Francisco

play08:19

which I covered in another video but

play08:21

it's basically a virtual City filled

play08:23

with virtual AI citizens who go about

play08:26

their daily lives you know going to work

play08:29

going to sleep falling in love and even

play08:31

dying and for quite some time they have

play08:33

been working on thistle Gulch which is

play08:35

more or less a straight ahead Western

play08:37

but you know given all this technology

play08:39

it definitely has a lot of strong

play08:41

Westworld Vibes and while this might not

play08:43

look terribly cinematic at this point

play08:45

it's what's happening underneath that is

play08:48

really interesting narratively there has

play08:50

been a murder that has taken place in

play08:51

the small western town of thisle Gulch

play08:53

and we are following a sheriff who is

play08:55

investigating our sheriff is an AI agent

play08:57

who not only has obviously his goal of

play09:00

solving the murder but an entire

play09:01

backstory as well and so does every

play09:04

other character in this town all driven

play09:06

by Saga or skill to action generative

play09:08

agent in thisle Gulch you can be

play09:11

presented with interactive choices for

play09:12

your characters to make but you know

play09:14

they can also just make choices for

play09:16

themselves it's a really fascinating

play09:18

idea blending emergent storytelling with

play09:20

interactive fiction and although you

play09:22

know the overall look of it might not

play09:25

necessarily look all that cinematic

play09:26

right now I actually think it has kind

play09:28

of a cool look to it it sort of looks

play09:29

like Borderlands B says their platform

play09:31

is designed for creators researchers and

play09:34

AI enthusiasts offering unprecedented

play09:36

control over the narrative and character

play09:38

interactions through a powerful python

play09:40

API this allows for deep customization

play09:43

of AI decision-making processes and

play09:45

conversation generation additionally

play09:47

Fable has released Saga as open source

play09:49

so if you want to play with that code

play09:51

you can go ahead and do so right now as

play09:53

for thistle Gulch they have opened up a

play09:55

beta weit list that link is down below

play09:57

I've signed up for it you should

play09:58

definitely sign up for it too and you

play10:00

know I'm not covering it here today but

play10:01

figure one just recently released a

play10:03

video showcasing how their robot can be

play10:06

hooked up into chat GPT so maybe we

play10:08

really aren't far off from a real life

play10:11

Westworld hopefully we have learned the

play10:12

lesson not to shoot the robots because

play10:14

you know eventually they will shoot back

play10:16

on that note I thank you for watching my

play10:18

name is

play10:24

Tim

Rate This

5.0 / 5 (0 votes)

Related Tags
AI技術Sora動画生成オープンAIニューヨークタイムズ版権侵害アニメーションシミュレーションFable StudiosインタラクティブフィクションオープンソースSora 1Sora 2シミュレーション技術AIエージェント
Do you need a summary in English?