一口气了解英伟达,芯片新王凭什么是他?

小Lin说
30 Jun 202327:17

Summary

TLDR本视频深入探讨了Nvidia的崛起及其在AI领域的主导地位。从创始人黄仁勋的早期生活开始,讲述了Nvidia是如何从一个专注于3D图形处理芯片的小公司,成长为全球AI训练行业的霸主。视频详细回顾了Nvidia在游戏、加密货币、元宇宙和人工智能等全球科技创新中的关键作用,特别是其在AI硬件和CUDA编程平台上的创新,使其在深度学习和机器学习领域占据了几乎垄断的地位。最后,视频还探讨了Nvidia的市场策略、股价表现和技术挑战,以及它如何应对新兴的竞争对手和市场变化。

Takeaways

  • 😀英伟达从1993年成立至今,已经成长为全球显卡行业的龙头老大
  • 👌英伟达成功把握住了人工智能和大模型训练这波红利,成为科技行业最大的赢家
  • 🤔英伟达CPU可以以4%的成本和1.2%的功耗完成大模型训练
  • 😮英伟达的核心业务从游戏转向了数据中心,后者现占56%的营收
  • 📈英伟达在Q2的营收高达110亿美元,远超华尔街预期
  • 🔥AI训练都需要大量英伟达GPU卡,已成为衡量公司计算力的标准配备
  • 🏭英伟达从硬件、软件、服务全面布局,构筑强大生态圈
  • ⚠️英伟达的市盈率高达200倍,股价处于高位
  • 🌊芯片行业的迭代速度太快,很难形成长期护城河
  • 😎英伟达需要在AI浪潮中再战十年,才能守住龙头地位

Q & A

  • 英伟达是如何从一个小公司成长为全球GPU巨头的?

    -英伟达从1993年成立开始,经历了多次技术和商业上的风险与失败。但凭借杰出的研发实力、CUDA编程平台的建立、抓住比特币挖矿热潮等关键机遇,以及AI计算需求的爆发,逐步成为GPU市场的主导者。

  • CUDA对英伟达的发展有什么重要意义?

    -CUDA使英伟达的GPU变成可编程的,从而拓展了其应用范围。这为英伟达搭建了完整的生态,使其硬件与软件高度融合,形成了强大的护城河。

  • 为什么AI运算必须依赖GPU?

    -GPU拥有大量并行计算核心,特别适合AI模型训练中需要的海量并行简单计算。其算力远超过CPU,已经成为AI训练的标准配置。

  • 芯片行业的特点是什么?

    -芯片行业迭代速度极快,门槛高昂,需要大量人才和投入。芯片性能每18-24个月就会翻番,很难依靠单一技术或产品形成持久护城河。研发实力才是核心竞争力。

  • 英伟达如何在芯片行业获得垄断地位?

    -通过建立CUDA生态,以及保持高达25%的研发投入占比,英伟达能够保持GPU性能迭代的速度,始终领先竞争对手。这种高强度的技术积累,才是其垄断地位的基石。

  • 英伟达股价飙升的原因是什么?

    -英伟达抓住了AI浪潮的机遇,成为AI训练标准配置,未来发展前景广阔。基金和机构为避免漏买,纷纷高价扫货,推高了估值。

  • 英伟达可能面临哪些竞争压力?

    -谷歌、Facebook等都在自主研发AI芯片,争夺这片蓝海。另外英伟达并购ARM失败,也暴露了其扩张遭到科技巨头的联合抵制。

  • 黄仁勋对英伟达的发展起到了什么关键作用?

    -作为创始人和CEO,黄仁勋对CUDA编程平台的远见,以及高比例研发投入的决策,使英伟达形成了强大的GPU生态并获得AI时代红利。

  • 英伟达的命运会重演3dfx的覆灭吗?

    -尽管芯片迭代速度极快,但英伟达成功搭建起GPU+软件生态,并持续巨额投入研发。这为其提供了比3dfx更可靠的护城河,英伟达有望继续扩大优势。

  • 英伟达的四大业务板块中,哪个份额最大?

    -2022年,英伟达的数据中心板块收入占比已达56%,超过游戏板块的33%,成为英伟达的最大贡献者。

Outlines

00:00

😀Nvidia公司的起源与发展

这一段详细讲述了Nvidia公司的创立历史,最初专注图形处理芯片的研发,后来逐步扩展业务范围,通过不断的技术创新获得行业领先地位。

05:02

😉图形处理器行业的竞争格局

这一段概述了图形处理器行业的竞争情况,Nvidia和AMD这两家公司占据着专用图形处理器的主要市场份额。Nvidia通过并购和技术创新不断扩大优势。

10:03

🤑比特币给Nvidia带来意外收益

这一段讲述了比特币挖矿热潮给Nvidia带来的巨大且意外的市场需求,这为Nvidia赚取了大量收入。

15:08

😎人工智能成为新机遇

这一段阐述了图形处理器在人工智能训练中的应用,Nvidia早年对通用图形处理的投入为其在AI浪潮中获得先发优势。GPU性能成为衡量AI计算能力的关键标准。

20:10

🤔Nvidia今后的发展前景

最后,作者讨论了Nvidia当前的估值是否过高,以及在AI芯片行业日益激烈的竞争中,Nvidia能否继续保持领先。

Mindmap

Keywords

💡NVIDIA

NVIDIA是一家专注于图形处理器(GPU)和人工智能(AI)技术的公司。在视频中,NVIDIA被描绘为在全球科技创新中扮演着重要角色,特别是在云计算、加密货币、元宇宙和人工智能等领域。它通过其高性能的图形卡支持了多数AI模型的训练,占据了全球AI训练行业95%的市场份额。

💡Jensen Huang

Jensen Huang是NVIDIA的创始人和CEO,他出生于台湾台南,后移民至美国,并在斯坦福大学完成硕士学位。视频中描述了他如何与其他两位科技人士共同创立了NVIDIA,以及他对于公司未来方向的前瞻性思考,尤其是将NVIDIA引入AI领域的决策。

💡图形处理器(GPU)

图形处理器(GPU)是一种专门设计来处理图形和图像处理任务的电脑芯片,NVIDIA被广泛认为是这一领域的领导者。视频强调了GPU在非顺序计算任务中的效率,特别是在视频游戏渲染和AI模型训练中的应用,展示了它如何优于传统的中央处理器(CPU)。

💡CUDA

CUDA是NVIDIA开发的一个并行计算平台和编程模型,使得开发者能够使用NVIDIA的GPU进行通用计算。视频中提到,通过CUDA,NVIDIA不仅扩展了图形卡的应用范围,还为航天、生物制药、天气预测和能源探索等领域提供了加速计算的能力。

💡AI训练

AI训练指的是使用大量数据训练人工智能模型,以提高其性能的过程。视频中指出,NVIDIA的GPU由于其并行计算能力,成为AI训练的首选硬件,以低成本和低能耗完成了大规模语言模型的训练任务。

💡GeForce

GeForce是NVIDIA推出的一系列图形卡产品,主要面向消费者市场。视频中提到,GeForce256被称为世界上第一款真正的GPU,这标志着NVIDIA在图形卡行业的领导地位。

💡3D加速卡

3D加速卡是早期图形卡的一种,主要用于提高计算机处理3D图形的能力。视频中提及,在NVIDIA成立之初,市场上存在大量类似公司,竞争激烈,但最终NVIDIA通过创新和战略合作脱颖而出。

💡并行计算

并行计算是指同时使用多个计算资源解决问题的过程。视频强调了GPU相比于CPU在并行计算方面的优势,特别是在处理大量图像和视频数据时的高效率。

💡市场份额

市场份额指的是一个公司在特定市场上销售额或销售量占总市场的比例。视频中提到,NVIDIA在专用图形卡市场占据了约80%的市场份额,凸显了其在行业中的主导地位。

💡人工智能(AI)

人工智能(AI)是使计算机系统能够执行通常需要人类智能才能完成的任务的技术。视频中解释了NVIDIA如何通过其GPU技术,成为AI研究和应用的关键参与者,尤其是在AI模型训练领域。

Highlights

英伟达是全球GPU霸主,占有全球80%市场份额

英伟达在AI浪潮中是最大的赢家

英伟达创始人黄仁勋被称为AI教父

英伟达发明了GPU,专门用于图形处理

英伟达通过CUDA使GPU可编程,开启通用计算

比特币挖矿爆发为英伟达带来巨大收入

AlexNet通过英伟达GPU训练模型赢得 Imagenet比赛

英伟达推出A100成为AI模型训练标准

英伟达收购Mellanox,布局DPU领域

英伟达市值已达到万亿美元级别

英伟达估值虽高但有巨大风险溢价

芯片行业迭代速度极快难以形成壁垒

英伟达通过巨大的研发投入保持技术领先

科技巨头都在围追堵截英伟达

AI时代英伟达能否再次打开局面存在不确定性

Transcripts

play00:01

On May 24,

play00:02

Nvidia released their first quarter earnings

play00:04

Many investors are calling this an unprecedented

play00:06

once in a lifetime release

play00:08

Nvidia relied on

play00:09

the dazzling data

play00:10

giving Wall Street a slap in the face

play00:12

The stock soared by 30% on the day

play00:13

Market cap reached trillion of dollars

play00:15

propelled Nvidia into becoming sixth-largest company in the world

play00:17

surpassing Tesla

play00:18

and approaching Amazon

play00:19

Who would have thought that

play00:20

a company selling graphics cards

play00:22

would become the biggest winner

play00:23

in the AI ​​war in 2023

play00:26

For the past few years

play00:27

Nvidia participated in almost all of global tech innovation

play00:29

cloud computing

play00:29

cryptocurrency

play00:30

Metaverse

play00:30

Artificial Intelligence

play00:31

Nvidia is main player in all these

play00:32

Majority of the AI model you’ve heard of

play00:34

are trained with Nvidia graphics cards.

play00:36

Not only are they an industry leader

play00:37

but they monopolised global AI training industry

play00:39

by occupying 95% market share

play00:41

Even the quantity of owning

play00:42

Nvidia A100 graphics cards

play00:44

an indicator to measure a company’s computational power

play00:47

The founder, Jensen Huang said

play00:51

Sounds arrogant right

play00:52

But that’s the truth

play00:57

With his foresight from over 20 years

play00:59

and his unchanging outfit style

play01:01

becomes the Godfather of AI

play01:03

You must be wondering

play01:05

what’s so great about Nvidia

play01:06

How did they monopolise?

play01:07

Why nobody can compete against them

play01:08

Today let Lin take you down a trip

play01:10

to the story of Nvidia

play01:11

We’ll also talk about the secret behind

play01:13

graphic card and chip industry

play01:17

Let’s begin with the rise of

play01:19

Jensen Huang and Nvidia

play01:21

In 1963

play01:22

Jensen Huang was born in Tainan, Taiwan

play01:24

which means he is 60 years old this year

play01:26

At the age of 9, he moved to US

play01:28

after graduating from college

play01:29

he worked in two semiconductor companies

play01:30

focusing on chip design

play01:31

one of them is AMD

play01:33

a company that fought with Jensen

play01:34

for half of their lifetime

play01:35

until now

play01:38

After finishing his Master at Stanford, Jensen turned 30

play01:40

With two other tech guys who were

play01:42

also in their 30s with big ambition

play01:44

they planned to do something big

play01:46

They believed that 3D graphics processing

play01:48

has great potential in the future

play01:49

So in 1993

play01:50

they established NVIDIA

play01:52

specialises in graphic processing chip

play01:54

Jensen Huang is the CEO.

play01:56

he still is today

play01:57

With recommendation from his former boss

play01:58

he obtained $20 million capital investment

play02:00

from Sequoia Capital

play02:02

After all gaming industry needs real-time rendering when playing

play02:04

You can’t say there’s no 3D games

play02:06

but definitely can’t play with normal computer

play02:07

Many games are quite classic

play02:09

but the graphic

play02:09

is basically, well as long as you can see a figure there

play02:11

because this kind of 3D image processing

play02:13

is computationally intensive

play02:14

It was difficult for the CPU at that time

play02:16

Normally they need a specific chip

play02:18

to process the graphics

play02:19

and this chip is graphic card

play02:21

In the early days, the graphics card was very simple

play02:23

it was a 3D accelerator card at most

play02:26

Now when you hear 3D accelerator

play02:28

It sounds like a small workshop business

play02:29

That's right

play02:30

actually at that time 3D games and

play02:31

3D rendering

play02:32

was at seeding stage

play02:33

Graphics card companies like Nvidia

play02:35

are actually a lot

play02:35

at least 50-60 companies

play02:37

There was no uniform standard for both hardware and software

play02:38

Whoever comes out with better research

play02:39

can publish their own standard.

play02:41

Often times

play02:41

when you finally came up with a graphics card

play02:43

but it actually is

play02:44

not compatible with other people’s standard

play02:46

Just one word

play02:47

Chaotic

play02:48

One of the most famous company at the time

play02:50

is a company called 3dfx

play02:51

It was established in 1994,

play02:53

a year later than Nvidia

play02:54

At that time, a graphics card called Voodoo

play02:56

was all the rage

play02:57

Many popular games during that time

play02:58

relied on Voodoo graphic card

play03:00

It wasn’t going so well with NVIDIA

play03:02

Although they obtained capitals

play03:03

and had very professional team

play03:05

but their NV1 was not successful

play03:07

NV2 was aborted

play03:10

By 1997

play03:11

NVIDIA was hanging on by a thread

play03:12

9 more months till they run out of money

play03:14

The company downsized its staff from 100

play03:15

till there were about 30 people left

play03:17

Jensen Huang took a gamble

play03:19

Just when the company only had

play03:20

6 months of operating capitals

play03:21

They released Riva128 graphic card carrying NV3

play03:24

With its good price/performance ratio,

play03:25

they finally occupy a place in the market

play03:28

and allow NVIDIA to survive

play03:30

Actually Jensen and his team

play03:32

are very strong in their R&D

play03:33

After they figure out

play03:34

the market direction

play03:36

they managed to enter fast lane quickly

play03:38

They reached a long-term strategic cooperation with TSMC

play03:40

At the same time they cooperate closely with Microsoft

play03:41

supported the Direct 3D display standard introduced by Microsoft

play03:44

Finally they rose in the sea

play03:46

of competition in graphics card industry

play03:46

With RivaTNT

play03:48

it helped

play03:49

NVIDIA to become

play03:50

industry leader in graphics card industry

play03:52

In 1999, NVIDIA

play03:53

successfully listed on NASDAQ

play03:56

After listing

play03:57

NVIDIA had more money and

play03:58

in September 1999, they released

play04:00

the epoch-making GeForce256

play04:02

which made them the leading force among their competitors

play04:05

I believe that gamers

play04:06

should be familiar with

play04:07

this GeForce series.

play04:08

It has also become Nvidia’s

play04:10

the flagship line of consumer graphics card

play04:11

Jensen Huang named GeForce256

play04:13

the world’s first GPU

play04:15

The first truly dedicated graphics card

play04:17

The claim is

play04:18

basically accepted by everyone

play04:19

Hence some people might generally claimed that

play04:21

Nvidia invented the graphics card.

play04:23

As for why this dedicated graphics card is so powerful,

play04:25

we will talk about it later.

play04:27

At that time, Microsoft happened to be working on Xbox

play04:29

With GeForce 256

play04:30

powerful performance, NVIDIA managed to

play04:32

score $200 million worth of order

play04:34

After building image processing hardware for Xbox

play04:36

they then score another with Sony’s PS3

play04:38

From 1999 to 2002,

play04:40

Nvidia’s revenue

play04:40

almost doubled every year to

play04:42

$2 billion

play04:43

becoming the only player in the market

play04:45

They started to acquire

play04:46

competitors in the same industry

play04:47

one of them we mentioned earlier

play04:49

the once popular 3dfx

play04:51

Another major player in the market

play04:53

ATi was acquired by AMD

play04:56

And so in the early 2000s

play04:57

after a series of merger and acquisition

play04:59

happening in the market

play05:01

There are only two players left in the market

play05:03

Nvidia and AMD.

play05:04

Until now, the dedicated graphics card market

play05:06

has been dominated by these two companies.

play05:08

I don’t know if you have heard of

play05:09

the legendary N card and A card

play05:11

actually refer to the graphics cards of these two companies.

play05:13

Gamers started to argue

play05:15

whether N card or A card is better

play05:16

It doesn’t matter which one is better

play05:17

after that there isn’t a

play05:18

third company exists in the market

play05:19

like B card of C card or X card

play05:21

It’s a two-horse race

play05:22

However NVIDIA is gradually

play05:25

eating away AMD’s market share

play05:27

from 60% in 2010

play05:28

slowly expanded to

play05:29

80% in 2022.

play05:30

becoming global GPU hegemon

play05:33

The speed of development of GPU technology itself

play05:35

is also jaw-dropping

play05:36

The rapid development of gaming industry

play05:38

has supported Nvidia,

play05:39

At the same time, NVIDIA’s graphics card development

play05:41

has promoted the development of the gaming industry.

play05:43

Look at the new games coming out every year

play05:44

so much improvement in image quality

play05:46

Even if you don’t understand game

play05:47

you can see the speed of progress

play05:52

In fact, the overall graphics card market

play05:55

is in a tripartite state

play05:58

Three biggest player – INTEL, NVIDIA, AMD

play06:00

Intel occupied 71% of market share

play06:02

Nvidia is 17% and AMD is 12%.

play06:05

You must be wondering

play06:06

why is there Intel

play06:07

and their market share is way higher

play06:08

Didn’t you just say that Nvidia is the biggest player?

play06:12

Actually

play06:12

this graphics card is not the same graphics card

play06:16

Because graphics card

play06:17

is divided into dedicated and integrated graphics card

play06:19

If you were to compare both

play06:20

then Intel indeed is the biggest player

play06:22

but they sell mostly integrated graphics card

play06:24

Integrated graphics card is placed together with CPU

play06:26

they share memory

play06:27

So Intel taking advantage of their position in CPU industry

play06:29

monopolise the integrated graphics card market share

play06:31

However the integrated graphics card

play06:33

is quite weak, I won’t explain in details here

play06:35

Comparing with NVIDIA’s dedicated graphics card

play06:36

although they are both graphics card but they don’t belong in same market

play06:39

Just from dedicated graphics market POV

play06:40

NVIDIA occupied 80% of the market share

play06:41

Some of you might start to get bored now

play06:43

Alright we know

play06:44

NVIDIA designs graphics card and chips

play06:46

they are very good at it

play06:47

I’ve been talking a lot about

play06:48

3D rendering and gaming

play06:49

How does it have anything to do with AI

play06:51

Why are all these AI companies want to buy graphics card

play06:52

and it has to be NVIDIA’s graphics card

play06:55

Don’t worry, we’ll talk about graphics card characteristic

play06:59

in a computer, Central Processing Unit

play07:01

The purpose of its design is that

play07:02

it can do everything

play07:03

It’s sequential computing

play07:04

and it can carry out very complex logical reasoning

play07:06

However image processing

play07:08

doesn’t care much about sequential computing

play07:09

It’s more concerned with computational volume

play07:10

For example, a 4k video

play07:12

has 10 million pixels

play07:13

Let’s say there’s 30 frames per second

play07:14

then each pixel and frame

play07:16

has to compute correspond colour

play07:17

based on

play07:18

shadow and action

play07:20

This requires non-stop, very fast

play07:22

and massive simple calculations

play07:23

is Graphics Processing Unit

play07:24

It’s especially designed to

play07:26

do this kind of computation

play07:27

The foundation of the chip design

play07:29

is to optimise parallel computing

play07:31

So for CPU

play07:32

is 64 or 128 core at best

play07:34

while GPU could have thousands of core

play07:36

computing together at the same time

play07:37

See this video is giving

play07:38

a very good explanation

play07:39

CPU is like a very precise

play07:41

very strong gun firing one shot at a time

play07:43

The shots are fired in clear order

play07:45

but slow,

play07:46

GPU on the other hand

play07:47

is like having thousands of this gun

play07:49

firing at the same time

play07:55

Due to GPU special feature

play07:56

Jensen Huang started to think about

play07:58

how to maximise

play07:59

its potential

play08:00

It’s definitely as simple as 3D image processing

play08:02

and rendering

play08:03

Can they carry out

play08:04

more of General Purpose Computing

play08:06

General Purpose Computing

play08:07

General Purpose Graphics Processing

play08:09

play08:14

But it’s not simple to use GPU

play08:15

to do this kind of

play08:16

general purpose computing

play08:17

Because the purpose of its design is not for this

play08:19

so the programming is very difficult

play08:20

Not anyone can do this job

play08:22

Jensen Huang was thinking that

play08:22

if graphics card was to realise its greater potential

play08:24

it needs to be programmable

play08:26

By chance

play08:27

he saw a project by a

play08:28

PhD student in Stanford

play08:29

Using C language programming

play08:30

to let GPU do some computing

play08:31

Jensen thought this idea is amazing

play08:34

and offered this fella a job at NVIDIA

play08:36

He appointed him with a very important job

play08:38

and let him lead the team to carry out R&D

play08:40

in making GPU programmable

play08:44

Finally in 2006

play08:46

NVIDIA officially released CUDA

play08:47

making GPU programmable

play08:49

In order to build this CUDA system,

play08:50

NVIDIA invested a large and

play08:53

unreasonable amount of

play08:53

capitals and human resources into it

play08:54

Originally those graphics card that

play08:56

to support CUDA

play08:57

Originally those graphics card that

play08:59

specializes in 3D graphics processing

play09:00

need many top engineers to

play09:02

make it programmable

play09:04

But now anyone can do it by

play09:05

buying an NVIDIA graphics card

play09:07

and use it with

play09:08

CUDA library

play09:09

Through CUDA

play09:09

NVIDIA expanded the boundary of graphics card

play09:11

from gaming and 3D image processing

play09:13

to a whole realm of accelerated computing

play09:15

Like aerospace, biopharmaceuticals

play09:17

Weather forecasting, energy exploration and so on

play09:19

are actually using large amount of NVIDIA graphics card

play09:21

to carry out computation

play09:22

Many have tried to create

play09:23

software like CUDA

play09:25

to challenge Nvidia’s position.

play09:26

But Nvidia

play09:27

has a monopoly on hardware itself.

play09:28

They can try everything

play09:30

to merge their hardware, graphics card

play09:31

with the software, CUDA

play09:33

and make them work really well with each other

play09:33

Through hardware and software merger

play09:35

they formed a very strong moat

play09:37

Does this make you think of another company?

play09:39

That’s Apple

play09:39

They all build something

play09:41

that’s had been bandied about in business world

play09:42

an Ecosystem

play09:42

Microsoft, Adobe

play09:44

they all have their own strong ecosystem

play09:46

The amount of capital Jensen invested in CUDA

play09:48

might sound reasonable to you now

play09:50

It makes sense

play09:50

but if you look at short-term return

play09:52

It is very unreasonable.

play09:53

Wall Street is quite displeased with this thing

play09:56

because although the computation performance of GPU

play09:58

is outstanding

play09:58

but their application is not much

play10:00

For a long period of time

play10:01

it can only focus on area

play10:03

that requires massive computing

play10:04

To put it bluntly, it's not profitable at all

play10:05

Who would’ve thought that AI

play10:07

could be so popular in 2023 right

play10:09

In fact, it is not AI that first makes the

play10:11

accelerated computing capabilities of graphics cards

play10:12

realise its commercial value.

play10:14

It was a coincidence

play10:15

A trend that’s not related to AI at all

play10:16

Something that even someone like Jensen

play10:18

could not have expected

play10:22

The explosion of Bitcoin

play10:23

brought a huge demand for mining

play10:24

Mining in essence is mindless computing

play10:27

to encrypt and decrypt

play10:28

In order to do mining

play10:29

in faster way then you will need to use graphics card

play10:30

and you have to use NVIDIA’s graphics card

play10:32

I believe most of you

play10:33

only know about using graphics card to do

play10:35

computation from the mining part

play10:38

The huge demand for mining

play10:40

is like a godsend for NVIDIA

play10:42

a big big gift

play10:44

This makes NVIDIA’s graphics card

play10:45

in constant short supply for years

play10:46

NVIDIA was very nice

play10:48

they designed a GPU specifically for mining

play10:49

Of course many would say

play10:51

Mining pollutes the environment,

play10:52

meaningless computation

play10:53

all sorts of problems

play10:54

Graphics card capability

play10:55

was seen by many

play10:57

and NVIDIA did make lots of money because of it

play11:01

According to analysts

play11:02

Between 2018 to 2021

play11:03

a period when bitcoin was very popular

play11:05

NVIDIA could earn up to

play11:06

$1 to $3 billion annually

play11:08

NVIDIA’s market cap even

play11:10

surpassed the giant

play11:12

Intel

play11:12

During the time when bitcoin was very popular

play11:13

their market cap even approached trillion dollars

play11:21

Although mining has made NVIDIA a lot of money

play11:23

but it is after all not its main business

play11:25

After the crash of crypto market

play11:26

NVIDIA’s stock plunged by 46%

play11:29

So far mining is only

play11:30

a sideshow at best

play11:31

We all know

play11:32

what’s really been helping NVIDIA lately

play11:34

Artificial Intelligence

play11:38

We mentioned that graphics card

play11:40

massive parallel computing capability

play11:41

is very suitable for deep learning and machine learning

play11:45

AI

play11:47

has to keep learning

play11:48

up to billions of times

play11:49

So GPU parallel computing capability

play11:50

is where it fits in

play11:51

Jensen Huang showed that

play11:52

to train a large language model

play11:54

in comparison with CPU

play11:54

GPU server

play11:55

can complete it at 4% of the cost

play11:57

and 1.2% of power.

play11:58

Therefore, GPU CPU

play11:59

are not on the same level

play12:00

This is determined by its underlying structure.

play12:02

Jensen Huang

play12:02

just wanted to subtly tell you that

play12:04

to train large language model

play12:06

only a fool would use CPU

play12:07

You have to use GPU

play12:08

If you use GPU, you’d better use CUDA.

play12:10

Then you have to buy the graphics card from NVIDIA

play12:13

Actually a decade ago

play12:13

no one knows graphics card can be used in AI

play12:15

AI itself

play12:16

is more theoretical than practical

play12:18

The change occurred in 2012.

play12:19

At that time, there’s a

play12:20

very famous computer competition

play12:22

called ImageNet.

play12:25

Everyone was competing

play12:25

whose algorithm could better

play12:27

recognise the content of the image

play12:28

From No.2 to No.4

play12:30

their error rate

play12:31

is about 26% to 29%.

play12:33

A team called AlexNet

play12:34

made it to 16.4%

play12:36

Ten points ahead of the second place

play12:38

and won the comepetition

play12:39

they used neural networks to train their models

play12:41

with NVIDIA’s graphics card

play12:45

We’ve talked about this in ChatGPT episode

play12:47

the theory of neural network has been around for a long time

play12:49

but it had not been realised

play12:51

the problem was with computational power

play12:52

Thanks to NVIDIA’s graphics card

play12:54

the theory of neural network has been able to realise

play12:56

It made a sensation in the academic world

play12:57

Jensen is also very serious about

play12:59

GPU’s application in AI

play13:01

They went all in after 2012

play13:03

and let NVIDIA’s graphics card

play13:04

do accelerated calculations easily and conveniently

play13:06

Apart from investment focused on CUDA

play13:07

there are also optimisation on AI for graphics card

play13:09

including software support

play13:10

platform support and etc

play13:11

Later there is a consensus in the field of AI

play13:14

If you want to do AI

play13:15

then no doubt

play13:16

you have to buy NVIDIA’s graphics card

play13:21

Google, Amazon, Microsoft, Baidu

play13:24

all used NVIDIA’s graphics card

play13:24

to train the models

play13:26

The most famous one in this wave is

play13:28

NVIDIA’s A100

play13:30

ChatGPT was trained with over 10,000 of

play13:31

graphics cards.

play13:32

A100 has also

play13:33

become the standard for training large model

play13:35

For major AI companies

play13:35

if they couldn’t come out with significant result

play13:37

but still want to catch the wave

play13:38

What do they do

play13:39

They fight in owning graphics card

play13:40

They’ll tell you

play13:41

how many NVIDIA’s

play13:42

A100 graphics card they bought

play13:43

ranging from thousands to tens of thousands

play13:44

So this caused NVIDIA’s graphics card

play13:45

to be in short supply for a long time

play13:47

Their price went up to ten of thousands of dollars

play13:49

Last year NVIDIA released

play13:50

upgraded version of A100, H100

play13:51

It has four to six times the performance of the A100

play13:56

So don’t think that

play13:57

NVIDIA is just in luck

play13:59

just because they soar the highest

play14:00

They spent a decade

play14:02

and a lot of money and effort

play14:04

to build their own wings

play14:05

Just waiting for the right wind

play14:09

Now that the wings are completed

play14:11

the wind has come

play14:12

NVIDIA definitely will want to

play14:14

fly high

play14:17

This is A100 system board released by

play14:19

Jensen Huang 3 years ago

play14:20

a big guy over 20kg

play14:21

became the world’s largest GPU

play14:24

Looks intimidating

play14:26

This year he released another thing

play14:39

DGX GH200 supercomputer.

play14:43

The size is in 1:1 ratio with the actual thing

play14:45

play14:47

They use 240km of optic cable

play14:49

is kilometre

play14:49

that’s 240,000 metres

play14:51

Its weigh is equal to 4 adult elephants

play14:53

with internal memory of 144TB

play14:56

This GPU is connected using

play14:59

technology like NVLink, NVSwitch

play15:01

And these 4 elephants

play15:02

is actually a mega graphics card

play15:08

it is specifically used for AI computation

play15:09

It's expected to be ready by the end of the year

play15:11

Google Cloud, Meta, Microsoft

play15:13

are the first to access it

play15:14

With such big first-mover advantage

play15:15

NVIDIA is not limited to making

play15:17

only graphics card and chip design

play15:18

In 2019, they spent $6.9 billion

play15:20

to acquire an Israeli chip company

play15:22

called Mellanox

play15:22

and came up with something called DPU

play15:24

This thing is very powerful too

play15:25

Jensen Huang said that

play15:26

this acquisition is the most

play15:28

successful strategic decision he has ever made

play15:29

NVIDIA has started to merge

play15:31

GPU, CPU and DPU together

play15:33

and created

play15:33

a server with incredible computing power

play15:35

Supercomputer

play15:36

is slowly creeping towards CPU market

play15:38

and launched a variety of products

play15:39

The eighth supercomputer in the world

play15:41

DGX BasePOD

play15:42

Accelerated superchip

play15:43

GPU Grace Hopper

play15:44

computing platform

play15:44

BlueField-3 DPU and so on

play15:46

Good names right

play15:47

but you probably don’t understand

play15:50

In short, Nvidia

play15:52

covers everything

play15:53

from chips to supercomputers

play15:55

they’ve got it all covered

play15:55

What if you can’t afford it

play15:57

or don’t want to buy so many hardware?

play15:58

It’s okay

play15:59

NVIDIA can lease it to you

play16:01

just get connected online

play16:01

and use NVIDIA’s server

play16:03

This is AI cloud business

play16:08

It provides services to end user

play16:10

as well as upstream companies

play16:11

They came up with a software called cuLitho

play16:13

to help TSMC, ASML

play16:14

these upstream chip makers

play16:16

Improve the performance of Inverse lithography technology by up to 40 times

play16:18

By the way,

play16:19

I have a Spanish friend who told me

play16:21

that the name cuLitho

play16:22

in Spanish sounds like

play16:23

a booty

play16:28

One word

play16:29

Incredible

play16:31

See, NVIDIA expanding across hardware,

play16:34

software, services and etc

play16:36

becoming the biggest winner in this AI wave

play16:41

US sanctions towards China

play16:43

bans the sale of A100 and H100

play16:44

to China.

play16:45

The impact on Nvidia is actually quite big.

play16:46

China occupies a quarter of

play16:47

NVIDIA market

play16:49

They don’t want the cake

play16:50

fly away just because of US government

play16:52

When a reporter interviewed

play16:53

Jensen Huang asking him

play16:54

play16:55

how big is the impact of this for NVIDIA

play16:56

His answer is impermeable

play16:59

First we will definitely work closely

play17:00

with US government’s policy

play17:01

Express their position first

play17:02

At the same time we are trying

play17:04

within the rules

play17:05

to satisfy the demand

play17:07

of Chinese consumers

play17:08

play17:10

He offends neither side

play17:10

NVIDIA released

play17:12

A800 graphics card

play17:13

bypass a bunch of sensitive technology that has been sanctioned

play17:15

This is made especially just for China

play17:16

Alright

play17:17

we understand the entire background of Nvidia,

play17:19

it’s very clear now when you look at their earnings report

play17:21

Currently they divided their business into 4 segments

play17:23

Gaming, data centre

play17:24

automotive, professional graphics processing

play17:26

Previously NVIDIA’s ace business was gaming

play17:28

and data center is about accelerated computing

play17:31

lAI, Cloud service and etc

play17:32

are all in this segment

play17:33

These two segments are the main ones

play17:34

In 2018,

play17:35

Gaming occupied half of their business

play17:37

Data center is a quarter

play17:38

by 2022,

play17:39

Data center occupies 56% of their business

play17:41

Gaming dropped to 33%

play17:43

The earning report released on 24th May

play17:45

their revenue dropped significantly due

play17:47

to sluggish global demand in gaming

play17:48

This is actually what Wall Street expected.

play17:50

However data center segment

play17:51

is strong

play17:52

18% growth from previous quarter

play17:54

Most importantly the revenue in second quarter

play17:55

blinded Wall Street

play17:58

Wall Street predicted that

play17:59

their revenue for Q2 would be $7.2 billion

play18:01

but NVIDIA came out and said

play18:02

Your estimation is incorrect

play18:03

Our revenue is $11 billion, 50% more

play18:06

For automotive segment

play18:08

it is a segment with great potential

play18:09

Not only have to do chip for cars

play18:11

also need to create car system

play18:11

Another big cake

play18:13

But this is at initial stage

play18:15

It’s difficult to say what’s going to happen in future

play18:16

So we’ll not discuss it

play18:17

Before this NVIDIA released Omniverse

play18:19

to bet on Metaverse

play18:20

Although we haven’t seen much return yet

play18:22

but I’ve watched the promotional video

play18:23

It really is quite cool

play18:24

If Metaverse ushered in explosive period

play18:26

Then NVIDIA will again be on of the biggest winner

play18:28

It’s definite

play18:31

Anyway, the general picture is that

play18:33

gaming needs graphics card

play18:34

mining needs graphics card

play18:35

AI computing needs graphics card

play18:37

Need graphics card look for NVIDIA

play18:40

NVIDIA stock price

play18:40

rose by over 1000 fold

play18:42

from its initial IPO

play18:43

becoming the sixth largest company in the world

play18:45

And it looks like it's just getting started

play18:56

From the stories earlier

play18:58

you would’ve believe Jensen is almost godlike

play18:59

Every step is so precise.

play19:01

but he actually made a lot of mistakes along the way

play19:02

For the continuity of the story

play19:04

I ignored it

play19:04

In the early 2000s

play19:06

There was a series of graphics card failures

play19:07

almost defeated by ATI

play19:08

there were also insider trading committed by employees

play19:09

SEC did a thorough investigation on them

play19:11

There was also problem with over-marketing

play19:12

They even tried to enter mobile phone chip market

play19:14

but it was all a failure

play19:15

Looking at their stock price

play19:17

Although it has risen so much as a whole

play19:18

but in 2002 it dropped by 90%

play19:20

2008 dropped 80%

play19:21

with more than 50% of retracement

play19:23

This occurs every 3 to 5 years

play19:24

Everyone knows

play19:26

Jensen is great

play19:27

but he is no fortune-teller

play19:28

His obsession with leather jacket

play19:30

I don’t know what else to say

play19:33

Alright

play19:34

Let’s talk a little bit about NVIDIA’s stock price

play19:36

I’m worried that many would rush

play19:37

to buy

play19:38

NVIDIA’s stock after this video

play19:39

I wouldn’t say you cannot buy

play19:40

but just don’t go buy it

play19:42

on a whim

play19:43

The estimated value for NVIDIA

play19:44

is too high from every perspectives

play19:46

Their Price to Earning ratio is over 200

play19:47

Price to Sales ratio is 38

play19:48

For Apple, Microsoft and Google

play19:50

their Price to Earning is less than 40

play19:51

Tesla is only over 70

play19:52

Compare them with those in same industry

play19:53

Their revenue

play19:55

is less than half of Intel’s

play19:56

but market cap is 7 times more than Intel

play19:59

revenue is similar to AMD

play20:00

but market cap is 5 times more than AMD

play20:02

One word

play20:03

Expensive

play20:04

Of course they are expensive for a reason

play20:05

For a company in an industry like this

play20:06

moreover it’s a monopolistic company

play20:07

Its valuation is no longer one of the

play20:10

main criteria for judging stock price

play20:10

One of the main reason

play20:12

do you know what it is?

play20:12

There are some professional institutions

play20:14

dare not to not invest in NVIDIA

play20:16

because AI is the biggest wave

play20:17

with biggest opportunity in the market

play20:18

And NVIDIA is the biggest player

play20:20

in this AI wave

play20:21

When these funds invested in NVIDIA

play20:22

even if the stock price fall

play20:23

investors wouldn’t complain much

play20:25

but if NVIDIA stock price keeps rising

play20:27

and they didn’t invest because the price is expensive

play20:29

then investors would really be pissed

play20:32

Cathie Wood

play20:34

liquidated NVIDIA stocks

play20:36

in January through her ETF

play20:37

Lately NVIDIA stock price is rising

play20:38

she was reviled in the

play20:40

investment circle

play20:41

Choosing to not invest in this type of company

play20:42

would risk fund their reputation

play20:45

There’s a term in English

play20:45

that best describe this behaviour

play20:47

It is FoMO

play20:48

Fear of Missing Out

play20:50

They fear losing the potential rise

play20:52

more than buying at expensive price

play20:54

This is similar to Tesla two years ago

play20:56

EV has great future

play20:57

and Tesla was

play20:59

the only option

play21:00

even if its valuation

play21:01

is astronomical compare to

play21:02

other automotive companies

play21:03

but funds still continued to buy

play21:05

The behaviour of FoMo

play21:07

in turn pushes the stock price

play21:08

of these companies to an even higher position

play21:09

You can say that

play21:10

for these type of companies

play21:11

they just have huge risk premium

play21:13

but just because of this

play21:14

you can’t say that their stock price is too high

play21:16

That's not necessarily true

play21:16

Because it they can sustain this momentum

play21:18

and maintain this development trend

play21:19

Then their stock price will continue to rise

play21:22

There's a metaphor that many people often use

play21:24

The wave we mentioned

play21:25

is similar to gold rush

play21:27

It’s hard to bet where the gold is

play21:28

or who will find the gold

play21:30

but there is a sure-fire deal

play21:31

and that is you can sell shovels

play21:32

Nvidia

play21:33

is like selling a shovel at digital age

play21:35

at AI age

play21:36

This metaphor sounds right

play21:37

and quite reasonable

play21:39

But whenever I hear this analogy

play21:40

it doesn’t seems right

play21:42

Those who understand economy would know that

play21:43

if you sell shovel during gold rush

play21:46

could you be making a fortune?

play21:47

The first one would perhaps earn a little

play21:49

but if more people are selling it

play21:50

and it’s easy to make shovel

play21:51

then the marginal profit would quickly be eaten up

play21:53

so in most industries

play21:54

if you sell this so-called shovel

play21:55

the upstream production tool

play21:57

Due to its low entry barrier

play21:58

the competition in the industry will be very fierce

play22:00

so the profit margin will be low

play22:02

However for NVIDIA

play22:03

they can achieve a monopoly and a market value

play22:05

of trillions by doing this so-called shovel

play22:06

Its similar to TSMC

play22:08

more basic than shovel, they make hammer

play22:10

a tool to make shovel

play22:11

And they can monopolise that

play22:12

Have you ever thought why?

play22:15

We’ll get deep into the

play22:17

feature of chip industry

play22:18

Have you heard of Moore’s Law?

play22:21

On an integrated circuit

play22:23

the number of transistors that can be accommodated

play22:24

double up every 18 months

play22:26

You can understand it as

play22:27

The speed of chip can be faster every 18 months

play22:29

Actually for the past few years

play22:30

The speed of CPU evolution

play22:31

is difficult to catch up with Moore’s law

play22:33

However Jensen found that

play22:34

the advancement of GPU dedicated graphics card

play22:37

is faster than prediction of Moore’s Law

play22:39

Performance increase more than triples every 2 years

play22:41

This pattern even has

play22:42

its own term

play22:43

Huang's Law

play22:48

No matter which law it is

play22:49

This is a distinctive feature

play22:49

in chip industry

play22:51

The initial cost to invest is very high

play22:53

and requires a lot of talents and equipment

play22:54

The problem is that

play22:55

the iteration rate is too fast

play22:57

This is an industry that is constantly running

play23:01

the iteration rate of this industry is too fast

play23:03

From business perspective

play23:05

this pose a tricky problem

play23:06

it’s difficult for company to build their own moat

play23:08

Moat is very important

play23:10

to a company

play23:11

For a traditional company

play23:11

if they spent a lot of money to build a factory, a railroad

play23:13

then they have large-scale advantage

play23:15

and cost advantage

play23:15

These advantages work as a moat

play23:17

This moat will protect the company for a long time

play23:19

The moat of internet company is stronger

play23:20

Once a network effect is established

play23:22

Wechat, Tiktok, Facebook

play23:23

users are your strong moat.

play23:27

For chip industry

play23:28

there are Moore’s Law, Huang’s Law

play23:30

No matter how good of a chip you release today

play23:32

it’ll be outdated in two years

play23:34

You won’t know who’ll make it next year

play23:35

and take you down with his new technology

play23:37

In the 90s

play23:38

during the graphics card war

play23:40

A company established less than two years

play23:41

3DFX

play23:42

quickly became the industry standard

play23:44

with their Voodoo graphics card

play23:45

5 years later they became dreary

play23:46

and was acquired by NVIDIA

play23:47

This is because their moat

play23:49

hasn’t finished building and was destroyed

play23:50

In this industry, it’s difficult to rely on a single product

play23:52

or a single technology

play23:54

to form a moat that lasts more than 2 years

play23:55

Of course you can slowly accumulate many patents

play23:57

to defend yourself

play23:58

but the truth is

play24:00

people will always find a way

play24:01

to get around those patents

play24:02

it doesn’t provide much protection

play24:04

You have to keep running

play24:06

and run faster than everyone

play24:08

This running ability is your moat

play24:10

In chip industry

play24:11

the moat is R&D

play24:12

You have to develop a complete set of

play24:14

talents, facilities, organisation structure

play24:16

These form the moat for chip company

play24:20

This is why R&D is expensive

play24:21

First is the initial cost is very high

play24:23

requires a lot of talents

play24:24

most importantly you have to continuously

play24:26

keep the iteration and running speed high

play24:27

Not a lot of people can stand it

play24:29

Those tech giants are

play24:30

actually very focused on technological innovation

play24:32

Amazon, Google, Microsoft

play24:34

their R&D investment is 10% to 15%

play24:35

of their revenue

play24:36

For NVIDIA

play24:37

they have to keep invest around

play24:38

25% of their revenue for R&D

play24:41

The Huang’s Law

play24:42

is not a natural phenomenon

play24:44

It is the result of

play24:45

a rat race he created

play24:47

If we were to tell a story

play24:48

we can say NVIDIA

play24:48

has been monopolising graphics card market since 2006

play24:50

end in just one sentence

play24:51

However for many times

play24:52

they overturned

play24:53

their own framework and technology

play24:55

They pushed themselves to release

play24:56

new generation of chip every 6 months

play24:58

For example their latest ray tracing RTX technology

play25:01

completely overturned

play25:02

the program accumulated before

play25:03

They use deep learning method

play25:04

through 1 pixel

play25:05

they guess what the other 8 pixels surround it looks like

play25:08

to speed up image processing.

play25:08

This intense running speed

play25:10

makes only those who run fastest

play25:12

with largest pool of talent

play25:13

and wealthiest companies

play25:15

can make money.

play25:16

The rest could only run behind

play25:17

This is why we often see that

play25:19

after a war in chip industry

play25:20

it always ends in acquisition

play25:21

It's not just the difficulty of developing chips

play25:24

it’s mainly because it’s not economical

play25:25

Even those tech giants

play25:26

with plentiful talents

play25:27

won't set foot in the chip industry

play25:28

unless they are forced to

play25:30

But things are different now

play25:32

The AI field is one of the biggest possible

play25:33

battleground for the tech giants

play25:35

In this battle, graphics card is too important

play25:37

after all it determines computing speed

play25:38

Non of them wish to be

play25:40

strangled by NVIDIA

play25:41

during AI era right

play25:44

Although currently they are

play25:45

NVIDIA’s biggest customer

play25:46

but at the same time they delve into

play25:48

R&D and chip making in full swing

play25:50

Google already developed a chip

play25:51

specifically designed for AI training called TPU

play25:54

According to them,

play25:54

it is more efficient than Nvidia's graphics card.

play25:56

In 2017, Meta already used

play25:58

over 20,000 NVIDIA's graphics cards

play26:00

for AI training

play26:01

but now they are also

play26:02

all in in researching chip, calling it MTIA

play26:05

Actually from 2020

play26:06

NVIDIA has been actively seeking

play26:08

Masayoshi Son’s Softbank

play26:09

to acquire their chip company ARM

play26:11

at the price of $40 billion

play26:13

But Qualcomm, Microsoft and Google

play26:15

are all strongly against it

play26:17

The deal fell through

play26:18

From here you can see that those tech giants

play26:19

are actually very afraid of Nvidia.

play26:23

play26:24

Emerging from the early days graphics card industry

play26:27

is his ability

play26:28

Having laid out CUDA since 20 years ago

play26:31

is his vision

play26:32

Continuous monopoly in graphics card industry

play26:34

is his endurance

play26:35

Meeting the demand for massive computing power in mining

play26:37

is his luck

play26:39

The unchanged fashion style

play26:41

is his devotion

play26:45

When long accumulation period

play26:47

meets the AI wave

play26:48

NVIDIA naturally obtain

play26:49

first-mover advantage

play26:51

Facing new vast potential market

play26:53

and competitors looking to get in on the action

play26:55

Can NVIDIA fight their

play26:56

way out again in this

play26:58

new wave of AI fight?

play26:59

It’s hard to say