一口气了解英伟达,芯片新王凭什么是他?

小Lin说
30 Jun 202327:17

Summary

TLDRThe video traces Nvidia and CEO Jensen Huang's rise from graphics card company to AI powerhouse. It examines how Nvidia leveraged its GPUs for parallel computing, invested heavily in R&D, and developed CUDA software to make GPUs programmable. This enabled breakthroughs in AI. However, Nvidia owes much success to luck from the crypto mining boom. Now Nvidia dominates training of AI models. With AI's vast potential, tech giants aim to compete, but Nvidia's enduring monopoly may be hard to challenge.

Takeaways

  • 😀 Nvidia started off by specializing in graphics processing chips and established itself as a leader in the gaming industry
  • 🤓 Jensen Huang had the vision to make Nvidia's GPUs more versatile through the CUDA software platform
  • 🔥 Cryptocurrency mining created huge demand for Nvidia's GPUs from 2018-2021
  • 🧠 Nvidia GPUs proved enormously effective for AI model training after 2012, cementing Nvidia's dominance
  • 💰 Nvidia's data center segment now generates over half its revenues due to AI demand
  • 👷‍♂️ Chipmakers need to iterate very fast - Nvidia stays ahead through huge R&D investments
  • 🏭 The chip industry has high barriers to entry and tends to concentrate power
  • 🚘 Nvidia is expanding into self-driving car computers and other growth areas
  • 🗺️ Major tech firms see Nvidia as a strategic asset and are wary of its dominance
  • ❓ It remains to be seen if Nvidia can maintain its edge as the AI market evolves

Q & A

  • When was Nvidia founded and by whom?

    -Nvidia was founded in 1993 by Jensen Huang, Chris Malachowsky, and Curtis Priem.

  • What breakthrough product established Nvidia as an industry leader in 1999?

    -The GeForce256 graphics card established Nvidia as an industry leader in 1999. It was the first 'GPU' that was designed specifically for graphics processing.

  • How did the rise of Bitcoin mining benefit Nvidia?

    -The rise in Bitcoin mining brought huge demand for mining hardware. Since graphics cards are very efficient for crypto mining, this greatly benefited Nvidia's graphics card business.

  • What is the significance of Nvidia's CUDA platform?

    -CUDA made Nvidia GPUs programmable and expanded their use for general purpose computing beyond just graphics processing. This laid the foundation for using GPUs in AI.

  • Why did the AlexNet deep learning model represent an important breakthrough?

    -The AlexNet model that won the 2012 ImageNet competition utilized GPUs for its neural network training. This showed the potential of GPU accelerated deep learning.

  • How has Nvidia tailored its GPU products for AI workloads?

    -Nvidia has optimized its GPU architecture, created software frameworks like CUDA, and launched specialized products like the A100 GPU specifically for accelerating AI workloads.

  • What are some key moves Nvidia has made to expand beyond GPUs?

    -Major moves include the acquisitions of Mellanox and ARM, investments in automotive tech, and initiatives in areas like Metaverse.

  • Why is it difficult to build an enduring competitive advantage in the chip industry?

    -The rapid pace of innovation means advantages don't last long before new technologies emerge. Nvidia tries to stay ahead with heavy R&D spending and continuous new product introduction.

  • How are tech giants responding to Nvidia’s dominance in AI chips?

    -Large tech firms like Google, Meta, and Microsoft are conducting their own AI chip research and development to reduce reliance on Nvidia GPUs.

  • What does Nvidia’s future success depend on in the face of rising competition?

    -Nvidia's future hinges on sustaining technological leadership in GPUs via continuous R&D innovation, expanding into new high-growth markets like AI and automotive, and leveraging vertical integration synergies across hardware and software stack.

Outlines

00:00

😀 Early History of Nvidia and Jensen Huang

This paragraph provides background on Nvidia CEO Jensen Huang's early history, including being born in Taiwan, moving to the US at age 9, and working at semiconductor companies. It then discusses how Huang and two others started Nvidia in 1993 to focus on graphics processing chips, obtained venture capital funding, and struggled initially before finally finding success with the Riva 128 graphic card.

05:02

😲 The Graphics Card Duopoly - Nvidia vs AMD

This paragraph explains how through acquisitions and mergers, the graphics card market consolidated to just two players by the early 2000s - Nvidia and AMD. It discusses their continuing competition over the years, with Nvidia slowly dominating more market share and becoming the global GPU leader.

10:03

💰 Bitcoin Mining Drove Massive Demand for Nvidia GPUs

This paragraph highlights how the bitcoin/crypto mining boom created huge unexpected demand and revenue for Nvidia's GPUs due to their parallel computing capabilities being very suitable for the computations needed in mining. However, it notes mining was just a sideshow and not Nvidia's main business.

15:08

🤖 AI and Deep Learning Drove Next Wave of GPU Demand

This paragraph explains how Nvidia GPUs proved hugely beneficial for AI and deep learning due to their high parallel computation ability. It discusses Nvidia's strategic moves to optimize GPUs for AI and build an ecosystem around this, leading to dominance as the go-to provider of GPUs for AI industry leaders.

20:10

😎 Nvidia's Moat, Valuation, and Competition in the AI Chip Wars

This final paragraph analyzes Nvidia's moat, extremely high valuation, and competition from tech giants getting into AI chips. It discusses the high R&D investments needed to compete in rapidly iterating chip markets, and questions whether Nvidia can maintain its leadership in the AI industry battles ahead.

Mindmap

Keywords

💡GPU

GPU stands for Graphics Processing Unit. It is a specialized electronic circuit designed to rapidly process and alter memory to accelerate the creation of images in a frame buffer for output to a display. GPUs are used extensively in deep learning and AI training because they can handle the massive parallel computation required. The video explains how Nvidia focused on GPU technology and innovation to position itself as a leader in AI.

💡CUDA

CUDA is a parallel computing platform and API model created by Nvidia that allows software developers to use GPUs for general purpose processing. The video highlights how Nvidia invested heavily in developing CUDA over many years to make GPUs more flexible, programmable, and useful beyond just graphics rendering. This opened up new commercial applications and was pivotal in enabling GPU-accelerated AI training.

💡AI training

AI training refers to the compute-intensive process of developing, optimizing and enhancing artificial intelligence models, usually based on neural networks. The emergence of AI and growth in model sizes created massive demand for parallel processing power that only GPUs could provide cost-effectively. Nvidia captured this opportunity early which fueled its rise.

💡Moore's Law

Moore's Law predicts that the number of transistors on an integrated circuit doubles about every 18-24 months. This drives exponentially increasing computing power over time. However, the video explains that GPU performance has advanced even faster than Moore's Law, dubbed Huang's Law. This rapid innovation cycle creates huge incumbency advantages.

💡R&D investment

The video emphasizes that continuous, heavy R&D spending is crucial in the chip industry to keep pace with the rate of innovation. While tech giants invest 10-15% in R&D, Nvidia spends about 25% to maintain its leadership. This intense focus on running faster than rivals forms Nvidia's core moat.

💡Acquisitions

Acquisitions, like Nvidia's takeover of 3dfx and Mellanox, allow rapid absorption of outside technology and talent. This is key to sustaining momentum in the fast-changing chip industry. The failed bid to acquire ARM also shows how wary rivals are of moves that expand Nvidia's ecosystem.

💡First mover advantage

By recognizing the potential of GPU computing early, investing in CUDA for years without immediate payoff, and building an ecosystem spanning hardware, software and platforms, Nvidia established first mover advantage just as AI began accelerating. This perfect timing and long runway explains its explosive growth.

💡Mining

Cryptocurrency mining offered an unexpected revenue boost for Nvidia between 2018-2021, earning up to $3 billion annually from surging graphics card demand. However, the video notes this is ultimately a sideshow, with the surge in AI workloads being the primary tailwind.

💡Cloud

Nvidia is expanding into offering GPU-based cloud computing services, allowing customers to rent hardware time rather than own it outright. This cloud business targets both end users and partners, further strengthening Nvidia's end-to-end ecosystem.

💡Competition

While dominant currently, the video warns that tech giants like Google, Meta and Microsoft are wary of reliance on Nvidia and are now racing to design their own AI chips. Emerging competitors could eventually erode Nvidia's lead, spurring its push into new areas like cloud, cars, metaverse, etc to diversify.

Highlights

Nvidia released their first quarter earnings, stock soared by 30% on the day, Market cap reached trillion of dollars, propelled Nvidia into becoming sixth-largest company in the world

Nvidia participated in almost all of global tech innovation, cloud computing, cryptocurrency, Metaverse, Artificial Intelligence, Nvidia is main player in all these

Nvidia A100 graphics cards, an indicator to measure a company’s computational power

Sounds arrogant right, But that’s the truth, With his foresight from over 20 years, and his unchanging outfit style, becomes the Godfather of AI

Nvidia is gradually, eating away AMD’s market share, from 60% in 2010, slowly expanded to , 80% in 2022., becoming global GPU hegemon

Thanks to NVIDIA’s graphics card, the theory of neural network has been able to realise, It made a sensation in the academic world

There is a consensus in the field of AI, If you want to do AI, then no doubt, you have to buy NVIDIA’s graphics card

This makes NVIDIA’s graphics card, in constant short supply for years, NVIDIA was very nice, they designed a GPU specifically for mining

AI, could be so popular in 2023 right, In fact, it is not AI that first makes the, accelerated computing capabilities of graphics cards, realise its commercial value.

Data center occupies 56% of their business, Gaming dropped to 33%, The earning report released on 24th May, their revenue dropped significantly due, to sluggish global demand in gaming

Your estimation is incorrect, Our revenue is $11 billion, 50% more

Cathode Wood, liquidated NVIDIA stocks, in January through her ETF , Lately NVIDIA stock price is rising, she was reviled in the, investment circle

There are some professional institutions, dare not to not invest in NVIDIA , because AI is the biggest wave, with biggest opportunity in the market,

Those who understand economy would know that, if you sell shovel during gold rush, could you be making a fortune? The marginal profit would quickly be eaten up,

Have you ever thought why?, We’ll get deep into the, feature of chip industry,

Transcripts

play00:01

On May 24,

play00:02

Nvidia released their first quarter earnings

play00:04

Many investors are calling this an unprecedented

play00:06

once in a lifetime release

play00:08

Nvidia relied on

play00:09

the dazzling data

play00:10

giving Wall Street a slap in the face

play00:12

The stock soared by 30% on the day

play00:13

Market cap reached trillion of dollars

play00:15

propelled Nvidia into becoming sixth-largest company in the world

play00:17

surpassing Tesla

play00:18

and approaching Amazon

play00:19

Who would have thought that

play00:20

a company selling graphics cards

play00:22

would become the biggest winner

play00:23

in the AI ​​war in 2023

play00:26

For the past few years

play00:27

Nvidia participated in almost all of global tech innovation

play00:29

cloud computing

play00:29

cryptocurrency

play00:30

Metaverse

play00:30

Artificial Intelligence

play00:31

Nvidia is main player in all these

play00:32

Majority of the AI model you’ve heard of

play00:34

are trained with Nvidia graphics cards.

play00:36

Not only are they an industry leader

play00:37

but they monopolised global AI training industry

play00:39

by occupying 95% market share

play00:41

Even the quantity of owning

play00:42

Nvidia A100 graphics cards

play00:44

an indicator to measure a company’s computational power

play00:47

The founder, Jensen Huang said

play00:51

Sounds arrogant right

play00:52

But that’s the truth

play00:57

With his foresight from over 20 years

play00:59

and his unchanging outfit style

play01:01

becomes the Godfather of AI

play01:03

You must be wondering

play01:05

what’s so great about Nvidia

play01:06

How did they monopolise?

play01:07

Why nobody can compete against them

play01:08

Today let Lin take you down a trip

play01:10

to the story of Nvidia

play01:11

We’ll also talk about the secret behind

play01:13

graphic card and chip industry

play01:17

Let’s begin with the rise of

play01:19

Jensen Huang and Nvidia

play01:21

In 1963

play01:22

Jensen Huang was born in Tainan, Taiwan

play01:24

which means he is 60 years old this year

play01:26

At the age of 9, he moved to US

play01:28

after graduating from college

play01:29

he worked in two semiconductor companies

play01:30

focusing on chip design

play01:31

one of them is AMD

play01:33

a company that fought with Jensen

play01:34

for half of their lifetime

play01:35

until now

play01:38

After finishing his Master at Stanford, Jensen turned 30

play01:40

With two other tech guys who were

play01:42

also in their 30s with big ambition

play01:44

they planned to do something big

play01:46

They believed that 3D graphics processing

play01:48

has great potential in the future

play01:49

So in 1993

play01:50

they established NVIDIA

play01:52

specialises in graphic processing chip

play01:54

Jensen Huang is the CEO.

play01:56

he still is today

play01:57

With recommendation from his former boss

play01:58

he obtained $20 million capital investment

play02:00

from Sequoia Capital

play02:02

After all gaming industry needs real-time rendering when playing

play02:04

You can’t say there’s no 3D games

play02:06

but definitely can’t play with normal computer

play02:07

Many games are quite classic

play02:09

but the graphic

play02:09

is basically, well as long as you can see a figure there

play02:11

because this kind of 3D image processing

play02:13

is computationally intensive

play02:14

It was difficult for the CPU at that time

play02:16

Normally they need a specific chip

play02:18

to process the graphics

play02:19

and this chip is graphic card

play02:21

In the early days, the graphics card was very simple

play02:23

it was a 3D accelerator card at most

play02:26

Now when you hear 3D accelerator

play02:28

It sounds like a small workshop business

play02:29

That's right

play02:30

actually at that time 3D games and

play02:31

3D rendering

play02:32

was at seeding stage

play02:33

Graphics card companies like Nvidia

play02:35

are actually a lot

play02:35

at least 50-60 companies

play02:37

There was no uniform standard for both hardware and software

play02:38

Whoever comes out with better research

play02:39

can publish their own standard.

play02:41

Often times

play02:41

when you finally came up with a graphics card

play02:43

but it actually is

play02:44

not compatible with other people’s standard

play02:46

Just one word

play02:47

Chaotic

play02:48

One of the most famous company at the time

play02:50

is a company called 3dfx

play02:51

It was established in 1994,

play02:53

a year later than Nvidia

play02:54

At that time, a graphics card called Voodoo

play02:56

was all the rage

play02:57

Many popular games during that time

play02:58

relied on Voodoo graphic card

play03:00

It wasn’t going so well with NVIDIA

play03:02

Although they obtained capitals

play03:03

and had very professional team

play03:05

but their NV1 was not successful

play03:07

NV2 was aborted

play03:10

By 1997

play03:11

NVIDIA was hanging on by a thread

play03:12

9 more months till they run out of money

play03:14

The company downsized its staff from 100

play03:15

till there were about 30 people left

play03:17

Jensen Huang took a gamble

play03:19

Just when the company only had

play03:20

6 months of operating capitals

play03:21

They released Riva128 graphic card carrying NV3

play03:24

With its good price/performance ratio,

play03:25

they finally occupy a place in the market

play03:28

and allow NVIDIA to survive

play03:30

Actually Jensen and his team

play03:32

are very strong in their R&D

play03:33

After they figure out

play03:34

the market direction

play03:36

they managed to enter fast lane quickly

play03:38

They reached a long-term strategic cooperation with TSMC

play03:40

At the same time they cooperate closely with Microsoft

play03:41

supported the Direct 3D display standard introduced by Microsoft

play03:44

Finally they rose in the sea

play03:46

of competition in graphics card industry

play03:46

With RivaTNT

play03:48

it helped

play03:49

NVIDIA to become

play03:50

industry leader in graphics card industry

play03:52

In 1999, NVIDIA

play03:53

successfully listed on NASDAQ

play03:56

After listing

play03:57

NVIDIA had more money and

play03:58

in September 1999, they released

play04:00

the epoch-making GeForce256

play04:02

which made them the leading force among their competitors

play04:05

I believe that gamers

play04:06

should be familiar with

play04:07

this GeForce series.

play04:08

It has also become Nvidia’s

play04:10

the flagship line of consumer graphics card

play04:11

Jensen Huang named GeForce256

play04:13

the world’s first GPU

play04:15

The first truly dedicated graphics card

play04:17

The claim is

play04:18

basically accepted by everyone

play04:19

Hence some people might generally claimed that

play04:21

Nvidia invented the graphics card.

play04:23

As for why this dedicated graphics card is so powerful,

play04:25

we will talk about it later.

play04:27

At that time, Microsoft happened to be working on Xbox

play04:29

With GeForce 256

play04:30

powerful performance, NVIDIA managed to

play04:32

score $200 million worth of order

play04:34

After building image processing hardware for Xbox

play04:36

they then score another with Sony’s PS3

play04:38

From 1999 to 2002,

play04:40

Nvidia’s revenue

play04:40

almost doubled every year to

play04:42

$2 billion

play04:43

becoming the only player in the market

play04:45

They started to acquire

play04:46

competitors in the same industry

play04:47

one of them we mentioned earlier

play04:49

the once popular 3dfx

play04:51

Another major player in the market

play04:53

ATi was acquired by AMD

play04:56

And so in the early 2000s

play04:57

after a series of merger and acquisition

play04:59

happening in the market

play05:01

There are only two players left in the market

play05:03

Nvidia and AMD.

play05:04

Until now, the dedicated graphics card market

play05:06

has been dominated by these two companies.

play05:08

I don’t know if you have heard of

play05:09

the legendary N card and A card

play05:11

actually refer to the graphics cards of these two companies.

play05:13

Gamers started to argue

play05:15

whether N card or A card is better

play05:16

It doesn’t matter which one is better

play05:17

after that there isn’t a

play05:18

third company exists in the market

play05:19

like B card of C card or X card

play05:21

It’s a two-horse race

play05:22

However NVIDIA is gradually

play05:25

eating away AMD’s market share

play05:27

from 60% in 2010

play05:28

slowly expanded to

play05:29

80% in 2022.

play05:30

becoming global GPU hegemon

play05:33

The speed of development of GPU technology itself

play05:35

is also jaw-dropping

play05:36

The rapid development of gaming industry

play05:38

has supported Nvidia,

play05:39

At the same time, NVIDIA’s graphics card development

play05:41

has promoted the development of the gaming industry.

play05:43

Look at the new games coming out every year

play05:44

so much improvement in image quality

play05:46

Even if you don’t understand game

play05:47

you can see the speed of progress

play05:52

In fact, the overall graphics card market

play05:55

is in a tripartite state

play05:58

Three biggest player – INTEL, NVIDIA, AMD

play06:00

Intel occupied 71% of market share

play06:02

Nvidia is 17% and AMD is 12%.

play06:05

You must be wondering

play06:06

why is there Intel

play06:07

and their market share is way higher

play06:08

Didn’t you just say that Nvidia is the biggest player?

play06:12

Actually

play06:12

this graphics card is not the same graphics card

play06:16

Because graphics card

play06:17

is divided into dedicated and integrated graphics card

play06:19

If you were to compare both

play06:20

then Intel indeed is the biggest player

play06:22

but they sell mostly integrated graphics card

play06:24

Integrated graphics card is placed together with CPU

play06:26

they share memory

play06:27

So Intel taking advantage of their position in CPU industry

play06:29

monopolise the integrated graphics card market share

play06:31

However the integrated graphics card

play06:33

is quite weak, I won’t explain in details here

play06:35

Comparing with NVIDIA’s dedicated graphics card

play06:36

although they are both graphics card but they don’t belong in same market

play06:39

Just from dedicated graphics market POV

play06:40

NVIDIA occupied 80% of the market share

play06:41

Some of you might start to get bored now

play06:43

Alright we know

play06:44

NVIDIA designs graphics card and chips

play06:46

they are very good at it

play06:47

I’ve been talking a lot about

play06:48

3D rendering and gaming

play06:49

How does it have anything to do with AI

play06:51

Why are all these AI companies want to buy graphics card

play06:52

and it has to be NVIDIA’s graphics card

play06:55

Don’t worry, we’ll talk about graphics card characteristic

play06:59

in a computer, Central Processing Unit

play07:01

The purpose of its design is that

play07:02

it can do everything

play07:03

It’s sequential computing

play07:04

and it can carry out very complex logical reasoning

play07:06

However image processing

play07:08

doesn’t care much about sequential computing

play07:09

It’s more concerned with computational volume

play07:10

For example, a 4k video

play07:12

has 10 million pixels

play07:13

Let’s say there’s 30 frames per second

play07:14

then each pixel and frame

play07:16

has to compute correspond colour

play07:17

based on

play07:18

shadow and action

play07:20

This requires non-stop, very fast

play07:22

and massive simple calculations

play07:23

is Graphics Processing Unit

play07:24

It’s especially designed to

play07:26

do this kind of computation

play07:27

The foundation of the chip design

play07:29

is to optimise parallel computing

play07:31

So for CPU

play07:32

is 64 or 128 core at best

play07:34

while GPU could have thousands of core

play07:36

computing together at the same time

play07:37

See this video is giving

play07:38

a very good explanation

play07:39

CPU is like a very precise

play07:41

very strong gun firing one shot at a time

play07:43

The shots are fired in clear order

play07:45

but slow,

play07:46

GPU on the other hand

play07:47

is like having thousands of this gun

play07:49

firing at the same time

play07:55

Due to GPU special feature

play07:56

Jensen Huang started to think about

play07:58

how to maximise

play07:59

its potential

play08:00

It’s definitely as simple as 3D image processing

play08:02

and rendering

play08:03

Can they carry out

play08:04

more of General Purpose Computing

play08:06

General Purpose Computing

play08:07

General Purpose Graphics Processing

play08:09

play08:14

But it’s not simple to use GPU

play08:15

to do this kind of

play08:16

general purpose computing

play08:17

Because the purpose of its design is not for this

play08:19

so the programming is very difficult

play08:20

Not anyone can do this job

play08:22

Jensen Huang was thinking that

play08:22

if graphics card was to realise its greater potential

play08:24

it needs to be programmable

play08:26

By chance

play08:27

he saw a project by a

play08:28

PhD student in Stanford

play08:29

Using C language programming

play08:30

to let GPU do some computing

play08:31

Jensen thought this idea is amazing

play08:34

and offered this fella a job at NVIDIA

play08:36

He appointed him with a very important job

play08:38

and let him lead the team to carry out R&D

play08:40

in making GPU programmable

play08:44

Finally in 2006

play08:46

NVIDIA officially released CUDA

play08:47

making GPU programmable

play08:49

In order to build this CUDA system,

play08:50

NVIDIA invested a large and

play08:53

unreasonable amount of

play08:53

capitals and human resources into it

play08:54

Originally those graphics card that

play08:56

to support CUDA

play08:57

Originally those graphics card that

play08:59

specializes in 3D graphics processing

play09:00

need many top engineers to

play09:02

make it programmable

play09:04

But now anyone can do it by

play09:05

buying an NVIDIA graphics card

play09:07

and use it with

play09:08

CUDA library

play09:09

Through CUDA

play09:09

NVIDIA expanded the boundary of graphics card

play09:11

from gaming and 3D image processing

play09:13

to a whole realm of accelerated computing

play09:15

Like aerospace, biopharmaceuticals

play09:17

Weather forecasting, energy exploration and so on

play09:19

are actually using large amount of NVIDIA graphics card

play09:21

to carry out computation

play09:22

Many have tried to create

play09:23

software like CUDA

play09:25

to challenge Nvidia’s position.

play09:26

But Nvidia

play09:27

has a monopoly on hardware itself.

play09:28

They can try everything

play09:30

to merge their hardware, graphics card

play09:31

with the software, CUDA

play09:33

and make them work really well with each other

play09:33

Through hardware and software merger

play09:35

they formed a very strong moat

play09:37

Does this make you think of another company?

play09:39

That’s Apple

play09:39

They all build something

play09:41

that’s had been bandied about in business world

play09:42

an Ecosystem

play09:42

Microsoft, Adobe

play09:44

they all have their own strong ecosystem

play09:46

The amount of capital Jensen invested in CUDA

play09:48

might sound reasonable to you now

play09:50

It makes sense

play09:50

but if you look at short-term return

play09:52

It is very unreasonable.

play09:53

Wall Street is quite displeased with this thing

play09:56

because although the computation performance of GPU

play09:58

is outstanding

play09:58

but their application is not much

play10:00

For a long period of time

play10:01

it can only focus on area

play10:03

that requires massive computing

play10:04

To put it bluntly, it's not profitable at all

play10:05

Who would’ve thought that AI

play10:07

could be so popular in 2023 right

play10:09

In fact, it is not AI that first makes the

play10:11

accelerated computing capabilities of graphics cards

play10:12

realise its commercial value.

play10:14

It was a coincidence

play10:15

A trend that’s not related to AI at all

play10:16

Something that even someone like Jensen

play10:18

could not have expected

play10:22

The explosion of Bitcoin

play10:23

brought a huge demand for mining

play10:24

Mining in essence is mindless computing

play10:27

to encrypt and decrypt

play10:28

In order to do mining

play10:29

in faster way then you will need to use graphics card

play10:30

and you have to use NVIDIA’s graphics card

play10:32

I believe most of you

play10:33

only know about using graphics card to do

play10:35

computation from the mining part

play10:38

The huge demand for mining

play10:40

is like a godsend for NVIDIA

play10:42

a big big gift

play10:44

This makes NVIDIA’s graphics card

play10:45

in constant short supply for years

play10:46

NVIDIA was very nice

play10:48

they designed a GPU specifically for mining

play10:49

Of course many would say

play10:51

Mining pollutes the environment,

play10:52

meaningless computation

play10:53

all sorts of problems

play10:54

Graphics card capability

play10:55

was seen by many

play10:57

and NVIDIA did make lots of money because of it

play11:01

According to analysts

play11:02

Between 2018 to 2021

play11:03

a period when bitcoin was very popular

play11:05

NVIDIA could earn up to

play11:06

$1 to $3 billion annually

play11:08

NVIDIA’s market cap even

play11:10

surpassed the giant

play11:12

Intel

play11:12

During the time when bitcoin was very popular

play11:13

their market cap even approached trillion dollars

play11:21

Although mining has made NVIDIA a lot of money

play11:23

but it is after all not its main business

play11:25

After the crash of crypto market

play11:26

NVIDIA’s stock plunged by 46%

play11:29

So far mining is only

play11:30

a sideshow at best

play11:31

We all know

play11:32

what’s really been helping NVIDIA lately

play11:34

Artificial Intelligence

play11:38

We mentioned that graphics card

play11:40

massive parallel computing capability

play11:41

is very suitable for deep learning and machine learning

play11:45

AI

play11:47

has to keep learning

play11:48

up to billions of times

play11:49

So GPU parallel computing capability

play11:50

is where it fits in

play11:51

Jensen Huang showed that

play11:52

to train a large language model

play11:54

in comparison with CPU

play11:54

GPU server

play11:55

can complete it at 4% of the cost

play11:57

and 1.2% of power.

play11:58

Therefore, GPU CPU

play11:59

are not on the same level

play12:00

This is determined by its underlying structure.

play12:02

Jensen Huang

play12:02

just wanted to subtly tell you that

play12:04

to train large language model

play12:06

only a fool would use CPU

play12:07

You have to use GPU

play12:08

If you use GPU, you’d better use CUDA.

play12:10

Then you have to buy the graphics card from NVIDIA

play12:13

Actually a decade ago

play12:13

no one knows graphics card can be used in AI

play12:15

AI itself

play12:16

is more theoretical than practical

play12:18

The change occurred in 2012.

play12:19

At that time, there’s a

play12:20

very famous computer competition

play12:22

called ImageNet.

play12:25

Everyone was competing

play12:25

whose algorithm could better

play12:27

recognise the content of the image

play12:28

From No.2 to No.4

play12:30

their error rate

play12:31

is about 26% to 29%.

play12:33

A team called AlexNet

play12:34

made it to 16.4%

play12:36

Ten points ahead of the second place

play12:38

and won the comepetition

play12:39

they used neural networks to train their models

play12:41

with NVIDIA’s graphics card

play12:45

We’ve talked about this in ChatGPT episode

play12:47

the theory of neural network has been around for a long time

play12:49

but it had not been realised

play12:51

the problem was with computational power

play12:52

Thanks to NVIDIA’s graphics card

play12:54

the theory of neural network has been able to realise

play12:56

It made a sensation in the academic world

play12:57

Jensen is also very serious about

play12:59

GPU’s application in AI

play13:01

They went all in after 2012

play13:03

and let NVIDIA’s graphics card

play13:04

do accelerated calculations easily and conveniently

play13:06

Apart from investment focused on CUDA

play13:07

there are also optimisation on AI for graphics card

play13:09

including software support

play13:10

platform support and etc

play13:11

Later there is a consensus in the field of AI

play13:14

If you want to do AI

play13:15

then no doubt

play13:16

you have to buy NVIDIA’s graphics card

play13:21

Google, Amazon, Microsoft, Baidu

play13:24

all used NVIDIA’s graphics card

play13:24

to train the models

play13:26

The most famous one in this wave is

play13:28

NVIDIA’s A100

play13:30

ChatGPT was trained with over 10,000 of

play13:31

graphics cards.

play13:32

A100 has also

play13:33

become the standard for training large model

play13:35

For major AI companies

play13:35

if they couldn’t come out with significant result

play13:37

but still want to catch the wave

play13:38

What do they do

play13:39

They fight in owning graphics card

play13:40

They’ll tell you

play13:41

how many NVIDIA’s

play13:42

A100 graphics card they bought

play13:43

ranging from thousands to tens of thousands

play13:44

So this caused NVIDIA’s graphics card

play13:45

to be in short supply for a long time

play13:47

Their price went up to ten of thousands of dollars

play13:49

Last year NVIDIA released

play13:50

upgraded version of A100, H100

play13:51

It has four to six times the performance of the A100

play13:56

So don’t think that

play13:57

NVIDIA is just in luck

play13:59

just because they soar the highest

play14:00

They spent a decade

play14:02

and a lot of money and effort

play14:04

to build their own wings

play14:05

Just waiting for the right wind

play14:09

Now that the wings are completed

play14:11

the wind has come

play14:12

NVIDIA definitely will want to

play14:14

fly high

play14:17

This is A100 system board released by

play14:19

Jensen Huang 3 years ago

play14:20

a big guy over 20kg

play14:21

became the world’s largest GPU

play14:24

Looks intimidating

play14:26

This year he released another thing

play14:39

DGX GH200 supercomputer.

play14:43

The size is in 1:1 ratio with the actual thing

play14:45

play14:47

They use 240km of optic cable

play14:49

is kilometre

play14:49

that’s 240,000 metres

play14:51

Its weigh is equal to 4 adult elephants

play14:53

with internal memory of 144TB

play14:56

This GPU is connected using

play14:59

technology like NVLink, NVSwitch

play15:01

And these 4 elephants

play15:02

is actually a mega graphics card

play15:08

it is specifically used for AI computation

play15:09

It's expected to be ready by the end of the year

play15:11

Google Cloud, Meta, Microsoft

play15:13

are the first to access it

play15:14

With such big first-mover advantage

play15:15

NVIDIA is not limited to making

play15:17

only graphics card and chip design

play15:18

In 2019, they spent $6.9 billion

play15:20

to acquire an Israeli chip company

play15:22

called Mellanox

play15:22

and came up with something called DPU

play15:24

This thing is very powerful too

play15:25

Jensen Huang said that

play15:26

this acquisition is the most

play15:28

successful strategic decision he has ever made

play15:29

NVIDIA has started to merge

play15:31

GPU, CPU and DPU together

play15:33

and created

play15:33

a server with incredible computing power

play15:35

Supercomputer

play15:36

is slowly creeping towards CPU market

play15:38

and launched a variety of products

play15:39

The eighth supercomputer in the world

play15:41

DGX BasePOD

play15:42

Accelerated superchip

play15:43

GPU Grace Hopper

play15:44

computing platform

play15:44

BlueField-3 DPU and so on

play15:46

Good names right

play15:47

but you probably don’t understand

play15:50

In short, Nvidia

play15:52

covers everything

play15:53

from chips to supercomputers

play15:55

they’ve got it all covered

play15:55

What if you can’t afford it

play15:57

or don’t want to buy so many hardware?

play15:58

It’s okay

play15:59

NVIDIA can lease it to you

play16:01

just get connected online

play16:01

and use NVIDIA’s server

play16:03

This is AI cloud business

play16:08

It provides services to end user

play16:10

as well as upstream companies

play16:11

They came up with a software called cuLitho

play16:13

to help TSMC, ASML

play16:14

these upstream chip makers

play16:16

Improve the performance of Inverse lithography technology by up to 40 times

play16:18

By the way,

play16:19

I have a Spanish friend who told me

play16:21

that the name cuLitho

play16:22

in Spanish sounds like

play16:23

a booty

play16:28

One word

play16:29

Incredible

play16:31

See, NVIDIA expanding across hardware,

play16:34

software, services and etc

play16:36

becoming the biggest winner in this AI wave

play16:41

US sanctions towards China

play16:43

bans the sale of A100 and H100

play16:44

to China.

play16:45

The impact on Nvidia is actually quite big.

play16:46

China occupies a quarter of

play16:47

NVIDIA market

play16:49

They don’t want the cake

play16:50

fly away just because of US government

play16:52

When a reporter interviewed

play16:53

Jensen Huang asking him

play16:54

play16:55

how big is the impact of this for NVIDIA

play16:56

His answer is impermeable

play16:59

First we will definitely work closely

play17:00

with US government’s policy

play17:01

Express their position first

play17:02

At the same time we are trying

play17:04

within the rules

play17:05

to satisfy the demand

play17:07

of Chinese consumers

play17:08

play17:10

He offends neither side

play17:10

NVIDIA released

play17:12

A800 graphics card

play17:13

bypass a bunch of sensitive technology that has been sanctioned

play17:15

This is made especially just for China

play17:16

Alright

play17:17

we understand the entire background of Nvidia,

play17:19

it’s very clear now when you look at their earnings report

play17:21

Currently they divided their business into 4 segments

play17:23

Gaming, data centre

play17:24

automotive, professional graphics processing

play17:26

Previously NVIDIA’s ace business was gaming

play17:28

and data center is about accelerated computing

play17:31

lAI, Cloud service and etc

play17:32

are all in this segment

play17:33

These two segments are the main ones

play17:34

In 2018,

play17:35

Gaming occupied half of their business

play17:37

Data center is a quarter

play17:38

by 2022,

play17:39

Data center occupies 56% of their business

play17:41

Gaming dropped to 33%

play17:43

The earning report released on 24th May

play17:45

their revenue dropped significantly due

play17:47

to sluggish global demand in gaming

play17:48

This is actually what Wall Street expected.

play17:50

However data center segment

play17:51

is strong

play17:52

18% growth from previous quarter

play17:54

Most importantly the revenue in second quarter

play17:55

blinded Wall Street

play17:58

Wall Street predicted that

play17:59

their revenue for Q2 would be $7.2 billion

play18:01

but NVIDIA came out and said

play18:02

Your estimation is incorrect

play18:03

Our revenue is $11 billion, 50% more

play18:06

For automotive segment

play18:08

it is a segment with great potential

play18:09

Not only have to do chip for cars

play18:11

also need to create car system

play18:11

Another big cake

play18:13

But this is at initial stage

play18:15

It’s difficult to say what’s going to happen in future

play18:16

So we’ll not discuss it

play18:17

Before this NVIDIA released Omniverse

play18:19

to bet on Metaverse

play18:20

Although we haven’t seen much return yet

play18:22

but I’ve watched the promotional video

play18:23

It really is quite cool

play18:24

If Metaverse ushered in explosive period

play18:26

Then NVIDIA will again be on of the biggest winner

play18:28

It’s definite

play18:31

Anyway, the general picture is that

play18:33

gaming needs graphics card

play18:34

mining needs graphics card

play18:35

AI computing needs graphics card

play18:37

Need graphics card look for NVIDIA

play18:40

NVIDIA stock price

play18:40

rose by over 1000 fold

play18:42

from its initial IPO

play18:43

becoming the sixth largest company in the world

play18:45

And it looks like it's just getting started

play18:56

From the stories earlier

play18:58

you would’ve believe Jensen is almost godlike

play18:59

Every step is so precise.

play19:01

but he actually made a lot of mistakes along the way

play19:02

For the continuity of the story

play19:04

I ignored it

play19:04

In the early 2000s

play19:06

There was a series of graphics card failures

play19:07

almost defeated by ATI

play19:08

there were also insider trading committed by employees

play19:09

SEC did a thorough investigation on them

play19:11

There was also problem with over-marketing

play19:12

They even tried to enter mobile phone chip market

play19:14

but it was all a failure

play19:15

Looking at their stock price

play19:17

Although it has risen so much as a whole

play19:18

but in 2002 it dropped by 90%

play19:20

2008 dropped 80%

play19:21

with more than 50% of retracement

play19:23

This occurs every 3 to 5 years

play19:24

Everyone knows

play19:26

Jensen is great

play19:27

but he is no fortune-teller

play19:28

His obsession with leather jacket

play19:30

I don’t know what else to say

play19:33

Alright

play19:34

Let’s talk a little bit about NVIDIA’s stock price

play19:36

I’m worried that many would rush

play19:37

to buy

play19:38

NVIDIA’s stock after this video

play19:39

I wouldn’t say you cannot buy

play19:40

but just don’t go buy it

play19:42

on a whim

play19:43

The estimated value for NVIDIA

play19:44

is too high from every perspectives

play19:46

Their Price to Earning ratio is over 200

play19:47

Price to Sales ratio is 38

play19:48

For Apple, Microsoft and Google

play19:50

their Price to Earning is less than 40

play19:51

Tesla is only over 70

play19:52

Compare them with those in same industry

play19:53

Their revenue

play19:55

is less than half of Intel’s

play19:56

but market cap is 7 times more than Intel

play19:59

revenue is similar to AMD

play20:00

but market cap is 5 times more than AMD

play20:02

One word

play20:03

Expensive

play20:04

Of course they are expensive for a reason

play20:05

For a company in an industry like this

play20:06

moreover it’s a monopolistic company

play20:07

Its valuation is no longer one of the

play20:10

main criteria for judging stock price

play20:10

One of the main reason

play20:12

do you know what it is?

play20:12

There are some professional institutions

play20:14

dare not to not invest in NVIDIA

play20:16

because AI is the biggest wave

play20:17

with biggest opportunity in the market

play20:18

And NVIDIA is the biggest player

play20:20

in this AI wave

play20:21

When these funds invested in NVIDIA

play20:22

even if the stock price fall

play20:23

investors wouldn’t complain much

play20:25

but if NVIDIA stock price keeps rising

play20:27

and they didn’t invest because the price is expensive

play20:29

then investors would really be pissed

play20:32

Cathie Wood

play20:34

liquidated NVIDIA stocks

play20:36

in January through her ETF

play20:37

Lately NVIDIA stock price is rising

play20:38

she was reviled in the

play20:40

investment circle

play20:41

Choosing to not invest in this type of company

play20:42

would risk fund their reputation

play20:45

There’s a term in English

play20:45

that best describe this behaviour

play20:47

It is FoMO

play20:48

Fear of Missing Out

play20:50

They fear losing the potential rise

play20:52

more than buying at expensive price

play20:54

This is similar to Tesla two years ago

play20:56

EV has great future

play20:57

and Tesla was

play20:59

the only option

play21:00

even if its valuation

play21:01

is astronomical compare to

play21:02

other automotive companies

play21:03

but funds still continued to buy

play21:05

The behaviour of FoMo

play21:07

in turn pushes the stock price

play21:08

of these companies to an even higher position

play21:09

You can say that

play21:10

for these type of companies

play21:11

they just have huge risk premium

play21:13

but just because of this

play21:14

you can’t say that their stock price is too high

play21:16

That's not necessarily true

play21:16

Because it they can sustain this momentum

play21:18

and maintain this development trend

play21:19

Then their stock price will continue to rise

play21:22

There's a metaphor that many people often use

play21:24

The wave we mentioned

play21:25

is similar to gold rush

play21:27

It’s hard to bet where the gold is

play21:28

or who will find the gold

play21:30

but there is a sure-fire deal

play21:31

and that is you can sell shovels

play21:32

Nvidia

play21:33

is like selling a shovel at digital age

play21:35

at AI age

play21:36

This metaphor sounds right

play21:37

and quite reasonable

play21:39

But whenever I hear this analogy

play21:40

it doesn’t seems right

play21:42

Those who understand economy would know that

play21:43

if you sell shovel during gold rush

play21:46

could you be making a fortune?

play21:47

The first one would perhaps earn a little

play21:49

but if more people are selling it

play21:50

and it’s easy to make shovel

play21:51

then the marginal profit would quickly be eaten up

play21:53

so in most industries

play21:54

if you sell this so-called shovel

play21:55

the upstream production tool

play21:57

Due to its low entry barrier

play21:58

the competition in the industry will be very fierce

play22:00

so the profit margin will be low

play22:02

However for NVIDIA

play22:03

they can achieve a monopoly and a market value

play22:05

of trillions by doing this so-called shovel

play22:06

Its similar to TSMC

play22:08

more basic than shovel, they make hammer

play22:10

a tool to make shovel

play22:11

And they can monopolise that

play22:12

Have you ever thought why?

play22:15

We’ll get deep into the

play22:17

feature of chip industry

play22:18

Have you heard of Moore’s Law?

play22:21

On an integrated circuit

play22:23

the number of transistors that can be accommodated

play22:24

double up every 18 months

play22:26

You can understand it as

play22:27

The speed of chip can be faster every 18 months

play22:29

Actually for the past few years

play22:30

The speed of CPU evolution

play22:31

is difficult to catch up with Moore’s law

play22:33

However Jensen found that

play22:34

the advancement of GPU dedicated graphics card

play22:37

is faster than prediction of Moore’s Law

play22:39

Performance increase more than triples every 2 years

play22:41

This pattern even has

play22:42

its own term

play22:43

Huang's Law

play22:48

No matter which law it is

play22:49

This is a distinctive feature

play22:49

in chip industry

play22:51

The initial cost to invest is very high

play22:53

and requires a lot of talents and equipment

play22:54

The problem is that

play22:55

the iteration rate is too fast

play22:57

This is an industry that is constantly running

play23:01

the iteration rate of this industry is too fast

play23:03

From business perspective

play23:05

this pose a tricky problem

play23:06

it’s difficult for company to build their own moat

play23:08

Moat is very important

play23:10

to a company

play23:11

For a traditional company

play23:11

if they spent a lot of money to build a factory, a railroad

play23:13

then they have large-scale advantage

play23:15

and cost advantage

play23:15

These advantages work as a moat

play23:17

This moat will protect the company for a long time

play23:19

The moat of internet company is stronger

play23:20

Once a network effect is established

play23:22

Wechat, Tiktok, Facebook

play23:23

users are your strong moat.

play23:27

For chip industry

play23:28

there are Moore’s Law, Huang’s Law

play23:30

No matter how good of a chip you release today

play23:32

it’ll be outdated in two years

play23:34

You won’t know who’ll make it next year

play23:35

and take you down with his new technology

play23:37

In the 90s

play23:38

during the graphics card war

play23:40

A company established less than two years

play23:41

3DFX

play23:42

quickly became the industry standard

play23:44

with their Voodoo graphics card

play23:45

5 years later they became dreary

play23:46

and was acquired by NVIDIA

play23:47

This is because their moat

play23:49

hasn’t finished building and was destroyed

play23:50

In this industry, it’s difficult to rely on a single product

play23:52

or a single technology

play23:54

to form a moat that lasts more than 2 years

play23:55

Of course you can slowly accumulate many patents

play23:57

to defend yourself

play23:58

but the truth is

play24:00

people will always find a way

play24:01

to get around those patents

play24:02

it doesn’t provide much protection

play24:04

You have to keep running

play24:06

and run faster than everyone

play24:08

This running ability is your moat

play24:10

In chip industry

play24:11

the moat is R&D

play24:12

You have to develop a complete set of

play24:14

talents, facilities, organisation structure

play24:16

These form the moat for chip company

play24:20

This is why R&D is expensive

play24:21

First is the initial cost is very high

play24:23

requires a lot of talents

play24:24

most importantly you have to continuously

play24:26

keep the iteration and running speed high

play24:27

Not a lot of people can stand it

play24:29

Those tech giants are

play24:30

actually very focused on technological innovation

play24:32

Amazon, Google, Microsoft

play24:34

their R&D investment is 10% to 15%

play24:35

of their revenue

play24:36

For NVIDIA

play24:37

they have to keep invest around

play24:38

25% of their revenue for R&D

play24:41

The Huang’s Law

play24:42

is not a natural phenomenon

play24:44

It is the result of

play24:45

a rat race he created

play24:47

If we were to tell a story

play24:48

we can say NVIDIA

play24:48

has been monopolising graphics card market since 2006

play24:50

end in just one sentence

play24:51

However for many times

play24:52

they overturned

play24:53

their own framework and technology

play24:55

They pushed themselves to release

play24:56

new generation of chip every 6 months

play24:58

For example their latest ray tracing RTX technology

play25:01

completely overturned

play25:02

the program accumulated before

play25:03

They use deep learning method

play25:04

through 1 pixel

play25:05

they guess what the other 8 pixels surround it looks like

play25:08

to speed up image processing.

play25:08

This intense running speed

play25:10

makes only those who run fastest

play25:12

with largest pool of talent

play25:13

and wealthiest companies

play25:15

can make money.

play25:16

The rest could only run behind

play25:17

This is why we often see that

play25:19

after a war in chip industry

play25:20

it always ends in acquisition

play25:21

It's not just the difficulty of developing chips

play25:24

it’s mainly because it’s not economical

play25:25

Even those tech giants

play25:26

with plentiful talents

play25:27

won't set foot in the chip industry

play25:28

unless they are forced to

play25:30

But things are different now

play25:32

The AI field is one of the biggest possible

play25:33

battleground for the tech giants

play25:35

In this battle, graphics card is too important

play25:37

after all it determines computing speed

play25:38

Non of them wish to be

play25:40

strangled by NVIDIA

play25:41

during AI era right

play25:44

Although currently they are

play25:45

NVIDIA’s biggest customer

play25:46

but at the same time they delve into

play25:48

R&D and chip making in full swing

play25:50

Google already developed a chip

play25:51

specifically designed for AI training called TPU

play25:54

According to them,

play25:54

it is more efficient than Nvidia's graphics card.

play25:56

In 2017, Meta already used

play25:58

over 20,000 NVIDIA's graphics cards

play26:00

for AI training

play26:01

but now they are also

play26:02

all in in researching chip, calling it MTIA

play26:05

Actually from 2020

play26:06

NVIDIA has been actively seeking

play26:08

Masayoshi Son’s Softbank

play26:09

to acquire their chip company ARM

play26:11

at the price of $40 billion

play26:13

But Qualcomm, Microsoft and Google

play26:15

are all strongly against it

play26:17

The deal fell through

play26:18

From here you can see that those tech giants

play26:19

are actually very afraid of Nvidia.

play26:23

play26:24

Emerging from the early days graphics card industry

play26:27

is his ability

play26:28

Having laid out CUDA since 20 years ago

play26:31

is his vision

play26:32

Continuous monopoly in graphics card industry

play26:34

is his endurance

play26:35

Meeting the demand for massive computing power in mining

play26:37

is his luck

play26:39

The unchanged fashion style

play26:41

is his devotion

play26:45

When long accumulation period

play26:47

meets the AI wave

play26:48

NVIDIA naturally obtain

play26:49

first-mover advantage

play26:51

Facing new vast potential market

play26:53

and competitors looking to get in on the action

play26:55

Can NVIDIA fight their

play26:56

way out again in this

play26:58

new wave of AI fight?

play26:59

It’s hard to say

Rate This

5.0 / 5 (0 votes)

英語で要約が必要ですか?