The State of Data & AI - Trevor Jones

Thorogood
2 Jul 202429:50

Summary

TLDRThe video script discusses the evolution and impact of AI, particularly generative AI, in the business landscape. It highlights the importance of demystifying AI and understanding its true potential beyond the hype. The script covers the dramatic events surrounding Open AI and Microsoft, the competitive AI space, and the role of data in training AI models. It emphasizes the need for a balanced approach to AI's commercialization and safety, and the integration of AI with established data science and engineering practices for valuable business outcomes.

Takeaways

  • 🧠 AI Demystification: The script emphasizes the need to demystify AI, clarifying that it's not magic and builds upon previous technological advancements.
  • πŸ“ˆ Generative AI's Role: It highlights generative AI's capability to capture signals from unstructured data and combine them with structured data sources, enhancing data utilization.
  • πŸ€– AI in Business: The speaker stresses the importance of using AI in conjunction with established data science and engineering practices to serve valuable business cases.
  • πŸ”„ Open AI Drama: The script recounts the dramatic events involving Open AI, including the dismissal and subsequent return of CEO Sam Altman, and the shift in board composition.
  • 🀝 Microsoft's AI Alliances: It discusses Microsoft's strategic moves in AI, including its relationships with Open AI, Mistral AI, and Inflection AI, showcasing its broad engagement in the AI space.
  • πŸ’‘ Chip Makers' Influence: The script underscores the pivotal role of chip manufacturers like Nvidia and AMD in powering AI training and usage, highlighting their significant market positions.
  • 🌐 Cloud Vendors' Competition: The document outlines the competitive landscape among cloud platform vendorsβ€”Google, AWS, and Microsoftβ€”as they vie for dominance in the AI-driven cloud market.
  • πŸ“Š Investment Surge: The script notes the substantial investments being made in AI by both private companies and governments, indicating a massive influx of capital into the sector.
  • πŸ› οΈ Technological Mosaic: AI is part of a larger technological ecosystem that includes sensors, 5G, robotics, and biotechnology, contributing to a rich tapestry of innovation.
  • πŸ“š Realism in AI Impact: There's a call for realism regarding AI's macroeconomic impact, suggesting that while significant, it may not be instantaneous and requires a long-term perspective.
  • πŸ“ˆ Data Centrality: The script concludes by reiterating the fundamental role of data in AI, stating that data is both the foundation for training models and the key to leveraging AI for business success.

Q & A

  • What was the main aim of the AI and data update event in London on October 19th?

    -The main aim of the event was to demystify AI, helping people discern between the true potential of the technology and the hype that was building around it.

  • What is generative AI's role in handling data according to the script?

    -Generative AI helps capture signals from unstructured data sources and combines them with structured data sources, enhancing the tools available for business cases.

  • What significant event occurred with OpenAI around a month after the London event?

    -There was a sudden drama where OpenAI's CEO, Sam Altman, was dismissed by the board, but later returned as CEO with a recomposed board.

  • What was the reported outcome of Microsoft's involvement with OpenAI during the boardroom drama?

    -Microsoft was involved in trying to ensure something was preserved from the situation, and it was believed that up to 95% of OpenAI's staff would move to Microsoft.

  • How did Microsoft's strategy evolve post the OpenAI incident?

    -Microsoft started to diversify its alliances by striking a deal with a French AI company, MISTL, and later acquiring Inflection AI, indicating a push beyond its previous reliance on OpenAI.

  • What is Nvidia's role in the AI industry as mentioned in the script?

    -Nvidia is a major supplier of processing power for AI training and usage, with around 85% of the market share, and has developed a software platform highly valued by model developers.

  • What is the significance of the bipartisan Senate AI working group's report?

    -The report calls for at least $32 billion per year in non-defense AI innovation spending, starting from 2026, indicating a significant commitment to AI from the U.S. government.

  • How is the investment in AI expected to impact the technology's development?

    -The investment is driving rapid improvements in AI models, including longer input context windows, improved reasoning, and the ability to handle multimodal inputs such as text, image, audio, and video.

  • What is the importance of data in the context of generative AI as discussed in the script?

    -Data is fundamental for training foundational models and is essential for applying the strength of these models to a company's unique information and customer interactions.

  • What is the current sentiment towards the macroeconomic impact of generative AI according to the script?

    -There is a greater realism about the impact of generative AI, with the understanding that while it will have a significant effect, it may not be instantaneous.

  • How does the script describe the current state of the AI market in terms of competition and innovation?

    -The AI market is described as highly competitive and innovative, with many significant players and new entrants like MISTL, and it's not solely dominated by large companies.

Outlines

00:00

πŸ€– Demystifying AI: Separating Hype from Reality

The script begins with a reflection on the AI hype that peaked in October 2023, following the launch of GPT in late 2022. The speaker's aim was to demystify AI, emphasizing that it is not magic and has evolved from previous technologies. Generative AI was highlighted as a tool for capturing signals from unstructured data, which can be combined with structured data. The speaker stresses the importance of using AI in conjunction with established data science and engineering practices to serve valuable business cases, rather than as a standalone solution. The script also touches on the dramatic events surrounding OpenAI, including the dismissal and subsequent return of CEO Sam Altman, and the potential shift of staff to Microsoft.

05:01

🌐 AI's Competitive Landscape and Strategic Alliances

This paragraph delves into the competitive landscape of AI, focusing on the roles of major players like Microsoft, Google, and Amazon in the cloud platform market. It discusses Microsoft's strategic moves, such as its investment in OpenAI and its partnership with the French AI company, Mistral. The speaker also mentions the importance of safety versus commercialization in AI development. Chip makers like Nvidia and AMD are highlighted for their crucial role in providing the processing power for AI training and usage. The paragraph concludes by discussing the significance of managed analytics platforms like Snowflake and Databricks, which offer cloud-based solutions for data warehousing and engineering.

10:03

πŸ“Š Forrester Wave Analysis and Market Presence of AI Models

The speaker presents an analysis based on the Forrester Wave, which evaluates AI foundation models for language. The analysis considers market presence and strategy, with Google positioned as a leader in both. The paragraph discusses the difficulty of categorizing the rapidly evolving AI market and the importance of not underestimating smaller players. The analysis serves to frame the competitive space, highlighting the presence of various companies in the market, including OpenAI, Microsoft, and Mistral, with a note on the peculiar absence of Meta's Llama model from the analysis.

15:04

πŸ’° Investment Surge in AI and its Economic Impact

The script addresses the significant investment in AI by major tech companies like Meta, AWS, Google, and Microsoft, which are expected to invest around 200 billion dollars. It also mentions the US government's spending on defense-related AI procurement and a bipartisan Senate AI working group's call for increased non-defense AI innovation spending. The speaker notes the global nature of this investment, with countries like China also actively participating. The paragraph concludes by discussing the rapid improvement of AI models, focusing on three facets: longer input context windows, improved reasoning, and the ability to handle multimodal inputs.

20:07

πŸ“ˆ The Evolution of AI Models and Their Commercialization

This paragraph discusses the evolution of AI models, particularly the increase in input context windows, which has significantly expanded the amount of text that can be processed. It also touches on the importance of creating sophisticated prompts for these models and the improved reasoning capabilities that come with larger context windows. The speaker mentions the cost associated with using more advanced models, such as OpenAI's 3.5 and 4.0, and how pricing is likely to become more competitive. The paragraph concludes with a historical perspective on the development of technology from the 1970s to the present, emphasizing the layered nature of technological advancements.

25:09

🌐 Realism in AI and Its Integration with Data Science

The final paragraph emphasizes the shift towards realism in the perception of AI's capabilities and impact. It discusses the integration of AI with other data science and engineering techniques, highlighting the importance of using data effectively. The speaker argues that generative AI should be used in combination with existing capabilities to address valuable business goals. The paragraph concludes by reiterating that data is central to the development and application of AI, and that everything can be considered data in the context of AI's ability to analyze unstructured information.

Mindmap

Keywords

πŸ’‘AI Chat GPT

AI Chat GPT refers to a type of artificial intelligence designed to engage in conversation with humans. In the script, it is mentioned as a significant launch in November 2022 that generated a lot of hype and awareness in the AI field. The video aims to demystify AI, emphasizing that it is not magic but built on previous technological advancements.

πŸ’‘Generative AI

Generative AI is a subset of AI that can create new content, such as text, images, or music, based on existing data. The script discusses how generative AI can capture signals from unstructured data and combine them with structured data sources, illustrating its potential in aiding businesses beyond just hype.

πŸ’‘Data Science

Data Science is an interdisciplinary field that uses scientific methods, processes, and algorithms to extract knowledge and insights from data. The script highlights the importance of using generative AI in conjunction with established data science practices to serve valuable business cases.

πŸ’‘Data Engineering

Data Engineering involves the collection, storage, and processing of data to ensure its quality and accessibility for analysis. The video script mentions that generative AI is best used in combination with data engineering to address business goals effectively.

πŸ’‘Open AI

Open AI is a research laboratory that aims to develop artificial general intelligence (AGI) in a way that benefits humanity. The script discusses a dramatic event involving Open AI's CEO dismissal and subsequent return, indicating the volatility and rapid changes in the AI industry.

πŸ’‘Microsoft

Microsoft is a leading technology company that has heavily invested in AI, particularly through its alliance with Open AI. The script mentions Microsoft's involvement in the AI space, including its efforts to preserve Open AI and its strategic partnerships with other AI companies.

πŸ’‘AI Safety

AI Safety refers to the practices and measures taken to ensure that AI development and deployment do not pose risks to humans or society. The script touches on the challenge of balancing AI safety with commercialization, indicating the importance of this aspect in the development of AI technologies.

πŸ’‘Cloud Platform Vendors

Cloud Platform Vendors are companies that provide cloud computing services, such as Amazon Web Services (AWS), Google Cloud, and Microsoft Azure. The script positions these vendors as key players in the AI competition, emphasizing their role in driving market share and growth in the AI space.

πŸ’‘Chip Makers

Chip Makers are companies that design and manufacture semiconductor chips, which are crucial for powering AI models and cloud platforms. The script specifically mentions Nvidia, highlighting its significant market share and influence in providing processing power for AI training and usage.

πŸ’‘Hugging Face

Hugging Face is a company that provides a platform for developers to access and integrate the latest AI models. The script describes Hugging Face as a valuable resource for developers, helping them understand and utilize the rapidly evolving options in AI models.

πŸ’‘Forrester Wave

The Forrester Wave is a series of reports that evaluate and rank the top vendors in a given market based on their current offerings and strategies. The script uses the Forrester Wave for AI Foundation Models for Language to illustrate the competitive landscape of AI companies and their market presence.

Highlights

AI demystification efforts aimed at discerning the true potential of technology amidst the hype.

Generative AI's capability to capture signals from unstructured data sources and combine with structured data.

The importance of using generative AI in conjunction with established data science and engineering for business cases.

Drama around Open AI's CEO dismissal and subsequent developments with Microsoft.

Microsoft's strategic moves in AI, including deals with Mistral AI and Inflection AI.

Concerns about the stability and management of Open AI amidst significant investment in the AI space.

The balance between AI safety and commercialization as a critical challenge.

Competition in the AI space among cloud platform vendors like Google, AWS, and Microsoft.

The role of chip makers like Nvidia and AMD in powering AI training and usage.

Hugging Face and Llama Index as valuable resources for developers integrating the latest AI models.

Analyst perspectives on AI foundation models for language from Forrester Wave.

Massive investments by tech giants like Meta, AWS, Google, and Microsoft in AI.

U.S. government spending and bipartisan Senate AI working group's call for increased AI innovation funding.

Improvement of AI models in aspects such as longer input context windows, reasoning, and multimodal input handling.

Economic implications and the potential long-term impact of AI on productivity and the world economy.

The integration of generative AI with other data science techniques and technologies for enhanced decision-making.

Realism in the investment community regarding the macroeconomic impact of generative AI.

Generative AI as part of a larger technological mosaic including sensors, 5G, robotics, and biotechnology.

The necessity for businesses to focus on data and its effective use in combination with AI models.

Transcripts

play00:01

[Music]

play00:05

good morning everybody back in October

play00:07

October the 19th was the last thorough

play00:10

good data and AI update in

play00:12

London uh at that

play00:15

time there was an awful lot of awareness

play00:18

uh you awareness hype some would say

play00:20

about AI chat GPT had been launched on

play00:24

the world at the end of November 2022

play00:27

and and all of the months up to October

play00:29

in in in 2023 were really about that AI

play00:33

hype and

play00:34

our aim really going into that day was

play00:38

to demystify

play00:40

AI uh to help really we thought what we

play00:44

what we would be most useful trying to

play00:47

do was to help people discern between

play00:49

the true potential of the technology and

play00:51

some of the hype that was building

play00:52

around it um and so our our aim was to

play00:58

demystify

play01:00

uh basic points were that it's not magic

play01:04

and that it didn't just come out of

play01:06

nowhere so like all technology it's

play01:09

built on waves and waves of things that

play01:12

go before it but during the sessions of

play01:15

that day we were trying to illustrate in

play01:17

fact I think we're pretty successful in

play01:19

illustrating that generative AI is able

play01:22

to do some things for us so generative

play01:24

AI is able to help us capture signals

play01:27

from unstructured data sources and to

play01:29

comp combine those with the data sources

play01:31

the structured data sources we're so

play01:33

familiar with with

play01:35

using um but also we were trying to

play01:39

illustrate that generative AI is best

play01:42

used in combination with established

play01:44

data science data Engineering in the

play01:47

service of valuable business cases not

play01:51

uh you as opposed to just trying to look

play01:53

for things that generative AI on its own

play01:56

can do

play02:00

the side of you know capturing signals

play02:02

from unstructured data in in many ways

play02:04

what we were saying is data is or

play02:07

everything is is is data or everything

play02:10

can potentially be

play02:12

data and on the other side we're saying

play02:15

that generative AI adds to the tools

play02:17

that we have at our disposal to serve

play02:19

business cases doesn't replace

play02:22

everything so generative AI adds rather

play02:24

than replaces

play02:30

within a month in fact almost exactly a

play02:32

month after the event there was some

play02:33

sudden drama around open AI um open AI

play02:38

obviously having been at the center of

play02:40

of things as the the creators of chat

play02:43

GPT um and in fact on the 17th Friday

play02:47

the 17th of November uh for some reason

play02:52

the board decided to dismiss the the the

play02:55

the CEO Sam

play02:57

mman reported here in the the Ft on the

play03:00

18th but you a pretty busy weekend

play03:05

ensued lots of uh rumors and things

play03:08

coming out but by the end of that

play03:10

weekend it seemed like uh Sam Alman

play03:14

would join Microsoft and take a job

play03:16

there to develop AI there and that

play03:18

perhaps 95% up to 95% of the staff

play03:21

working for open AI would go to

play03:23

Microsoft and within a day or two of

play03:26

that well it was announced that Sam

play03:29

Alman would in fact return as CEO to

play03:31

open aai and that the board at open AI

play03:34

would be a different board different

play03:36

composition so uh a weekend a long

play03:40

weekend of drama um for one of the

play03:43

players at the sort of center of of

play03:45

things now of course open AI had been

play03:50

you know uh supported heavily invested

play03:53

in by Microsoft and Microsoft had placed

play03:55

quite a lot of bets on open AI in terms

play03:58

of its own strategy so during that

play04:00

weekend its involvement in trying to

play04:02

make sure that there was something

play04:04

preserved out of the bsed place was was

play04:07

important but also uh by here late

play04:11

February uh you could see again another

play04:13

ft clip Microsoft strikes deal with

play04:16

mistel so mistl AI French AI company uh

play04:21

had at that point been established

play04:23

perhaps a year uh gone from startup to

play04:27

well today valued at something like $5

play04:30

billion so I know deal with Microsoft

play04:34

and that really showing that Microsoft

play04:36

was pushing Beyond its depend or

play04:39

Alliance that to that point with just

play04:41

open

play04:42

Ai and it's believed that you microsof

play04:46

started that conversation with Mr all in

play04:49

December so pretty much immediately

play04:50

after that that sort of blow up at open

play04:54

AI uh

play04:57

March Microsoft hes somebody uh the

play05:00

chief executive of inflection AI but

play05:04

also the guy who really had been behind

play05:06

deep mind that was bought by Google uh

play05:09

mustapa

play05:11

sullon they didn't acquire inflection AI

play05:14

that doesn't seem to be Microsoft's

play05:16

style in this but but somehow Microsoft

play05:19

is building connection with a lot of

play05:21

players and particularly a lot of

play05:23

innovators and startups in in the space

play05:26

um but you coming much more up to dat by

play05:29

the end of May still concerns about the

play05:33

stability of the the sort of structures

play05:35

that that that owner or manage and

play05:38

control uh open Ai and in some ways we'd

play05:41

say that you that concern about

play05:44

stability there's clearly a lot of money

play05:47

flowing into to the a very hot

play05:50

technology space the main players in it

play05:52

are

play05:53

very recently established uh so that's a

play05:57

recipe for for volatility but of course

play05:59

there's a a very important uh challenge

play06:03

with AI about safety versus

play06:05

commercialization or safety with

play06:07

commercialization so how to play that is

play06:10

is clearly an important and a pretty

play06:13

difficult balance to get right so

play06:15

there's some

play06:19

volatility it's all almost everything

play06:22

that I've said so far has has sort of

play06:24

been about open Ai and Microsoft and in

play06:27

some ways the hype of 2023 was Ed by

play06:30

Microsoft and open AI but it's a very

play06:33

very competitive space and there are a

play06:35

lot of players many of them in fact

play06:38

Google as an example would have been

play06:41

clearly perceived as the leader before

play06:44

that that sort of splurge by Microsoft

play06:47

in in in in November

play06:50

2022 so the cloud platform vendors

play06:55

Google AWS Microsoft for them they're

play06:58

competing for market share of the uh

play07:01

Cloud Market but there you know

play07:04

obviously with that goes competing for

play07:07

growth rate of growth to gain market

play07:09

share in the space and the promise of AI

play07:12

for them is just too important a source

play07:14

of growth for them not to compete very

play07:16

very heavily for so there's you know the

play07:19

the cloud vendors are key players in

play07:21

this AI

play07:23

competition but the model developers are

play07:26

very important too so in this sort of

play07:28

second box on on the screen here uh I've

play07:31

mentioned mistol mentioned open a a lot

play07:35

but there are many other significant

play07:36

players meta massive company that has

play07:39

developed its llama models and put them

play07:41

into the open source

play07:44

Community um Elon Musk with X aai active

play07:49

in the

play07:50

space got a smile just mentioning Elon

play07:52

Musk I I I won't go any further um and

play07:56

and I I'll I'll say a few more things

play07:59

about that in a minute but um the chip

play08:01

makers are important so it's very

play08:04

unusual for us at a Thor good type of

play08:06

event to to worry about the hardware too

play08:09

much but the chip makers well

play08:12

Nvidia you know I can remember the days

play08:15

of trying to select the right cards for

play08:17

gaming not that I gamed much but members

play08:20

of my family did um and now you Nvidia

play08:24

the the world's most valuable company as

play08:26

as of a few days ago uh three trillion

play08:30

dollar valuation market

play08:32

capitalization uh but at the moment uh

play08:35

supplying around 85% of the processing

play08:39

power that is being used by the model

play08:41

developers and the cloud platform

play08:43

providers to power AI training and AI

play08:47

usage uh so an incredibly powerful

play08:50

position in in the market because of

play08:52

some of the very distinctive properties

play08:55

of the processes they make the massively

play08:57

parallel processing capabilities but not

play09:00

only that they've really um developed a

play09:03

software platform that allows model

play09:06

developers to harness those chips in a

play09:08

way that nobody there's no other

play09:10

software platform that is their

play09:13

equivalent that has the acceptance among

play09:16

model developers but it also has some

play09:18

networking technology to connect those

play09:20

processors together so Nvidia has a very

play09:23

powerful

play09:25

position AMD another uh chip designer in

play09:29

fact uh Nvidia doesn't fabricate its

play09:32

chips it designs its chips and they're

play09:35

fabricated by Taiwan silicon uh

play09:38

semiconductor Manufacturing Corporation

play09:40

AMD similarly designs chips and the

play09:44

fabrication is also done by

play09:46

tsmc uh but AMD now with chips that in

play09:50

performance terms can challenge and

play09:52

surpass nvidias is trying to mount a a

play09:56

challenge Intel struggling with some of

play09:59

the the the the very um fine grain

play10:03

processor technology but nevertheless a

play10:05

significant place and interestingly

play10:08

Google AWS and Microsoft all entering

play10:12

this chip uh Market

play10:16

too also important to our probably much

play10:19

less talked about but important to

play10:21

developers are the places that

play10:23

developers can go to pick up the latest

play10:25

models when they're

play10:27

integrating hugging face American French

play10:30

company uh llama index providing

play10:34

something that is incredibly valuable to

play10:37

developers at a time when model options

play10:40

are coming up fast and furiously and

play10:43

there's so much to choose from its way

play10:45

of understanding what's available what's

play10:46

trending how to use it and how to

play10:49

integrate um and the final box managed

play10:55

analytics

play10:56

platforms uh I've got two here snowflake

play11:00

data

play11:01

bricks essentially similar in some ways

play11:04

both essentially providing

play11:06

platforms that will run on any of the

play11:09

clouds or in a multicloud sense across

play11:12

clouds so

play11:15

insulating the application layer from

play11:17

the particular cloud provider Choice uh

play11:21

to a large degree but doing it in quite

play11:23

different way so snowflakes approach to

play11:25

it to come really from a a relational uh

play11:29

uh

play11:30

you well making the appeal to people who

play11:33

have used relational technology to to do

play11:36

relational uh data warehousing and

play11:39

making that fast and easy in in the

play11:42

cloud and in a multicloud way data

play11:44

bricks coming at it in a more I would

play11:47

say Innovative way uh harnessing spark

play11:50

and getting spark technology as a

play11:52

foundation enabling data engineering

play11:56

data science and now ai to be done again

play11:59

AC cross multic clouds but without the

play12:01

constraint of relational uh technology

play12:04

or yeah with the option of relational

play12:06

technology but without that being a

play12:11

constraint I'll go to an analyst uh

play12:14

perspective on some of those

play12:16

foundational models so this is the

play12:18

Forester wave a AI F Foundation models

play12:22

for language now the first thing I'll

play12:24

say there's a lot of uh I suppose

play12:27

skepticism or cynicism about some of the

play12:30

analysts work in any of the fields but

play12:32

in in this field at the moment it's

play12:34

quite interesting to try and even you

play12:37

describe who's got leadership and and

play12:40

and and who's got strategic Advantage

play12:42

but the what I'll do is I'll do a zoomed

play12:45

in version in the middle so that we can

play12:47

look at some of the players in the space

play12:49

but I'll describe the the overall um

play12:53

setup of the analysis first of all so

play12:56

Market presence the size of circle is

play12:59

indic indicating in Foresters terms the

play13:02

the the market

play13:04

presence um if the circle is gray or the

play13:09

Little Dot is empty has a space in the

play13:12

middle of it uh that denot and you'll

play13:14

see that more clearly when we zoom in

play13:16

that denotes whether somebody has

play13:18

participated in the Forester study or

play13:20

not uh so here anthropic is gray haven't

play13:25

explicitly participated open AI has

play13:28

participated so that's an important

play13:29

distinction I think uh the

play13:32

axes

play13:34

um going on the vertical axis from at

play13:37

the bottom weaker current offering to

play13:40

Stronger current offering now this is a

play13:42

very fast moving Market it's very hard

play13:45

to I think classify what's weak and

play13:48

what's strong in in a pure

play13:51

sense uh and then along the horizontal

play13:54

axis weaker strategy stronger

play13:57

strategy uh again very hard to I think

play14:01

decide you know well I'll zoom in a

play14:06

little bit um so in the leadership

play14:09

positioning position

play14:11

Google clearly a powerful strategy

play14:14

clearly some pretty established presence

play14:16

in this kind of

play14:18

Technology at the other end of the

play14:20

spectrum though mistol classified here

play14:22

as just coming into it well mistra can

play14:26

you judge it in isolation how you judge

play14:29

it because mistal in combination with

play14:31

Microsoft is quite a different

play14:32

proposition to mistol a

play14:35

startup um and open AI in many ways you

play14:39

know showing a lot of Market presence

play14:42

Microsoft not oh and there's an example

play14:44

of a hollow dot so Microsoft not a

play14:47

participant with Forester in its own

play14:50

right but clearly

play14:52

presence in the market through open AI

play14:54

so it's a bit of an odd analysis but it

play14:56

serves to put the names in the frame for

play14:59

us that this is a wide open space

play15:01

there's a lot of competition a lot of

play15:03

innovation you can't just say somebody's

play15:06

got this because they're big or somebody

play15:09

hasn't got it because they're small you

play15:10

could come from nowhere and indeed

play15:13

M you know has done and is

play15:17

doing I'll move on from that uh to

play15:21

something about investment so you know

play15:25

2023 there was clearly investment going

play15:27

in uh a lot of

play15:29

talk you amongst the you Market

play15:33

participants private Equity investors

play15:36

Etc about all of the money going

play15:39

in but at the moment you know this year

play15:43

it's expected that about 200 billion

play15:46

will be invested

play15:48

by those four players meta AWS Google

play15:53

and

play15:54

Microsoft meta by the way whose

play15:57

technology llama

play15:59

or whose Foundation model technology

play16:01

llama put into open the open source

play16:03

space not even figuring on the previous

play16:06

slide we don't know why but it's an odd

play16:09

one to be missing from from that Forest

play16:11

analysis but colossal amounts of money

play16:14

being spent by those companies now

play16:17

they're private companies they're making

play16:19

Investments those investments will be

play16:23

depreciated and that depreciation will

play16:25

show up in their income statements at

play16:27

some point their p&l and and they will

play16:30

be you working very hard to make sure

play16:32

they're showing a profit on on those

play16:34

charges not a a loss on them so there's

play16:36

something very real world focused about

play16:39

the Investments that those companies are

play16:41

making colossal though they

play16:43

are in a different sense this is looking

play16:47

backwards so it's estimated that the US

play16:50

government uh spent more than 4 billion

play16:53

last year on procurement for uh defense

play16:57

related AI just procurement for defense

play17:00

related so $44 billion last

play17:03

year a more forward-looking thing uh a

play17:07

senate a bipartisan which is quite

play17:09

interesting you don't hear much that's

play17:11

positive and bipartisan about us

play17:13

politics at the moment but a bipartisan

play17:16

Senate AI working group published a

play17:19

report in May that calls for at least 32

play17:23

billion per year non-defense AI

play17:26

Innovation spending 32 billion per year

play17:30

and the target is for that to be uh

play17:32

raised and spent starting from 2026 so a

play17:37

pretty aggressive thing I think it would

play17:40

be safe to assume that they won't stop

play17:42

spending on a defense front as well so

play17:45

an awful lot of money coming into this

play17:47

space not just from the private players

play17:50

but from government

play17:52

players and that's supplemented by

play17:55

public announcements coming along quite

play17:56

frequently at the moment from other

play17:58

countries

play17:59

uh making in in many cases you know

play18:02

multi hundred million commitments to

play18:04

investment in the space I've listed a

play18:07

few here um but you clearly somebody

play18:12

like China is going to be pretty active

play18:14

too although it's not obvious you what

play18:16

this what the equivalent expenditure

play18:18

is um so so it's a very active space

play18:22

some people need to show a return on

play18:25

that investment government's probably

play18:27

different to that they able to act on a

play18:30

a different belief basis they certainly

play18:32

don't want to be left out of something

play18:34

that has the potential that that AI

play18:38

has um so all of this energy going into

play18:44

those companies producing the models

play18:46

well the models are being improved very

play18:49

quickly and there I've just reduced it

play18:51

to three different facets here but there

play18:52

are three important facets so longer

play18:57

input context windows improved reasoning

play19:00

and the third one ability to handle text

play19:03

and image together as input prompts and

play19:07

more generally the ability to handle

play19:10

text image audio video coming in

play19:14

multimodel becoming ever more important

play19:16

in these models but to the first point

play19:20

longer input context

play19:22

Windows

play19:24

um I suppose if I confess my own naivity

play19:28

year and a half ago I was thinking a

play19:31

prompt for chat GPT is not that

play19:34

dissimilar to you typing a a query into

play19:38

Google obviously a bit more to it than

play19:41

that because you could continue to

play19:43

develop context as you as you prompted

play19:46

but nevertheless I'm thinking you

play19:47

relatively small things whereas uh some

play19:51

of these figures here if we take the

play19:53

table here uh check uh GPT or GPT 3.4

play19:58

five turbo the input Windows there were

play20:02

supporting 16,000 tokens converted into

play20:06

rough pages of text equivalent that's

play20:09

about 24 pages of text so even

play20:13

3.5 a enabling a lot of prompting to be

play20:17

to be put

play20:19

forth

play20:21

128,000 take takes that up to probably

play20:23

you know getting on for a hundred pages

play20:25

of

play20:26

text um by the time you get to yeah a

play20:30

million tokens

play20:33

uh I won't do the arithmetic for you but

play20:35

but many many pages uh that that we can

play20:39

put in to to these things so you think

play20:41

well this isn't like T typing a query

play20:43

into Google this is something more

play20:45

significant that than that

play20:48

now in a recent Thing by uh one of the

play20:52

sort of uh bloggers that I trust in the

play20:55

space Andrew in he was talking about

play20:57

multipay prompts as being really Mega

play21:01

prompts well you'll see later this

play21:03

morning in this session actually when we

play21:06

get to the Oxford university endowment

play21:08

management application the prompts that

play21:10

we're providing there are many many

play21:13

pages long and and quite sophisticated

play21:16

in their structure so uh the ability to

play21:19

really create very intelligent prompts

play21:22

for these models is is or very big

play21:24

prompts I should say is is one important

play21:27

facet the next Point improved reasoning

play21:30

Builds on that because the more

play21:32

reasoning power that is being developed

play21:35

into the foundation models the better

play21:37

they're able to interpret and make sense

play21:39

of large prompts so it the the two

play21:42

things feed themselves bigger prompt

play21:45

Windows better ability to make sense of

play21:48

the prompts and then the ability to

play21:51

handle text and image well you we might

play21:53

not have ever thought of putting a

play21:55

diagram into Google but um you certainly

play21:57

can start thinking now about putting

play21:59

images as well as text into your

play22:03

prompting now there's a there's a cost

play22:06

Associated so yeah I've put here the the

play22:09

pricing first of all for the open AI uh

play22:13

3.5 40 and um yeah four

play22:18

turbo price of four turbo $10 per

play22:22

million tokens versus 50 cents per

play22:26

million tokens so it gives gives you

play22:28

something more but you pay a lot lot

play22:30

more for it so these guys are trying to

play22:33

commercialize this space it's not it's

play22:36

not a

play22:36

charity

play22:38

um table somewhat inconsistent I put

play22:41

anthropic Claude 3 Opus there that's the

play22:44

most powerful model at the moment coming

play22:46

from

play22:47

anthropic much more expensive relatively

play22:50

speaking but it is their very top

play22:53

offering they have cheaper offerings too

play22:57

uh or yeah good Google Gemini 1.5 Pro

play23:00

that you colossal uh prompt window

play23:03

capacity priced relatively competitively

play23:06

and the pricing here I've put is for um

play23:09

prompts that are over

play23:11

128,000 tokens but if you can keep your

play23:15

prompts within a 100 you keep within 100

play23:17

Pages you could pay half that price

play23:20

three $350 for yeah per per million

play23:24

tokens so pricing no doubt will move and

play23:27

move and move competitively and quickly

play23:30

but generally people will be trying to

play23:31

get more for the latest better

play23:36

capability um in the last one of these

play23:40

we kind of built a a pretty elaborate

play23:42

picture of what's happened in terms of

play23:44

waves and layers of Technology since the

play23:47

1970s right up to the current with that

play23:50

sort of Starburst moment at the top of

play23:53

the diagram around Google releasing its

play23:56

file system and and reduce papers which

play23:59

started something well what did it start

play24:03

that that Google release started the

play24:04

cloud era the Big Data era and on top of

play24:08

that the AI era has been built uh in

play24:11

some ways you know eclipsing what has

play24:15

happened over decades with relational

play24:17

and before that multi-dimensional

play24:20

technology all of those relevant to us

play24:22

though yeah as a business thorough Goods

play24:24

business is helping customers make

play24:27

better faster decisions

play24:29

with the data that they have available

play24:31

so all of these layers have been uh

play24:34

applicable and still are in in various

play24:36

ways uh applicable the technology just

play24:39

gets better the options get richer and

play24:42

that continues uh with some of the data

play24:47

handling capabilities graph databases

play24:50

geospatial databases you'll see examples

play24:53

of the applicability of those later this

play24:56

morning in some of the cases but and

play24:59

Vector databases you'll see uh reference

play25:01

to that in the Oxford university

play25:03

endowment management case so these

play25:06

Technologies just layering in at the

play25:08

cloud big data and AI level enriching

play25:12

what we can do so it's not all about

play25:14

large language models or Foundation

play25:16

models it's about the combination of

play25:19

those models with other data science

play25:22

techniques other data engineering

play25:24

techniques and and platting it together

play25:27

effectively um

play25:31

um greater realism I think one of the

play25:35

big differences between you know

play25:37

standing talking today and even eight

play25:40

months ago there's much greater realism

play25:43

if I talk to people in the investment

play25:45

community at the moment they've realized

play25:47

that it's not just about large language

play25:50

models it's not just about generative AI

play25:52

it's about data engineering capabilities

play25:55

data science capabilities business focus

play25:58

yeah what you do with it um but there

play26:02

was a a point the world economic Forum

play26:06

um some economists saying stay stay

play26:08

realistic about generative ai's

play26:11

macroeconomic impact but it always

play26:14

amuses me when economists tell us to

play26:16

stay realistic but but um putting that

play26:18

aside and looking at one of my

play26:20

colleagues who might be upset by that

play26:22

but

play26:23

um you know stay realistic about

play26:26

generative ai's macroeconomic impact

play26:29

I the economists making that statement

play26:31

weren't saying that there won't be much

play26:33

impact but they're saying it's likely to

play26:36

take

play26:37

longer than instant it won't be

play26:41

instantaneous that the effect is felt

play26:43

but even small impacts over long periods

play26:47

of productivity difference will be

play26:50

enormously beneficial to the world

play26:52

economy so it's too important to ignore

play26:55

the first bullet point though generative

play26:57

AI is is a critical piece of a

play26:59

technology technological

play27:01

Mosaic including sensors 5G robotics

play27:05

biotechnology etc etc is part of a

play27:09

mosaic and the second bullet point

play27:12

exuberance typically accompanies

play27:14

remarkable

play27:15

Innovations but I think when an

play27:18

economist uses

play27:19

exuberance they're probably referencing

play27:22

back to the irrational exuberance phrase

play27:25

that Alan Greenspan used to use when he

play27:28

was uh chairman of the Federal Reserve

play27:30

in the United States and he was using

play27:32

that to describe what was going on in

play27:35

the do com boom of the 1990s and into

play27:38

into the uh early 2000s so the doom boom

play27:43

people thought that was a boom that

play27:45

burst and bust but out of it came Google

play27:48

Amazon I mean that wasn't irrational but

play27:51

there was a lot of stuff around the ones

play27:52

that did survive and make it big that

play27:54

was much more irrational than that so

play27:57

we're in a similar phase I suspect where

play27:59

it's sensible to be grounded and the

play28:02

theme that we'll be pursuing today is

play28:04

how to ground that how to ground in

play28:06

things that will be valuable for your

play28:10

business

play28:13

um so coming back and and in some ways

play28:16

relating to that that that previous

play28:19

Economist Point um generative AI as I

play28:23

said at the beginning is best used in

play28:25

combination with established data

play28:27

science and data engineering ing

play28:28

capabilities to address valuable

play28:30

business goals rather than looking for

play28:33

things that generative AI alone can

play28:36

do generative AI adds rather than

play28:42

replaces we'd say in fact data is

play28:46

everything data is the basis for

play28:49

training these foundational

play28:52

models that's what's enabling them to

play28:54

develop their intelligence and their

play28:56

usefulness but

play28:58

the data that any particular company has

play29:01

uniquely to itself the latest

play29:03

information particularly the most unique

play29:06

information that a company has that can

play29:08

be used in combination with those models

play29:10

to apply the strength of those models to

play29:13

the strength of the company's contact

play29:16

with its customers in the world so it's

play29:19

about

play29:20

using data effectively data is

play29:23

everything and as we said earlier

play29:27

everything is dat as well in this world

play29:29

where large language models can look at

play29:31

unstructured information and pull

play29:32

signals out of it so everything is data

play29:36

data is everything it's not magic it

play29:40

didn't come out of nowhere and I'll hand

play29:43

over to Amanda

play29:45

[Music]

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
Artificial IntelligenceData ScienceGenerative AIAI HypeBusiness CasesTech InnovationCloud PlatformsModel DevelopmentAI SafetyCommercializationInvestment Trends