How Perplexity works | Aravind Srinivas and Lex Fridman

Lex Clips
23 Jun 202413:33

Summary

TLDRThe transcript discusses 'Perplexity,' an AI-driven answer engine that functions by combining search engine capabilities with a large language model (LLM). It emphasizes the importance of sourcing, akin to academic research, to ensure accuracy. Perplexity is designed to provide direct answers with citations, inspired by the need for reliable information in a world of abundant but often misleading online content. The conversation also touches on the limitations of Perplexity compared to traditional search engines like Google, highlighting the ongoing challenges in AI integration for real-time information and user experience.

Takeaways

  • 🧠 Perplexity is described as an 'answer engine' that provides answers backed by sources, similar to academic referencing.
  • 🔍 The search engine component of Perplexity extracts relevant results from the internet, which are then fed into a large language model (LLM).
  • 🤖 The LLM processes the information and generates a well-formatted answer with citations, adhering to the academic principle of sourcing every statement.
  • 📚 Perplexity's approach is inspired by the need for accuracy in academic writing, aiming to ensure that chatbots provide reliable and citable information.
  • 💡 The founders of Perplexity, with academic backgrounds, sought to apply the rigor of academic research to chatbot technology to enhance accuracy.
  • 🌐 Perplexity is not just a search engine but a 'knowledge discovery engine' that encourages users to explore further after receiving an answer.
  • 📊 A comparison between Perplexity and traditional search engines like Google highlights Perplexity's strengths in direct answers and synthesized information.
  • 🚀 Perplexity's AI, while impressive, is not yet a full replacement for Google due to limitations in accuracy, speed, and real-time information provision.
  • 🔗 The platform generates related searches and questions to spark curiosity and continue the knowledge discovery process.
  • 🌟 The potential for personalization and localization in Perplexity's responses could significantly enhance the user experience by anticipating needs and providing contextually relevant information.

Q & A

  • What is the primary function of Perplexity?

    -Perplexity is primarily described as an answer engine that provides direct answers to questions by synthesizing information from various sources, backed by citations.

  • How does Perplexity combine the search engine and LLM functionalities?

    -Perplexity uses a search engine to find relevant sources, extracts information, and then feeds it into a large language model (LLM) to formulate a well-formatted answer with appropriate citations.

  • What is the significance of sourcing in Perplexity's operation?

    -Sourcing is crucial as it ensures that every statement made by Perplexity is backed by credible sources, similar to academic writing, which enhances the accuracy and reliability of the answers provided.

  • How does Perplexity's approach differ from traditional search engines?

    -Unlike traditional search engines that provide a list of links, Perplexity focuses on delivering direct answers synthesized from multiple sources, aiming to be more like an academic paper with citations.

  • What inspired the creation of Perplexity?

    -The creators' academic roots and the need for accurate information led to the development of Perplexity. They wanted a system that would only provide information that could be cited from the internet, ensuring accuracy.

  • Why did the founders feel the need to build Perplexity?

    -The founders felt the need to build Perplexity due to their own experiences with the inadequacy of existing search engines in providing clear and accurate answers, especially in areas like insurance where they were novices.

  • How does Perplexity ensure the accuracy of its answers?

    -Perplexity ensures accuracy by mandating that every piece of information provided is backed by sources found on the internet, following the academic principle of citing every statement made in a paper.

  • What is the role of user experience in Perplexity's design?

    -User experience plays a significant role in Perplexity's design as it aims to provide direct answers and synthesize information in a way that is easy for users to understand and appreciate, much like Wikipedia.

  • How does Perplexity handle real-time information?

    -Perplexity is working on integrating real-time information, but it is a complex task that requires prioritizing recent information and presenting it in a user-friendly manner.

  • What are some of the challenges Perplexity faces in providing accurate answers?

    -Some challenges include ensuring accuracy and speed, integrating real-time information, and designing custom UIs for specific sets of questions to present information in the best possible way.

  • How does Perplexity's approach compare to Google for everyday searches?

    -While Perplexity provides direct answers and summaries, Google is faster and more efficient for simple navigational queries and real-time information. Perplexity is not yet a full replacement for Google for everyday searches.

Outlines

00:00

🔍 The Concept of Perplexity as an Answer Engine

The first paragraph introduces Perplexity as an innovative answer engine that functions by combining the capabilities of a traditional search engine with a large language model (LLM). It emphasizes that Perplexity is designed to provide answers backed by sources, akin to academic research. The process involves extracting relevant information from search results, feeding it into the LLM, and generating a well-formatted answer with citations. The goal is to ensure accuracy and coherence in the responses, inspired by the rigorous standards of academic writing where every statement is supported by a citation. The narrative also touches upon the challenges faced by the founders, who, despite being new to building a startup, sought to create a product that could answer questions with the same reliability and depth as academic research.

05:00

🌐 Perplexity as a Knowledge Discovery Engine

The second paragraph delves into the distinction between Perplexity and traditional search engines. It positions Perplexity as a knowledge discovery engine, emphasizing the journey of learning that begins after receiving an answer. The speaker highlights the importance of related questions and the continuous pursuit of knowledge, which is central to Perplexity's design. The paragraph contrasts Perplexity's approach of providing direct answers with the link-based results of search engines like Google. It also discusses the user experience and technological integration that sets Perplexity apart, including its ability to handle poorly phrased questions and generate related searches to spark curiosity and further exploration. The speaker acknowledges that while Perplexity is impressive, it is not yet a full replacement for Google for everyday searches, citing differences in speed, accuracy, and the ability to provide real-time information.

10:01

🛠️ Challenges in Building a User-Centric Knowledge Engine

The third paragraph addresses the complexities involved in creating a user-centric knowledge engine like Perplexity. It discusses the need for custom user interfaces (UIs) tailored to specific queries and the importance of presenting information in a way that anticipates user needs. The speaker reflects on the limitations of relying solely on advanced models and emphasizes the importance of product design in delivering a personalized and anticipatory user experience. The paragraph also touches on the potential for personalization based on user habits and interests, suggesting that a combination of location, frequented sites, and topics of interest can significantly enhance the personalization of search results. The discussion concludes with a hypothetical scenario where Perplexity could provide not just the weather but also practical advice like what to wear, based on the user's activities and preferences.

Mindmap

Keywords

💡Perplexity

Perplexity in the script refers to an AI system that acts as an 'answer engine'. It is designed to provide direct answers to questions by synthesizing information from various sources, backed by citations. This is akin to how an academic writes a paper with each claim supported by references. The system is described as a combination of a search engine and a large language model (LLM), aiming to deliver accurate and well-referenced responses.

💡Search Engine

A search engine, as mentioned in the script, is a system that retrieves information from the internet based on user queries. It plays a crucial role in the Perplexity system by sourcing relevant content from the web. The script differentiates Perplexity from a traditional search engine by highlighting that while search engines provide a list of links, Perplexity synthesizes information into a direct answer.

💡LLM (Large Language Model)

LLM stands for Large Language Model, which is a type of AI that processes and generates human-like text based on the input it receives. In the context of the script, the LLM takes the relevant information extracted by the search engine and formulates a coherent, well-structured answer to the user's query. The LLM is integral to how Perplexity operates, as it ensures the response is not only informative but also readable.

💡Citations

Citations in the script are references to the sources of information used to construct an answer. They are compared to how an academic paper is written, where every statement is backed by a citation from a reliable source. Perplexity's use of citations ensures that the answers provided are not only accurate but also verifiable, adding credibility to the responses.

💡Academic Writing

Academic writing is a formal style of composing that is characterized by its precision, objectivity, and extensive use of citations. The script uses academic writing as a model for how Perplexity should construct its answers. It emphasizes the importance of basing every statement on credible sources, which is a principle that Perplexity adopts to ensure the accuracy and reliability of its responses.

💡Coherence

Coherence in the script refers to the quality of being logically organized and consistent. It is applied to how Perplexity generates responses that are not only accurate but also make sense as a narrative. The system is instructed to create coherent answers, which means the information must flow logically and be easy for users to understand.

💡Knowledge Discovery

Knowledge discovery is the process of finding new information or insights from existing data. In the script, this concept is applied to how Perplexity functions as an 'answer engine' that not only provides answers but also sparks further inquiry. It suggests that the journey of learning begins after receiving an answer, encouraging users to explore more related questions.

💡Real-time Information

Real-time information refers to data that is provided as soon as it becomes available. The script mentions that Google excels at providing real-time information, such as sports scores, which is a feature that Perplexity aims to integrate. This is important for queries where up-to-date information is crucial, such as weather updates or stock prices.

💡User Experience (UX)

User experience (UX) in the script pertains to how interactions with a system are designed to be user-friendly and satisfying. Perplexity focuses on enhancing UX by providing direct answers and synthesizing information in a way that is easy for users to consume. The script discusses how Perplexity's approach to UX differs from traditional search engines by offering a more streamlined and engaging experience.

💡Personalization

Personalization in the script refers to tailoring the information or services provided by a system to individual users based on their habits, preferences, or past behavior. It is mentioned as a potential area for improvement in Perplexity, where the system could provide more relevant answers by understanding the user's context, such as location or interests.

Highlights

Perplexity is described as an answer engine that provides answers backed by sources, similar to academic referencing.

The search engine component of Perplexity extracts relevant results from the internet to feed into the large language model (LLM).

The LLM processes the extracted information and generates a well-formatted answer with citations.

Perplexity is instructed to write answers with academic rigor, ensuring each statement is supported by a citation.

The concept of Perplexity was born out of the need for accurate information, inspired by the founders' academic backgrounds.

The founders aimed to create a chatbot that only states facts it can source from the internet, mirroring academic practices.

Perplexity's approach is compared to Wikipedia, which requires sources for edits, emphasizing the importance of notable sources.

The product's development was driven by the need to answer questions accurately, especially in areas like insurance where the founders were inexperienced.

Perplexity is positioned as a knowledge discovery engine, going beyond just providing answers to spark curiosity and further inquiry.

The user experience with Perplexity includes AI-generated summaries and a focus on direct answers rather than a list of links.

Perplexity generates related searches to continue the knowledge discovery process, encouraging users to explore more.

The comparison between Perplexity and Google highlights Perplexity's strengths in direct answers and summarization.

Google's speed and efficiency in providing real-time information and direct navigation are noted as advantages over Perplexity.

Perplexity's challenge in accuracy and speed is discussed, with a focus on the need for improvement in these areas.

The importance of UI design in presenting information effectively and anticipating user needs is emphasized.

The potential for personalization in Perplexity is discussed, suggesting that it could enhance the user experience.

The concept of localization and personalization based on user habits and preferences is explored as a way to improve the product.

The idea of providing additional context beyond the direct answer, such as suggesting what to wear based on weather, is presented as a future enhancement.

Transcripts

play00:03

perplexity is part search engine part

play00:05

llm so how does it work and what role

play00:09

does each part of that the search and

play00:11

the llm play in uh serving the final

play00:13

result perplexity is best described as

play00:16

an answer engine so you ask it a

play00:18

question you get an answer except the

play00:21

difference is all the answers are backed

play00:24

by

play00:25

sources this is like how an academic

play00:27

writes a paper now that referencing part

play00:31

the sourcing part is where the search

play00:33

engine part comes in so you combine

play00:35

traditional search extract results

play00:38

relevant to the query the user asked you

play00:41

read those links extract the relevant

play00:44

paragraphs feed it into an llm llm means

play00:48

large language model and that llm takes

play00:52

the relevant paragraphs looks at the

play00:54

query and comes up with a well formatted

play00:57

answer with appropriate footnote to

play01:00

every sentence it says because it's been

play01:02

instructed to do so it's been instructed

play01:04

with that one particular instruction of

play01:06

given a bunch of links and paragraphs

play01:08

write a concise answer for the user with

play01:11

the appropriate citation so the magic is

play01:14

all of this working together in one

play01:16

single orchestrated product and that's

play01:19

what we build perplexity for so it was

play01:21

explicitly instructed to uh write like

play01:24

an academic essentially you found a

play01:26

bunch of stuff on the internet and now

play01:28

you generate something called coherent

play01:31

and uh something that humans will

play01:33

appreciate and cite the things you found

play01:36

on the internet in the narrative you

play01:38

create for the human correct when I

play01:40

wrote my first paper uh the senior

play01:42

people who are working with me on the

play01:44

paper told me this one profound thing

play01:47

which is that every sentence you write

play01:49

in a

play01:50

paper should be backed with a citation

play01:54

with a with a citation from another

play01:56

peer-reviewed paper or an experimental

play01:59

result in your on paper anything else

play02:01

that you say in the paper is more like

play02:03

an

play02:04

opinion that's it's it's a very simple

play02:06

statement but pretty profound and how

play02:08

much it forces you to say things that

play02:10

are only

play02:11

right and we took this principle and

play02:14

asked

play02:16

ourselves what is the best way to make

play02:18

chat

play02:19

Bots

play02:21

accurate is force it to only say things

play02:24

that it can find on the

play02:26

internet right and find from multiple

play02:28

sources so

play02:31

this kind of came out of a need rather

play02:33

than oh let's try this idea when we

play02:36

started the startup there were like so

play02:38

many questions all of us had because we

play02:40

were complete

play02:42

noobs never built a product before never

play02:44

built like a startup before of course we

play02:47

had worked on like a lot of cool

play02:48

engineering and research problems but

play02:50

doing something from scratch is the

play02:53

ultimate

play02:54

test and there were like lots of

play02:56

questions you know what is the health

play02:58

insur like the first employee we had

play02:59

hired he came and asked us for health

play03:02

insurance normal need I didn't care I

play03:05

was like why do I need a health

play03:07

insurance this company dies like who

play03:10

cares um my other two co-founders had

play03:13

were married so they had health

play03:14

insurance to their spouses but this guy

play03:17

was like looking for health

play03:19

insurance and I didn't even know

play03:22

anything who are the providers what is

play03:24

co insurance or deductible or like none

play03:26

of these made any sense to me and you go

play03:28

to Google insurance is a category where

play03:32

like a major ad spend category so even

play03:35

if you ask for something you're not

play03:37

Google has no incentive to give you

play03:38

clear answers they want you to click on

play03:40

all these links and read for yourself

play03:42

because all these insurance providers

play03:43

are biding to get your attention so we

play03:47

integrated a slackbot that just PS GPD

play03:51

3.5 and answered a

play03:53

question now sounds like problem solve

play03:56

except we didn't even know whether what

play03:58

it said was correct or not and in fact

play04:00

was saying incorrect things we were like

play04:02

okay how do we address this problem and

play04:04

we remembered our academic Roots um you

play04:07

know Dennis and myself were both

play04:09

academics then is my

play04:10

co-founder and we said okay what is one

play04:13

way we stop ourselves from saying

play04:15

nonsense in a peerreview paper we're

play04:18

always making sure we can cite what it

play04:19

says what what we what we write every

play04:21

sentence now what if we ask the chatbot

play04:23

to do that and then we realized that's

play04:25

literally how Wikipedia works in

play04:27

Wikipedia if you do a random edit people

play04:31

expect you to actually have a source for

play04:33

that and not just any random Source they

play04:36

expect you to make sure that the source

play04:38

is

play04:38

notable you know there are so many

play04:40

standards for like what counts is

play04:42

notable and not so you decide this is

play04:45

worth working on it's not just a problem

play04:47

that will be solved by an smarter model

play04:49

because there's so many other things to

play04:51

do on the search layer and the sources

play04:52

layer and making sure like how well the

play04:55

answer is formatted and presented to the

play04:57

user so that's why the product exists

play05:00

well there's a lot of questions to ask

play05:01

there but first zoom out once again so

play05:05

fundamentally it's about search so you

play05:08

said first there's a search element mhm

play05:12

and then there's an storytelling element

play05:15

via

play05:16

llm and the citation element but it's

play05:19

about search first so you think of

play05:21

perplexity as a search engine mhm I

play05:23

think of perplexity as a knowledge

play05:26

discovery engine neither a search engine

play05:28

I mean of course we call it an answer

play05:30

engine but everything matters here uh

play05:34

the journey doesn't end once you get an

play05:36

answer in my opinion the Journey Begins

play05:38

after you get an answer you see related

play05:41

questions at the bottom suggested

play05:43

questions to ask why because maybe the

play05:47

answer was not good enough or the answer

play05:49

was good enough but you probably want to

play05:51

dig deeper and ask

play05:54

more

play05:56

and that's why in in the search bar we

play05:58

say where knowledge begins

play06:00

cuz there's no end to knowledge you can

play06:03

only expand and grow like that's the

play06:04

whole concept of the beginning of

play06:06

infinity book by David do you always

play06:08

seek new knowledge so I see this as sort

play06:11

of a discovery process you start you

play06:14

know let's say you literally whatever

play06:15

you ask me to right now you could have

play06:18

asked perplexity too MH hey perplexity

play06:22

is it a search engine or is it an answer

play06:23

oral what is it and then like you see

play06:25

some questions at the bottom right we're

play06:27

going to straight up ask this right now

play06:29

I I don't know how it's going to work is

play06:32

a perplexity a search engine or an

play06:35

answer

play06:36

engine that's a poorly phrased question

play06:39

but one of the things I love about

play06:41

perplexity the poorly phrased questions

play06:43

will nevertheless lead to interesting

play06:45

directions perplexity is primarily

play06:47

described as an answer engine rather

play06:49

than a traditional search engine key

play06:51

points showing the difference between

play06:54

answer Engine versus search engine uh

play06:57

this is so nice and it compares

play06:59

perplexity versus a traditional search

play07:01

engine like Google so Google provides a

play07:04

a list of links to websites perplexity

play07:06

focuses on providing direct answers

play07:08

synthesizing information from various

play07:10

sources user experience technological

play07:14

approach uh so there's AI integration

play07:18

with Wikipedia like responses this is

play07:20

really well done and you look at the

play07:22

bottom right right so you you were not

play07:24

intending to ask those

play07:27

questions but they're relevant like can

play07:29

perplexity replace Google for everyday

play07:32

searches all right let's click on that

play07:34

but the really interesting generation

play07:35

that task that step of generating

play07:38

related searches so the next step of the

play07:42

Curiosity journey of expanding your

play07:43

knowledge is really interesting exactly

play07:45

so that's what David do shes in this

play07:46

book which is for creation of new

play07:49

knowledge starts from the spark of

play07:51

curiosity to seek

play07:54

explanations and then you find new

play07:56

phenomenon or you get more depth on

play07:58

whatever knowledge you already have I

play08:00

really love the steps that the pro

play08:02

search is doing compare perplexity and

play08:03

Google for everyday searches step two

play08:05

evaluate strengths and weaknesses of

play08:07

perplexity evaluate strengths and

play08:08

weaknesses of Google it's like a

play08:10

procedure yeah complete okay answer

play08:13

perplexity AI while impressive is not

play08:15

yet a full replacement for Google for

play08:17

everyday searches yes here are the key

play08:19

points based on the provided sources

play08:22

strength of perplexity AI direct answers

play08:24

AI part summaries Focus search user

play08:27

experience we can dig into the details

play08:29

of a lot of these weaknesses of

play08:31

perplexity AI accuracy and speed

play08:34

interesting I don't know if that's

play08:35

accurate well Google Google is faster

play08:37

than perplexity because you instantly

play08:38

render the links the latency is it's

play08:41

like you get 200 300 to 400 milliseconds

play08:43

results here it's like you know still

play08:46

not about like th000 milliseconds here

play08:48

right for simple navigational queries

play08:51

such as finding a specific website

play08:52

Google is more efficient and reliable so

play08:54

if you actually want to get straight to

play08:56

the source yeah you just want to go to

play08:58

kayak yeah just want to go fill up a

play09:01

form like you want to go like pay your

play09:02

credit card dues real time information

play09:05

Google excels in providing real time

play09:07

information like sports SC so like while

play09:10

I think perplexity is trying to

play09:12

integrate real time like recent

play09:14

information put Priority on recent

play09:15

information that require that's like a

play09:17

lot of work to integrate exactly because

play09:19

that's not just about throwing an

play09:21

llm uh you like when you're asking oh

play09:24

like what what dress should I wear out

play09:26

today in Austin um you you do want to

play09:30

get the weather across the time of the

play09:31

day even though you didn't ask for it

play09:34

and then Google presents this

play09:35

information in like cool

play09:37

widgets um and I think that is where

play09:41

this is a very different problem from

play09:42

just building another

play09:45

chatbot and and and the information

play09:47

needs to be presented well and and the

play09:49

user intent like for example if you ask

play09:51

for a stock price uh you might even be

play09:55

interested in looking at the historic

play09:56

stock price even though you never asked

play09:57

for it you might be interested in

play09:59

today's price these are the kind of

play10:01

things that like you have to build as

play10:04

custom uis for every query and why I

play10:08

think this is a hard problem it's not

play10:10

just like the Next Generation model will

play10:13

solve the previous generation models

play10:15

problems here the next Generation model

play10:16

will be smarter you can do these amazing

play10:19

things like planning like query breaking

play10:22

it down into pieces collecting

play10:23

information aggregating from sources

play10:25

using different tools those kind of

play10:27

things you can do you can keep answering

play10:29

in harder and harder queries but there's

play10:32

still a lot of work to do on the product

play10:34

layer in terms of how the information is

play10:36

best presented to the user and how you

play10:39

think backwards from what the user

play10:41

really wanted and might want as a next

play10:43

step and give it to them before they

play10:44

even ask for it but I don't know how

play10:47

much of that is a UI problem of

play10:50

Designing custom UI for a specific set

play10:53

of questions I think at the end of the

play10:55

day Wikipedia

play10:57

looking uh UI is good enough if the raw

play11:02

content that's provided the text content

play11:05

is is powerful so if I want to know the

play11:07

weather m in Austin if it like gives

play11:11

me five little pieces of information

play11:14

around that maybe the weather today and

play11:16

maybe uh other links to say do you want

play11:19

hourly and maybe it gives a little extra

play11:21

information about rain and temperature

play11:24

all that kind of stuff yeah exactly but

play11:25

you would like the product when you as

play11:29

for weather uh let's say it localizes

play11:32

you to Austin automatically and not just

play11:35

tell you it's hot not just tell you it's

play11:37

humid but also tells you what to

play11:40

wear you wouldn't ask for what to wear

play11:43

but it would be amazing If the product

play11:45

came and told you what to wear how much

play11:47

of that could be made much more powerful

play11:50

with some memory with some

play11:51

personalization a lot more definitely I

play11:54

mean but the personalization there's an

play11:56

8020 here the 8020 is

play12:00

achieved uh

play12:02

with your

play12:05

location let's say your

play12:07

Jer and then you know like like sites

play12:11

you typically go to like a rough sense

play12:13

of topics of what you're interested in

play12:15

all that can already give you a great

play12:17

personalized experience mhm it doesn't

play12:19

have to like have infinite

play12:22

memory infinite context Windows have

play12:25

access to every single activity you've

play12:26

done that's an Overkill yeah yeah I mean

play12:30

humans are creatures of habit most of

play12:31

the time we do the same thing and yeah

play12:33

it's like first few principal

play12:36

vectors first few principal first like

play12:38

most most important I vectors yes yeah

play12:42

thank you for reducing humans to that to

play12:45

the most important IG vectors right but

play12:47

like for me usually I check the weather

play12:49

if I'm going running so it's important

play12:51

for the system to know that running is

play12:53

an activity that I do but also depends

play12:56

on like you know when you when you run

play12:58

like if you're asking night maybe you're

play12:59

not looking for running but right but

play13:02

then that that starts to get into

play13:03

details really I never ask a night cuz I

play13:06

don't care so like usually it's always

play13:08

going to be a running about running and

play13:09

even at night it's going to be about

play13:11

running because I love running at night

Rate This

5.0 / 5 (0 votes)

Связанные теги
AI AnswersSearch EngineAcademic SourcingKnowledge DiscoveryReal-Time InfoUser ExperienceChatbot AccuracyHealth InsuranceStartup InsightsCuriosity Journey
Вам нужно краткое изложение на английском?