Book Recommendation System in Python with LLMs
Summary
TL;DR: In this video, the host guides viewers through coding a book recommendation system using Python and large language models. The system involves creating a vector store to hold vector representations of books, transforming textual data into 4,096-dimensional vectors with the help of LLMs like Llama 2. The video demonstrates how to use a dataset from Kaggle, craft textual representations, and perform similarity searches to recommend the most relevant books. The host also highlights the importance of maintaining consistent data structures for accurate recommendations.
Takeaways
- The video is about creating a book recommendation system using large language models in Python.
- The goal is to build a vector store that contains vector representations of various books.
- Book attributes like title, description, author, and publishing date are transformed into a textual representation and then into a high-dimensional vector.
- These vectors are 4,096-dimensional, intelligently derived from the text to represent each book uniquely.
- The system performs a similarity search to find the closest vectors in the vector space to a newly input book's vector, suggesting the most similar books.
- Large language models (LLMs) are used for the embedding process, which is crucial for intelligently converting text into meaningful vectors.
- The video uses ollama for convenience, a tool that allows running models locally to get text embeddings.
- A FAISS vector store from Facebook is used to store and search through the vectors.
- The dataset used is the 7K books dataset from Kaggle, chosen because it includes book descriptions, which are essential for accurate representation.
- A textual representation function is created to structure the book data in a way that is useful for the LLM.
- The video emphasizes maintaining a consistent data structure when building and querying the vector store to ensure accurate recommendations.
Q & A
What is the main topic of the video?
-The main topic of the video is how to code a book recommendation system using large language models in Python.
What is the purpose of building a vector store for the book recommendation system?
-The purpose of building a vector store is to contain vector representations of different books, which will be used for similarity search to recommend books.
What attributes of books are mentioned in the script as being used for the recommendation system?
-The attributes mentioned include title, description, author, publishing date, categories, average rating, and number of pages.
Why are vector representations used instead of raw text for the book recommendation system?
-Vector representations are used because they intelligently encode the text into a high-dimensional space where similarity can be measured numerically, which is not possible with raw text.
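The intuition can be illustrated with a toy sketch: once texts are mapped to vectors, "similarity" becomes a plain distance computation. The 3-dimensional vectors below are made up for illustration; real embeddings from Llama 2 have 4,096 dimensions.

```python
import numpy as np

# Made-up toy "embeddings" -- real ones come from an LLM and are 4,096-dimensional.
book_a = np.array([0.9, 0.1, 0.0])  # e.g. a self-help book
book_b = np.array([0.8, 0.2, 0.1])  # a similar self-help book
book_c = np.array([0.0, 0.1, 0.9])  # an unrelated novel

# Euclidean (L2) distance: smaller means more similar.
print(np.linalg.norm(book_a - book_b))  # small distance -> similar books
print(np.linalg.norm(book_a - book_c))  # large distance -> dissimilar books
```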
What is the dimensionality of the vectors that represent the books in the system?
-The dimensionality of the vectors is 4096, meaning each vector has 4096 numerical values.
Which model is used for text embedding in the video?
-The video uses 'llama 2' for text embedding, which is a large language model that can convert text into a meaningful vector representation.
What is the role of the 'requests' package in the video script?
-The 'requests' package is used to send a request to the ollama API (serving the Llama 2 model) to get the embedding for a given textual representation of a book.
What is the data set used in the video for building the book recommendation system?
-The data set used is the '7K books data set' from Kaggle, which includes descriptions along with other attributes of the books.
How does the script handle the process of finding similar books once the vector store is built?
-The script performs a similarity search in the vector store by finding the vector that is closest to the vector representation of a new book, and then recommends the books associated with the closest vectors.
What is the importance of keeping the textual representation structure consistent when building the vector store?
-Keeping the textual representation structure consistent is important because it ensures that the vector store accurately reflects the text and maintains the integrity of the similarity search results.
How does the video demonstrate the effectiveness of the book recommendation system?
-The video demonstrates the effectiveness by showing the process of finding similar books to a given book and displaying the recommended books that are indeed similar in genre or theme.
Outlines
Introduction to Building a Book Recommendation System
The video begins with an introduction to building a book recommendation system using large language models in Python. The presenter outlines the process of creating a vector store to hold vector representations of various books, including attributes like title, description, author, and publishing date. The main goal is to convert these textual attributes into 4,096-dimensional vectors intelligently using large language models, allowing for a similarity search to recommend books.
Crafting Textual Representations for Books
The second paragraph delves into the specifics of creating textual representations for each book from the dataset. The presenter discusses the importance of selecting relevant information and structuring it in a way that is useful for the language model. A function is introduced to convert each row of the dataset into a string containing the book's title, authors, categories, description, publishing year, average rating, and number of pages, which will be used for generating embeddings.
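The function described above can be sketched as follows. This is a minimal version, assuming the column names of the 7K books dataset (`title`, `authors`, `categories`, `description`, `published_year`, `average_rating`, `num_pages`); the exact field order the presenter ends up using differs (see the troubleshooting section).

```python
def textual_representation(row):
    # Build one plain-text string per book; 'row' can be a pandas row
    # or any dict-like object with the 7K books dataset's columns.
    return f"""Title: {row['title']}
Authors: {row['authors']}
Categories: {row['categories']}
Description: {row['description']}
Publishing year: {row['published_year']}
Average rating: {row['average_rating']}
Number of pages: {row['num_pages']}"""

# With pandas, it is applied row-wise:
# df['textual_representation'] = df.apply(textual_representation, axis=1)
```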
Embedding Text into Vectors and Storing Them
In this segment, the focus shifts to embedding the textual representations into vectors and storing them in a vector store. The presenter discusses using the Faiss library for the vector store and the Llama 2 model for generating embeddings from text. The process involves sending requests to the Llama 2 API with the textual representations and receiving the corresponding 4,096-dimensional vectors, which are then added to the Faiss index.
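The request the video sends can be sketched like this. The helper function below is hypothetical (introduced here for clarity, not from the video); the endpoint and payload shape follow ollama's embeddings API as described in the video, and the actual network call is commented out since it only works with a local ollama server running.

```python
OLLAMA_URL = "http://localhost:11434/api/embeddings"  # ollama's default port

def build_embedding_request(text, model="llama2"):
    # JSON body for ollama's /api/embeddings endpoint; the response
    # carries the vector under the "embedding" key.
    return {"model": model, "prompt": text}

# With a local server running (`ollama pull llama2`, then `ollama serve`):
# import requests, numpy as np
# resp = requests.post(OLLAMA_URL, json=build_embedding_request(text))
# vec = np.array(resp.json()["embedding"], dtype="float32")  # 4,096 values
```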
Searching for Similar Books Using the Vector Store
The fourth paragraph explains how to use the created vector store to find similar books. The presenter demonstrates how to take a book's textual representation, embed it using the same model, and perform a similarity search to find the closest vectors in the vector space. This process involves using the Faiss index to search for the top matches based on the embedded vector of the book in question.
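What `faiss.IndexFlatL2.search` does can be reproduced in a few lines of NumPy, which makes the mechanics clear: it is an exhaustive nearest-neighbor lookup by L2 distance. This brute-force stand-in is only an illustration of the idea, not a replacement for FAISS at scale.

```python
import numpy as np

def search_l2(stored, query, k=5):
    # Brute-force equivalent of a flat L2 index search for one query:
    # return (distances, indices) of the k nearest stored vectors.
    dists = np.linalg.norm(stored - query, axis=1)
    order = np.argsort(dists)[:k]
    return dists[order], order

stored = np.array([[0.0, 0.0], [1.0, 1.0], [0.1, 0.0]], dtype="float32")
_, idx = search_l2(stored, np.array([0.0, 0.1], dtype="float32"), k=2)
print(idx)  # indices of the two nearest stored vectors, nearest first
```

The returned indices are then mapped back to rows of the dataframe to display the recommended titles, exactly as the video does with the FAISS result.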
Troubleshooting and Finalizing the Recommendation System
The final paragraph addresses potential issues that may arise when using different structures for the textual representation during the embedding process. The presenter emphasizes the importance of maintaining consistency in the structure used for training the index and when performing searches. After resolving any issues, the video concludes with a demonstration of how to use the system to recommend similar books based on a given title, showcasing the effectiveness of the recommendation system.
Mindmap
Keywords
Book Recommendation System
Large Language Models (LLMs)
Vector Representation
Vector Store
Embedding
Faiss
Kaggle Dataset
Textual Representation
Similarity Search
API
Dimensionality
Highlights
Introduction to building a book recommendation system using large language models in Python.
Creating a vector store (VSS) to contain vector representations of books.
Using attributes like title, description, author, and publishing date to build textual representations of books.
Transformation of textual data into 4,096-dimensional vectors for intelligent representation.
Similarity search to find the closest vector in the vector space for book recommendations.
Utilization of large language models (LLMs) for the intelligent embedding process.
Choice of using ollama for the convenience of running models locally.
Explanation of using the Facebook AI Similarity Search (FAISS) vector store.
Selection of the 7K books dataset from Kaggle for its descriptive content.
Installation of necessary packages like pandas, numpy, and FAISS for data processing.
Crafting a textual representation function to structure book information.
Application of the textual representation function to the entire dataset.
Importance of maintaining consistent structure for embedding and similarity search.
Process of embedding book data into the vector store using the Llama 2 model.
Performance of similarity search to find the top five most similar books.
Demonstration of finding similar books to 'How to Win Friends and Influence People'.
Discussion on the practicality and potential improvements of the book recommendation system.
Encouragement for viewers to experiment with different textual representations for better results.
Conclusion and invitation for feedback on the video's content and approach.
Transcripts
what is going on guys welcome back in
this video today we're going to learn
how to code a book recommendation system
by utilizing large language models in
Python so let us get right into
it all right so we're going to build a
book recommendation system in Python
today by utilizing large language models
and I want to briefly sketch the process
nothing too complicated and nothing too
detailed I just want to show you
basically here uh visually what we're
going to do very basic I'm going to use
my mouse so this is not going to be the
most beautiful drawing but our goal is
to build up a vector store so a vector
database I'm going to say VSS here
Vector store uh and this is going to
contain Vector representations of a
bunch of different books so the idea is
we have a data set full of different
books and these books have certain um
attributes like a title a description an
author a publishing date and so on so we
have a bunch of attributes here and what
we want to do is we want to take those
and build a textual representation so
basically just raw text containing this
information in some way and then we want
to somehow intelligently take that uh
and turn it into a vector so all
representation are going to be turned
into vectors and these vectors have some
values I don't know four 0.5 something
and this is an uh a very high
dimensional Vector so I think let me
just double check here in my prepared
code this is going to be a 4,096
dimensional vector so it's going to have
4,096 numerical values uh which are
not random which are intelligently
Chosen and then these vectors are going
to be stored in the vector store and
when I get a new book with new
information what I do is I take that
turn it into the same kind of textual
representation turn it into the same
kind of vector using the same model so
into the same kind of uh Vector
representation here and then I ask
what's the closest Vector to this one
this is a similarity search so um
mathematically if I have two points in a
4,096 dimensional vector space there is a
distance between all the different
points and what I'm asking is using this
new Vector what's the closest Vector
what's the closest point in this
high-dimensional Vector space to this
one and I assume this is going to be the
most similar book so I'm going to use
the five uh closest vectors to determine
the five most similar books and
recommend them uh as a result now the
interesting part of this whole system is
uh basically this Arrow here because
that is the embedding process taking
text and turning it into a vector
intelligently and for this we're going
to use
llms so large language models this is
where the intelligence is needed because
we need to take a text and we need to
somehow take the content of this text
and turn it into something that is
meaningfully represented in a 4,096
dimensional Vector space and then we
need to be able to do a similarity
search there so that's what we're going
to do now for the embedding model um if
you want to change the code to use
something else you can do whatever you
want you can use chat GPT so the open AI
API you can use GPT you can use uh any
kind of self-hosted model I'm going to
use ollama just for convenience I
have a video on this channel showing how
to install and use ollama basically it
allows you to easily run models locally
I'm going to use uh I think let me just
double check here I'm going to use llama
2 um just because it fits into my
hardware and I'm going to use llama 2 to
get text feed it into it and get an
embedding out a 4,096 dimensional
embedding uh and as a vector store we're
going to use faiss so the Facebook
vector store um all right and as a data
set we're going to use a kaggle data set
which is the 7K books data set the
reason I chose this one is because it
also has descriptions so we have
actually text describing the content of
the book not just the title because the
title can be very misleading or very uh
simple so we want to have these
descriptions here as well so this is the
data set I'm going to use you will find
a link to it in the description down
below um and actually I think I have to
download it because I don't have it in
my directory so I'm going to just
download um going to go to python
current here I'm going to download the
archive zip and in here we have the book
CSV file I'm going to extract
it and uh I'm going to just close this
all right so now I should have the book
CSV file here I'm going to open up a new
Jupiter notebook instance and we should
install a couple of packages first the
basic data assign stuff as always so
pandas numpy should always be part of
the equation here uh and I think for
this we're going to use um faiss as well
as I said so we're going to use also the
requests package because we need to send
a request to the API of ollama and we're
going to use faiss and then you can use
uh I think faiss-gpu and faiss-cpu I
think I'm using faiss-gpu here so this
utilizes of course the graphical
processing unit uh and not the CPU but
if you don't have a GPU a strong GPU you
can also go with faiss-cpu but this is
what we need for this video today all
right so we're going to start by loading
the data set and taking a look at it
this should not be too difficult so the
data frame is equal to pandas read uh
the CSV file which is called book
CSV and then I can look at it basically
what kind of textual representation you
want to use is up to you you can be
creative with this one because at the
end of the day you want to pick a
representation that contains only the
necessary information only the useful
information uh and you also want to
structure it in a way that is the most
useful for the llm now what this way is
I don't really know you have to play
around with it you have to see if
different representations give you
better results my Approach here is to
just craft a string saying um listing
the specific information that we need so
for example saying the title is this the
author is this the publishing year is
this the categories are these and so on
and so forth um so yeah this is the data
that we have I'm going to use the title
I'm going to use the authors I'm going
to use the category I'm going to use the
description uh the publishing year and
maybe maybe let's go also with the
rating so the rating could also be
interesting because yeah I mean if a
book has an average rating of one it's
probably not that
good um all right so what we're going to
do is we're going to create a function
which is going to take a row and turn it
into a textual representation so we're
going to say textual
representation by the way I have a very
very similar video where I do the same
thing with movies if you're more
interested in movie recommendation you
can check this out uh today we're going
to do books so textual
representation uh we get a row ENT input
here uh and then basically we just take
the content of this row so title the
authors and so on and turn it into a
string so I'm going to use an F string
here A multi-line F string so three
quotation marks uh and I'm going to say
that this is the text ual
representation uh and we're going to
start by saying first of all the title
of the book is
row title like
this um actually shouldn't this oh I'm
using double curly brackets why okay we're not in Flask
here we use just single uh curly
brackets so this is the title now and
then I can do a line break and I can say
the next thing is uh the authors the
authors is going to be equal equal to
row and then
authors and then I can do the same thing
for all the other rows uh for all the
other columns so again we have title we
have authors categories description so
let's go with uh description first
description is going to be equal to row
description
obviously then the categories or maybe
we should say I mean categories is fine
maybe we should say genre but I'm
not sure about
that categories um and then what did I
say we want to go with the publishing
year and the average rating that should
be it so we're going to go with
publishing year is going to be
row published year is the column
name and then finally we have uh
average rating
row uh
average rating and I think one more
feature that is useful is uh the number
of pages because maybe I'm only
interested in very small books so I
usually only read books like 70 pages
long uh that might also be a factor even
though maybe not the most important one
so let's say number of pages going to be
equal to
row num
Pages all right so that function takes a
row and turns it into such a string and
then all I can do is or all I have to do
is I have to just return this textual
representation like this all right so
let's see what happens when I apply it
so let's go and say DF and then iloc up
until five and then
apply the function textual
representation
um do I have to provide an access I
think
so there you go so this function now
applied gives us that maybe we can go
values zero and actually I can go and
print that to get the result here and
you can see we get the title the
author's description categories
publishing year average rating and
number of pages so that is what our
function does now now we need to apply
that function to all the
individual um rows so we're going to say
DF
textual
representation is going to be equal to
DF apply textual representation AIS
equals
1 that is uh turning our data set into
one where we have these textual
representations here all
right
so we have the this now and the next
thing we want to do is we want to take
all of this and make an embedding and
put everything into a vector store so
we're going to say here import faiss
import requests now we don't need to
really interact with ollama uh the only
thing that you need with ollama is of
course you need to say uh ollama serve I
think or ollama run um and then you need to
say ollama pull llama2 that's important
because you need to have the model on
your system um and again I have a video
on ollama if you have struggles with ollama
check out that video so import faiss
import requests and then import numpy as
NP then we say the dimensionality is
4,096 as I said and then the index so
the vector store is going to be
faiss.IndexFlatL2 with the
dimensionality here we choose this
dimensionality because that is the
dimensionality of the response we get
from llama 2 when it comes to the
embedding
uh and then we want to say x is equal to
np0 and here we pass length data frame
textual
representation and dimensionality so we
just initialize uh these uh actually we
need to also pass the data type D type
is float 32 uh we just initialize uh
input full of zeros here now um and what
we want to do next is we want to
actually get the embeddings from llama 2
so we say 4
I
representation in enumerate so we have
an index enumerate DF textual
representation uh for that we say take
the representation and make a request so
say response equal to requests.post
and now we just need to use the Local
host URL of llama 2 uh ollama sorry
sorry uh which is by default
HTTP localhost and then port 11434 if
you didn't change that that should be
the default port of ollama again here if
you don't want to use ollama you can
also use the open AI API there is an API
for embeddings if you want to replace
this code with code that gets you the
embeddings from open AI where you have
to pay money you can do that uh you
don't have to do it with ollama if you
don't want to you just have to get
somehow the embeddings so / API
embeddings and here now we need to pass
some data the data is going to be
adjacent object and the Json object will
say I want to use the
model llama 2 and I want to use the
prompt for the embedding which is the
representation um all right so that is
our request and then the result is going
to
be or the embedding is going to be equal
to the response
do get the Json and then get a specific
field called
embedding and then in order to store
that go to index I this is why we do the
enumeration here uh go to index I and
say that that is now our
new uh input here so NP array
embedding actually I think we can also
use
np.empty uh that would save us some time I
guess but it doesn't really matter
so yeah uh and in the end when we're
done with that we want to do index. add
X
so we can do it like that and it will
take quite some time so I can run this
and you will see it will start working
and it takes some time so I can actually
go ahead and add a line here saying if I
modulo 100 is equal to zero print i and
remember we have uh how many we have
6,810
rows so I can run this you can see I get
zero then at some point I'm going to get
100 200 and so on but you see the
progress is quite slow so when you see
100 and when you see 200 you're going to
see how slow this actually is so I'm not
going to do all of this here now on
camera and I think actually okay it
seems like I cannot run this while
recording because it crashes my
recording or at least it makes it very
laggy but you can see it doesn't uh work
very quickly at least on my Hardware
maybe you have some powerful GPU and it
works instantly so you have to run this
for a while I'm not going to do this on
camera I already did this this is why I
have this index file here I'm going to
show you how to create that here in a
second but you can run this for example
on the first couple of instances if you
want to you can run it on the whole data
set and just wait but the idea is that
once this process is done so once this
Loop here is finished and you add
everything to the index what you can do
easily is let me just close this
here um what you can do easily is you
can export the index by saying faiss
write_index and then
uh you take the index and you save it to
index now in my case I already did that
so uh I'm not going to do this but this
is the line of code you would run to do
that I'm just adding a two here in case
I accidentally run this and delete my
index uh and what you can do then is you
can load the index from the file Again
by saying faiss.read_index and then just a
file name so index here and then you can
store that in an index so in this case
what I'm doing now is I'm loading the
index from a file instead of creating it
here by training because I already did
this exact code here I ran it I waited
I produced the index I wrote it to
disk and now I can just say index faiss
read index and I have the full index so
everything that is the result of running
this just that I now uh terminated that
but I now have the index and this is now
the same thing that you get when you
just run this and let it run until it's
finished or you can also go ahead and
just truncate it you can say okay give
me a random sample of the data I don't
need all of it uh it's up to you so it
takes some time
um now what do we do with this index
what we can do with this index now is we
can provide a new instance and find the
most similar instance of the database so
for example uh let's use from our uh
data frame here so let's go and say data
frame DF where the title
contains uh let's look for a book uh
classic self-improvement book would be
something like uh How to Win Friends and
influence people so let's look for
friends uh uh what's the problem here DF
title
contains oh title.str.contains
so we have little house friends
friends friends how do I friends and
influence people there you go or
actually this is the book so it's 4533
let's say I want to find the most
similar book to this one now this is not
a new book you can also do that with a
book that is not part of the data frame
uh but then you would have to craft your
string yourself but let's go and say
that my favorite book now is equal to DF
iloc and it's
4533 so if I look at my favorite book
you can see it's this one I can also
look at the textual representation
here and I will
get the data for the book now let's say
I like this book and I want to find
similar books because I want to learn
more about this or similar topics here
what I can do is I can use the vector
store to embed this again and this again
this could be something completely
different I could go ahead now and craft
this myself I don't need to use a string
that is already part of the um of the
data frame I can go and say title uh
python Bible 7 and one which is a book
from me I can say
authors my name and I can I can put my
book here if I want to and I can feed
that in as well so it doesn't have to be
a book that's already part of the data
frame you just craft your string and you
feed it into it it doesn't even have to
have the structure so you can also go
ahead and feed in hello world and embed
it it also works uh but it's not very
useful so we have this book here and
what I want to do now is I want to embed
this again assuming that this is not
part of the data frame or you can again
embed your own string and then take that
embedding and perform a similarity
search so we do that again by saying
basically the exact same thing that we
did here so we're going to copy that
code
the response is equal to requests post
and then that but here now instead of
representation we pass favorite
book uh
textual
representation or you could also as I
said pass your own string um yeah so
that is that we get a response from this
now we need to get an embedding so
what's the embedding of this particular
book it's equal to NP
array of uh response
response.
Json uh yeah we need to actually use
this
uh thing here for the shape so response
Json
embedding and then the data type is
equal to float
32 this is now the closing bracket
actually we need no we need it like this
there you go so that's the embedding and
now we have to feed this into our index
and search for similarities so we say D I
is equal to index.search so we performed
the search based on the embedding and
we're interested in a top five results
so I pass Five here and then I can get
the matches by saying best
matches is equal to NP
array um DF
textual represent
ation so I get only the column with the
representations from the data frame and
I say that I'm interested in particular
in a couple of indices and these indices
are what I get as a result here from I
so I flatten that because what you need
to understand is that I'm doing this and
as a result I don't get a textual
representation I get positions I get
indices of the individual entries and I
then need to translate them back to
actual representations from the data
frame so I can say for match in best
matches print the
match print an empty line and basically
run this and you can see now not
surprising actually this is surprising
because this is not what I was expecting
let me see this is I think the issue is
that okay so I actually figured out that
the problem was a different one and it
was that I was not using the exact same
structure that I was using uh when
training the previous index because of
course I trained the index with my
prepared code and there I had a slightly
different structure now I changed this
you can see now title is no longer the
first thing we have categories title
authors average rating number of pages
publishing year then a blank line in
description not because that's
necessarily the best way to do it just
because that's the way I did it when I
trained my index which I loaded so this
is just the reason you want to keep this
the same you can train or you cannot
build your vector store with examples
like these and then use a completely
different structure so you have to keep
it the same in your case it shouldn't
make a difference you should uh get good
results immediately because you have
only been using one structure in my case
it made a difference so just as a side
note here it's good that we can learn
from mistakes uh you need to keep this
the same so you cannot just swap things
around here because it's going to mess
up the database so I changed the
structure to be the exact same as the
one I used so now I can run uh these
these things here again I'm not going to
run this one uh I can read index I can
find this book again I can post I can
get the best matches and then I can get
my results which are in this case now
way better now of course this one here
is going to be number one because it's
the exact same thing but besides that we
have here conduct of life from Stephen
Covey also a self-improvement book we have
psychology the of intimacy from this
author here we have uh Marketing in the
bottom line oh actually this was not the
type conduct of life is not the type
first things first is the type of the
book uh and the dance of intimacy is the
type the title of this book so this is
just a category here um but yeah so you
can see that what we get here how to
talk so teens will listen and listen so
team will talk yeah whatever but these
are all like self-improvement /
productivity / communication books
maybe we can go and look at a couple of
more here and we see that for the most
part art of Happiness these are all
self-improvement books so it seems to
work to some degree you can play around
with that you can play around with
different representations you can also
try first of all smaller samples and
then do it on the whole Vector store or
on the whole data uh data set but this
is how you can build a uh recommendation
system because all you have to do now is
you have to come up with new books like
uh in this structure here and then you
can just feed them in and get
recommendations for uh similar books so
that's it for today's video I hope you
enjoyed it and hope you learned
something if so let me know by hitting a
like button and leaving a comment in the
comment section down below and of course
don't forget to subscribe to this
Channel and hit the notification Bell to
not miss a single future video for free
other than that thank you very much for
watching see you on the next video and
bye