How to Find, Build, and Deliver GenAI Projects

Dave Ebbelaar
13 Jun 202446:22

Summary

TLDRThis video script offers a comprehensive guide for engineers and developers on finding, building, and delivering custom generative AI solutions. It covers tactics for project discovery, the tech stack and processes for development, and the challenges of production deployment. The speaker shares insights from his company, Data Lumina, and emphasizes the importance of prompt engineering, leveraging social media for lead generation, and maintaining applications at scale.

Takeaways

  • πŸ“š The video outlines a comprehensive process used by Data Lumina for building custom generative AI solutions, tailored for engineers and developers interested in AI applications.
  • πŸ› οΈ The process is divided into three main parts: finding generative AI projects, building these projects, and delivering them to clients.
  • πŸ’‘ When starting, it's recommended to offer general AI services and then specialize (niche down) based on client needs and feedback.
  • πŸ”Ž Finding projects involves creating a clear offer, using acquisition methods like social media and networking, and effectively selling the project through understanding client needs and building trust.
  • πŸ€– Building generative AI projects requires understanding the core pattern of inputs, processing, and outputs, and validating ideas locally before scaling.
  • πŸ‘¨β€πŸ« Prompt engineering is crucial and should be based on proven frameworks and templates to ensure the effectiveness of AI models.
  • πŸ›‘ Minimize AI usage by using LLM processing as a last resort, and focus on maximizing functionality through regular code before resorting to AI.
  • πŸ”§ Utilize tools like Azure Document Intelligence for tasks like document parsing, and FastAPI for web applications to streamline development.
  • πŸ”’ Security is paramount, especially when dealing with sensitive data, and multifactor authentication should be implemented for all connections.
  • 🌐 The transition from local development to production requires careful consideration of hosting options, cloud platforms, and CI/CD pipelines for consistent deployment.
  • πŸ“ˆ Ongoing monitoring and maintenance are essential for generative AI projects, with tools like Sentry for error tracking and Lang fuse for monitoring LLM interactions.

Q & A

  • What is the main focus of the video by Data Lumina?

    -The video focuses on guiding engineers and developers on how to find, build, and deliver custom generative AI solutions for clients, rather than using no-code tools.

  • Who is the target audience for this video?

    -The target audience includes engineers, developers, independent contractors, and freelancers interested in building and exploring generative AI applications.

  • What are the three main parts the video is divided into?

    -The video is divided into three parts: finding generative AI projects, building these projects, and delivering them to clients.

  • What does Dave Abar recommend for someone just starting out with generative AI solutions?

    -Dave Abar recommends starting general, talking to clients to understand their needs, and possibly working for free in return for testimonials and referrals to build confidence and a portfolio.

  • How can one leverage social media platforms for finding clients interested in generative AI?

    -One can leverage social media platforms like LinkedIn and YouTube by sharing experiences, learnings, and insights related to generative AI to attract potential clients.

  • What is the importance of the discovery call in the sales process?

    -The discovery call is important for understanding the customer's pain points, offering advice, and building trust, which are crucial for converting leads into customers.

  • Why is prompt engineering considered as important as coding in generative AI projects?

    -Prompt engineering is as important as coding because the quality of prompts can significantly impact the output of the AI model, affecting the overall effectiveness of the application.

  • What is the significance of using a version control system like GitHub in managing generative AI projects?

    -Using a version control system like GitHub helps in managing changes, protecting branches, ensuring security, and facilitating collaboration among team members.

  • What are some challenges faced when moving generative AI projects from a local machine to a production environment?

    -Challenges include ensuring software reliability, dealing with hallucinations in AI outputs at scale, constant monitoring and maintenance, and managing ongoing costs.

  • How does Data Lumina handle the integration of AI into existing systems in their projects?

    -Data Lumina uses an architecture that involves connecting with existing applications via APIs, using queuing mechanisms, data processing pipelines, and integrating outputs back into the system.

  • What is the role of task queuing in scaling generative AI applications?

    -Task queuing, implemented using tools like Celery, helps manage the load by creating a queue for incoming requests, ensuring the system does not get overloaded and maintains performance even under high demand.

  • What are some best practices for monitoring and maintaining generative AI applications in production?

    -Best practices include using monitoring tools like Sentry, setting up CI/CD pipelines for automated testing and deployment, implementing unit tests, and being transparent about costs associated with server and API usage.

Outlines

00:00

πŸ› οΈ Building Custom Generative AI Solutions

Dave Abar, founder of Data Lumina, introduces a video guide for engineers and developers to build and deliver custom generative AI projects. The video is divided into three parts: finding projects, building solutions, and delivering them. Dave emphasizes that the video is not about using no-code tools but about creating tailored AI applications. He shares tactics for finding projects, building with specific tech stacks, and the challenges of production deployment. The audience is encouraged to follow along with a document provided in the description.

05:02

πŸ” Finding Generative AI Projects and Building Confidence

The second paragraph focuses on the initial steps for finding generative AI projects, which include defining the offer, exploring acquisition methods, and understanding sales strategies. Dave suggests starting with a general approach, gaining experience, and researching successful use cases. He also recommends offering services for free in return for testimonials and referrals to build a portfolio and confidence in delivering value with generative AI.

10:03

πŸ“ž Acquisition and Sales Strategies for AI Projects

Dave discusses tactics for acquiring leads and converting them into clients. He advises leveraging current networks, social media platforms like LinkedIn and YouTube, and Discord communities. Dave also highlights the importance of providing value over direct sales pitches on social media. For sales, he emphasizes the difficulty of converting leads and suggests having strategies and experience in place. The process involves discovery calls, creating proposals, and discussing contract terms and payment schedules.

15:04

πŸš€ Leveraging Freelancing in the Age of AI

In this part, Dave explores the impact of AI on freelancing, citing statistics that show an increase in productivity and opportunities for tech professionals to build solutions for startups and enterprises. He mentions that generative AI makes freelancers more likely to pursue independent careers, and he shares his personal experience transitioning from freelancing to running a full development company due to the demand for generative AI solutions.

20:06

πŸ’‘ Fundamentals of Building Generative AI Projects

Dave outlines the core pattern of generative AI projects, which involves inputs, processing, and outputs. He stresses the importance of validating ideas locally, creating flow diagrams, and taking prompt engineering seriously. The paragraph also covers the significance of data quality and the use of large language models, with a preference for Azure Open AI due to data safety assurances.

25:06

πŸ›‘ Minimizing AI Usage and Utilizing Tools for AI Projects

The paragraph discusses best practices for integrating large language models into applications, advocating for the use of AI as a last resort after all other coding solutions have been exhausted. Dave recommends using Pinecone and Langchain for robust integration and emphasizes the importance of monitoring applications at scale using tools like Langfuse. He also suggests leveraging Azure Document Intelligence for parsing documents and outlines the tech stack used for building generative AI solutions.

30:09

πŸ€– Project Management and Tools for AI Development

Dave covers project management tools and techniques for developing AI projects, including version control with GitHub, task management with platforms like ClickUp, and the importance of automation. He highlights ClickUp's features and integrations for task management and emphasizes the need for a solid project management approach to handle complexity and ensure project success.

35:11

🌐 Delivering Generative AI Projects: Challenges and Solutions

The paragraph delves into the complexities of moving AI projects from local development to production environments. Dave talks about the challenges data scientists face in software engineering, the importance of monitoring for hallucinations in AI outputs, and the ongoing costs of maintenance. He also discusses hosting options, the use of Docker for deployment, and the need for testing, monitoring, and security measures.

40:11

πŸ”’ Security and Example Architecture for AI Projects

In this section, Dave discusses security measures for AI projects, including the use of multifactor authentication and secure storage of API keys. He presents an example architecture diagram for a project currently in development, which includes a secure and scalable framework using Azure services, VPN for access control, and a focus on security at every layer of the application.

45:14

🏒 Conclusion and Next Steps for Generative AI Projects

The final paragraph concludes the video with a recap of the in-depth discussion on building and delivering generative AI projects within Data Lumina. Dave invites viewers to like and subscribe for more content on freelancing and working with clients. He also encourages viewers to watch the next video for further insights into client acquisition strategies.

Mindmap

Keywords

πŸ’‘Generative AI

Generative AI refers to artificial intelligence systems that are capable of creating new content, such as text, images, or audio, based on input data or prompts. In the video, this concept is central to the discussion around building custom AI solutions for various business applications, emphasizing the creative and innovative potential of these systems.

πŸ’‘Custom Solutions

Custom solutions are tailor-made products or services designed to meet the specific needs of a client or a business. The video focuses on the process of developing such solutions in the realm of generative AI, highlighting the importance of understanding client requirements and creating unique applications to address those needs.

πŸ’‘Data Lumina

Data Lumina is the company mentioned in the script that specializes in building custom generative AI solutions for clients. The video discusses the company's approach and methodologies in finding, building, and delivering AI projects, positioning Data Lumina as an expert in the field.

πŸ’‘Acquisition Methods

Acquisition methods in the context of the video refer to the strategies used to find and secure leads for AI projects. The script outlines various tactics such as leveraging social media, engaging in Discord communities, and participating in professional networks to attract potential clients.

πŸ’‘Sales

Sales, as discussed in the video, involves the process of converting leads into customers by effectively communicating the value of a generative AI project and building trust. The script emphasizes the importance of understanding the customer's needs, offering advice, and providing a clear proposal with defined scope and pricing.

πŸ’‘Prompt Engineering

Prompt engineering is the process of carefully designing the prompts or inputs given to generative AI models to elicit desired outputs. The video underscores the significance of this process, suggesting that a well-crafted prompt can be as crucial as the code itself in determining the success of an AI application.

πŸ’‘Azure Open AI

Azure Open AI is a service mentioned in the script that provides access to AI models and capabilities through the Azure platform. The video discusses its use in building generative AI solutions, noting the benefits of using a trusted provider that ensures data safety and compliance.

πŸ’‘APIs

APIs, or Application Programming Interfaces, are sets of rules and protocols that allow different software applications to communicate with each other. In the video, APIs are discussed in the context of integrating AI models into applications and the potential costs associated with using third-party services.

πŸ’‘SaaS (Software as a Service)

Though not explicitly mentioned in the script, the concept of SaaS is implied in the discussion of delivering AI solutions. SaaS refers to a software delivery model in which software is provided over the internet, on a subscription basis. The video touches on the idea of offering AI applications as a service to clients, which aligns with the SaaS model.

πŸ’‘Freelancing

Freelancing is a work arrangement where a person offers services, such as AI project development, to clients on a flexible and independent basis. The video discusses the opportunities for developers and engineers to explore generative AI applications as freelancers, highlighting the potential for building a business in this niche.

πŸ’‘Project Management

Project management involves the organization and coordination of activities in a project. The video mentions tools and practices for managing the development of generative AI projects, such as using GitHub for version control and ClickUp for task management, emphasizing the importance of structured project management in delivering successful outcomes.

Highlights

Dave Abar introduces a comprehensive process for building custom generative AI solutions at Data Lumina.

The video is tailored for engineers and developers interested in exploring AI applications, not for those seeking guidance on no-code tools.

A three-part structure covers finding projects, building AI solutions, and delivering them effectively.

Dave emphasizes starting generative AI projects with a general approach, understanding client needs, and then niche-ing down.

The importance of having a clear 'offer' when starting out in generative AI projects is discussed, including choosing between a general or niche approach.

Tactics for finding leads include leveraging social media, engaging in Discord communities, and participating in educational groups like Data Alchemy.

Sales strategy involves understanding the customer's pain points and building trust through expertise and valuable insights.

The video discusses the importance of prompt engineering in generative AI and the significance of data quality over the AI model used.

Dave recommends using Python for backend development, Azure Open AI for model provider, and emphasizes minimizing AI usage until necessary.

An example architecture diagram is presented for a robust and secure generative AI project integration.

The video covers the complexities of moving from a local development environment to a production environment for AI projects.

Different hosting options are explored, including client-hosting and self-hosting solutions with considerations for recurring revenue.

The use of Docker for consistent and scalable application deployment is recommended, along with setting up CI/CD pipelines.

Security best practices such as using multifactor authentication and VPN connections are highlighted for secure application development.

The video concludes with a call to action to subscribe for more insights on freelancing and AI project management.

Transcripts

play00:00

how to find build and deliver generative

play00:02

AI projects in this video I'm going to

play00:05

walk you through the entire process that

play00:06

we use inside of our company data Lumina

play00:09

to build custom generative AI solutions

play00:11

for our clients now on that note this

play00:14

video is not for you if you're looking

play00:16

to learn more about how to build and

play00:18

deliver generative AI Solutions using

play00:20

nood tools like sapper make voice flow

play00:23

or bpress which are all great by the way

play00:25

but this video is really going to dive

play00:27

into building custom Solutions so this

play00:30

video is for engineers and developers

play00:32

who want to build and explore generative

play00:35

AI applications in their own company or

play00:38

for example do that as an independent

play00:40

contractor or freelancer to really start

play00:42

your own thing which is really a great

play00:44

opportunity right now for every

play00:46

developer that is really interested in

play00:48

working with generative AI this video is

play00:51

split up into three parts first we'll

play00:53

look at how to find generative AI

play00:55

projects we'll share some tactics that

play00:56

we use inside of our company then how to

play00:59

build where really get into the

play01:00

specifics the Tex stack the the

play01:03

Frameworks the processes the tools that

play01:05

we use everything and then we talk about

play01:07

how to deliver these generative AI

play01:09

projects because it's one thing to build

play01:11

something locally on your laptop that

play01:13

works but bringing them putting that

play01:15

into production where it can run and the

play01:17

client can access it that's a whole

play01:19

different story and that's the most

play01:21

challenging part so that's how it is

play01:23

split up quick context about me for

play01:25

those of you that don't know me my name

play01:27

is Dave abar I'm the founder of data

play01:29

Lumina and we help organizations and

play01:31

individuals to navigate the AI

play01:33

Revolution and we do that by building

play01:35

custom Data Solutions for the clients

play01:37

that we work with and we also cultivate

play01:40

technological leadership through our

play01:42

educational resources that is our

play01:44

YouTube channel our free community data

play01:46

Alchemy which you can check out and our

play01:48

paid Community data freelancer where we

play01:50

help data professionals to get started

play01:53

with freelancing by showing them how we

play01:55

run our company how we find clients and

play01:58

collaborating with them now if you want

play02:00

to follow along with this document you

play02:02

can click the link in the description

play02:03

you'll uh see a link to this document if

play02:05

you click it you can duplicate or

play02:07

download it after signing up for clickup

play02:09

so if that's something you want you can

play02:11

do that you can also export it to

play02:12

markdown to put into notion obsidian

play02:14

whatever tool that you are using so with

play02:18

that out of the way let's get straight

play02:20

into things and start with part one how

play02:22

to find generative AI projects I have a

play02:26

diagram over here and a stepbystep

play02:28

instructions uh let's first break it

play02:31

down really at the core most top

play02:33

fundamental level what do you need in

play02:34

order to find these projects it's split

play02:36

up into three parts you have the offer

play02:39

you have acquisition methods so how to

play02:40

find leads and you have sales after

play02:42

finding those leads how do you actually

play02:44

sell the project to them so let's start

play02:47

with the offer this is really what are

play02:50

you going to sell and when you're

play02:51

starting out you should ask yourself the

play02:54

question are you going to go general or

play02:56

are you going to Niche down General

play02:58

really is okay what's your problem you

play03:00

talk to the to the clients they say hey

play03:02

we have problem X Y and Z and you say

play03:04

hey I can build that for you going Niche

play03:07

is you you go to the client say you have

play03:09

a problem I have the solution you go

play03:11

specific you know a certain problem that

play03:13

for example a niche or a market is

play03:14

dealing with and you built a tailored

play03:16

solution for that if you're just

play03:18

starting out and that's probably you if

play03:19

you're watching this video then I would

play03:21

recommend start general first just talk

play03:24

to people and let them tell you what it

play03:26

is that they want and then figure out if

play03:28

you can help them so how do you start

play03:30

and for this you need to be experienced

play03:32

so you need to be a developer a data

play03:34

scientist machine learning engineer

play03:35

software engineer data engineer and

play03:36

ideally have some experience with

play03:39

generative AI under your belth because

play03:41

building custom Solutions is hard and

play03:45

you you can start by for example solving

play03:48

a problem that you are experienced this

play03:50

is a great way to start something else

play03:52

that you can do is research successful

play03:53

gen fi use cases and I've listed some of

play03:56

them here where you could really think

play03:58

about the main departments that every

play04:00

almost every company has so Marketing

play04:02

sales support and operations these are

play04:05

really for all of these departments

play04:07

generative AI can be great and there are

play04:10

lots of solutions so take some

play04:11

inspiration from there see if there's

play04:13

something that you like that could also

play04:15

be a great starting point so you can

play04:16

either just pick something on your own

play04:18

start with that to build your own

play04:20

portfolio or what you could also do very

play04:22

powerful offer to work for free in

play04:24

return for testimonial and some

play04:26

referrals so go out to some people in

play04:29

your Network a company your boss uh

play04:31

clients you've worked with in the past

play04:34

doesn't matter find a company that is

play04:36

interested in doing something with

play04:37

generative Ai and see if you can help

play04:39

them out maybe do it for small fee maybe

play04:41

do it for free but really the important

play04:44

the the goal here really is to get that

play04:46

case study and ideally a testimonial to

play04:49

first and foremost also build confidence

play04:51

so you know hey I can do this I can

play04:54

actually deliver value to companies

play04:56

using the skills and uh that I have and

play04:59

these tools that is the offer what are

play05:01

you going to sell get clear on that I

play05:03

would recommend starting General and

play05:05

then from there we get into the next

play05:07

step which is acquisition so how to find

play05:10

and convert leads and now if you're here

play05:13

because you just want to know how to

play05:14

build these projects or you want to do

play05:15

something within your company then you

play05:17

can kind of like skip over these steps

play05:19

and skip this video a little bit uh

play05:21

where we get into part two but if you're

play05:23

new to this and you're looking to find

play05:24

clients this is where you should start

play05:27

so with acquisition here are some tips

play05:29

to get started with this and I will

play05:31

leave some I will link some more

play05:32

resources later but high level start

play05:35

with your current Network this is the

play05:37

most uh this is the easiest way to start

play05:39

so especially if you already have a case

play05:41

study you have some experience just

play05:43

reach out to people in your current

play05:44

Network either via email direct messages

play05:47

WhatsApp LinkedIn DMS and ask if the

play05:49

solutions you've built could be

play05:51

interesting for them or someone they

play05:53

know the cool thing about generative AI

play05:55

right now is that if implemented

play05:56

correctly of course it can really add

play05:58

value to any company any industry it

play06:01

doesn't matter every company has

play06:03

Marketing sales operations support and

play06:07

gener that's really where generative AI

play06:09

shines and then it's really about

play06:10

getting the right use case making it

play06:13

small and tangible enough to solve a

play06:15

specific problem so start with your

play06:17

network next to that I would recommend

play06:19

leveraging social media platforms

play06:21

LinkedIn YouTube are the two best uh

play06:24

ways to get started I understand YouTube

play06:26

is not for everyone can be a big step

play06:28

but it's highly effective LinkedIn a

play06:30

little bit easier to get started with

play06:32

you just write a post a lot of people

play06:34

then ask okay but what do I share what

play06:36

do I write about the best thing that you

play06:38

can do is share your experience and

play06:40

share the things that you've learned

play06:42

because in doing so you let people know

play06:44

that you're actively working on this

play06:46

you're learning uh you're learning about

play06:48

this and people over time will naturally

play06:50

start to reach out to you asking like

play06:52

hey I'm actually interested in a you

play06:54

talk about ax all the time can you help

play06:56

me with this that's how it works it's

play06:58

really powerful and even even like one

play07:01

LinkedIn post could get the ball rolling

play07:03

so don't underestimate this then another

play07:05

important principle to keep in mind when

play07:07

posting on social media don't sell

play07:09

directly give value so don't go out on

play07:11

LinkedIn for example asking all the time

play07:13

like hey I'm looking for clients to work

play07:14

with can you help me now occasionally

play07:16

you can do this but the long-term play

play07:18

is to give value so share what you've

play07:20

learned share interesting insights share

play07:22

knowledge share news and over time

play07:24

people will naturally start to reach out

play07:25

to you that's the principle that you

play07:27

should keep in mind another way to find

play07:29

leads is by by actually posting in

play07:30

Discord communities for popular

play07:32

Frameworks and tools that you're for

play07:33

example working with so this is a very

play07:36

interesting one and is gaining more and

play07:38

more popularity but by just showing up

play07:40

there and answering the technical

play07:42

questions people might have they will

play07:44

naturally again start to reach out to

play07:46

you asking like hey I'm actually working

play07:47

on this project I have this question

play07:49

about this particular framework here I

play07:51

give an example of Fai and then they

play07:54

they come into that Community to look

play07:56

for answers and if you see that they're

play07:57

the expert and they're willing to pay

play07:59

you some for that they might hire you as

play08:01

a consultant or a developer to start

play08:03

working with it interesting place for

play08:05

developers to find leads then another

play08:07

great way to connect with people and

play08:09

share insights is via school so you can

play08:11

for example come to data Alchemy which

play08:13

is my free group I will leave the link

play08:14

in the description uh we have over

play08:16

15,000 members over here and it's

play08:18

actually quite funny the people that

play08:19

always show up here in the community and

play08:21

that post a lot that add value people

play08:23

will naturally start to reach out to

play08:25

them and ask for help so for example

play08:27

that happened to to Brandon uh multiple

play08:29

times already because he's really active

play08:31

in here and that could be a great way to

play08:33

get your foot in the door as well and

play08:36

then finally you could look into

play08:37

freelance platforms like Fifer or upwork

play08:40

but please note while these platforms

play08:42

are great they can work in the beginning

play08:44

it's really challenging to get started

play08:46

with that all right and then let's get

play08:48

into the next part which is sales so how

play08:51

to sell a project and the thing that I

play08:53

want you to keep in mind here is finding

play08:55

leads is easy sales is hard if you don't

play08:58

have the right experience or strategies

play09:01

and the interesting thing here is that

play09:03

everyone thinks that finding potential

play09:05

customers finding leads is hard because

play09:08

this is the number one question that uh

play09:11

people ask when for example they come to

play09:13

us on a discovery call for data

play09:15

freelancer number one question that they

play09:17

have is how do I find clients I always

play09:19

tell them finding leads finding clients

play09:21

is very easy you can literally walk up

play09:24

to any business owner right now probably

play09:26

and ask them hey is AI on your road map

play09:29

do you want to do something with AI

play09:31

probably all of them almost all of them

play09:33

will say yes we're not sure but we know

play09:36

AI is there we want to do something with

play09:38

this that's a lead they're interested

play09:40

but now converting that lead into a

play09:43

customer I.E sales by now coming in

play09:47

stepping in saying like here's how I can

play09:48

help you here's the price here's what I

play09:50

can do building confidence building

play09:51

trust that is a lot harder again if you

play09:54

don't have the right experience and the

play09:56

right strategies so how do you do it

play09:57

correctly and of course I'm going to

play09:59

give you a summary here all of the

play10:00

topics that we're discussing here could

play10:03

have a separate 60-minute video on its

play10:05

own but this video the goal is to give

play10:07

you an outline if at any point in this

play10:10

video by the way you have questions or

play10:12

you want me to dive deeper into certain

play10:14

topics leave a comment I'll consider it

play10:16

for a future video or I I can I can

play10:19

answer you in the comments but here's

play10:20

how we do it at data Lumina currently

play10:22

and for this I'm going to scroll up to

play10:24

this image over here let me see yeah so

play10:26

we talked about the offer we talked

play10:27

about the acquisition strategies and and

play10:29

by the way for our company right now

play10:30

just in case you're curious all of our

play10:32

clients come through either LinkedIn

play10:34

YouTube or referrals or previous clients

play10:37

that's where we are currently at so we

play10:38

don't have to do any called Outreach

play10:40

send messages that's how powerful social

play10:42

media can be but now let's get into

play10:45

sales so sales for us at least for most

play10:48

companies starts with the discovery call

play10:50

so this is an online call we do that via

play10:53

Zoom we let people schedule via kly um

play10:56

we also sometimes let them fill out some

play10:58

questionnaires so

play10:59

uh what's your company about what's the

play11:01

problem what's the budget so we get some

play11:03

data points already and on the call you

play11:05

shouldn't really think about it as a

play11:07

sales call because really while it is a

play11:09

discovery call really the core thing

play11:11

that you should do is ask questions lots

play11:14

lots of questions so you don't open open

play11:16

up the call and say hey I'm an I'm a

play11:18

developer I have this I can do that I

play11:20

can do that I can do that do you want to

play11:22

buy from me no that's the last thing you

play11:23

want to do a successful Discovery call

play11:26

is asking as many questions as possible

play11:29

to first of all really understand the

play11:31

pain points of the customer understand

play11:33

that you can actually help them

play11:34

understand that it's the right fit and

play11:36

in doing so you also build trust because

play11:39

you show that you're an expert you ask

play11:40

the right questions you trigger them

play11:43

next to that you can also offer advice

play11:45

the funny thing is a lot of

play11:47

companies they have no idea what they

play11:49

want the thing that they they come up

play11:51

they they show up to you on a Q on a

play11:53

discovery call and they say we want to

play11:54

do things with AI sometimes it's a

play11:56

little bit more specific we want to do

play11:58

AI for marketing we want to do AI for

play12:00

Content but they they have no idea so

play12:02

you should then also step in as the

play12:04

expert and offer advice this again

play12:07

builds trust higher chance that the sale

play12:10

will come through it's also important on

play12:13

that initial call to talk about some

play12:14

price expectation hey are you sure you

play12:17

want to do this the scope that we're

play12:18

talking about right now this could very

play12:20

well go into the 15 15 20K depending of

play12:24

course on the size of the project make

play12:26

sure that they are aware of the the kind

play12:29

of range that that J you're at for such

play12:31

a project now later you can see the

play12:34

arrow over here we after the call you

play12:37

can create a specific proposal because

play12:39

because a lot of the times you get a lot

play12:42

of data on that call by asking all of

play12:44

those questions and then you need to

play12:45

take take a step back and dive in and

play12:48

really create a thorough proposal so

play12:50

this is where you really where you

play12:51

outline scope timelines prices

play12:54

deliverables and then you put a price on

play12:56

it this can either be a fixed price so

play12:58

you say okay this is a 10K project it's

play13:00

going to take me four weeks this is what

play13:01

I'm going to deliver you can do

play13:03

sprint-based pricing where you say okay

play13:05

we do two week Sprints uh price for

play13:07

Sprint is X or you could say here's my

play13:10

houry rate I think it's going to take me

play13:12

around 40 to 50 hours maybe and we'll do

play13:15

it like that there are multiple ways to

play13:17

go about this and you should figure out

play13:19

something that works for you what we

play13:21

usually like to do is fix fix prices in

play13:23

set intervals so for example we sell a

play13:25

proof of concept we sell an MVP we for

play13:27

example put four weeks on it it 6 weeks

play13:29

on it Etc to clearly Define the scope

play13:32

but talk about it on the first call so

play13:34

they know what to expect also important

play13:37

talk about potential hidden costs API

play13:40

cost management cost hey do you know

play13:42

actually if we if we implement this

play13:43

solution for you do you know that we can

play13:45

build this there's an upfront cost for

play13:47

the development but also there's going

play13:49

to be maintenance cost API costs give ex

play13:52

set set the ranges for them give the

play13:53

expectations so they know oh it's not

play13:55

only upfront I pay you one time but this

play13:57

is an ongoing thing because maintaining

play14:01

and improving custom Solutions like this

play14:03

a lot of work it's challenging it takes

play14:06

time so you need to get paid for that as

play14:08

well usually what we like to do once we

play14:10

create the proposal which we usually do

play14:13

either in a simple document or

play14:14

PowerPoint presentation we discuss the

play14:16

proposal on a second call or share it

play14:18

via email we found that uh discussing it

play14:21

on a second call where the client is

play14:23

there works better because if you share

play14:25

it via email they might have lot lot of

play14:27

questions um they're not sure about

play14:29

things and sometimes they just ghost you

play14:32

and then next step if everything is all

play14:34

right you can sign a contract so put a

play14:36

contract in place uh we use doku sign

play14:38

for that simple contract so let them

play14:40

sign the contract and also consider a

play14:42

payment schedule what you can do is you

play14:44

can do for example 50/50 split 50% up

play14:47

front 50% afterwards by weekly payments

play14:50

uh payment uh after successful delivery

play14:53

something you can play with so that's

play14:55

how sales Works high level how we do it

play14:57

what you need to consider all right

play14:59

right and before we get into part two

play15:00

there's one more thing that I want to

play15:02

show you and that's this interesting

play15:03

article over here thanks to AI we're in

play15:06

the Golden Age of freelancing there are

play15:08

a lot of interesting statistics in here

play15:10

and what they found they interviewed a

play15:13

bunch of top Tech professionals and they

play15:16

indicated that they are all leveraging

play15:18

AI first of all to increase their

play15:19

productivity well this is I think what

play15:21

we all should be doing but they also

play15:23

capitalize on this generative AI Hye by

play15:26

Building Solutions for startup and

play15:27

Enterprises so they took a group of top

play15:30

Tech professionals and it they it

play15:33

basically said that half of them had

play15:35

already built a solution a gen solution

play15:38

for startups of uh or or an Enterprise

play15:40

and 68% of the respondents in the survey

play15:43

said that generative AI makes them more

play15:44

likely to pursue an independent career

play15:47

working for multiple companies it's a

play15:49

really interesting article a really

play15:50

interesting take out how generative a

play15:53

automation reduces the need for

play15:55

Freelancers in some Fields copywriters

play15:58

Marketing the stuff that can really be

play16:00

automated but on the other hand there

play16:03

because companies notice that they can

play16:05

automate this they now need Freelancers

play16:08

contractors who can help them to create

play16:11

these automations and this is an ideal

play16:14

opportunity for for Freelancers because

play16:16

these are companies that don't have ai

play16:19

Engineers or data scientists inside of

play16:21

their company on payroll that wouldn't

play16:23

make sense they just need someone a

play16:25

technical partner freelancer contractor

play16:27

whatever that can help them every now

play16:28

now and then to build these automations

play16:30

and then they can over time replace the

play16:33

overhead that they're spending on

play16:35

customer support marketing Etc and now

play16:38

funnel that money into potentially you

play16:41

who steps in as a contractor really

play16:43

interesting take and it's definitely

play16:45

something that I've noticed um I've been

play16:47

freelancing for 5 years already

play16:49

basically whenever when uh the open AI

play16:51

API came out I almost completely

play16:53

switched to building generative AI

play16:55

Solutions just because of how much

play16:57

demand and opportun unities there is so

play16:59

for my freelancing business which um is

play17:02

now not even a freelancing business

play17:03

anymore data Lum is now a full full

play17:06

development company but I've transition

play17:08

I've been able to make that transition

play17:11

because of generative AI do you want to

play17:13

dive deeper into how to find J fi

play17:16

projects so how to find clients then you

play17:18

can check out this YouTube video some

play17:21

link that as well so this is how to find

play17:23

freelance data and AI project in 2024

play17:25

it's a 23 minute video going deeper uh

play17:29

into the more specific some books some

play17:31

strategies about everything that we've

play17:33

discussed here all right with that said

play17:36

we conclud this part one let's now get

play17:39

into how to build generative AI projects

play17:42

and let's start with fundamentals to

play17:44

keep in mind at the core every

play17:47

generative AI project follows this

play17:50

pattern you have inputs you have

play17:53

processing and you have outputs

play17:54

generative AI generative meaning output

play17:57

these models gener at stuff and they

play18:00

generate stuff based on input this is

play18:03

your data and your prompts and the

play18:06

processing happens inside the

play18:08

application so how do you bring all of

play18:09

this together and there is an output

play18:12

this can be in the form of text U this

play18:14

can be in the form of even images this

play18:16

can be in the form of any type of

play18:18

combination documents you can put it

play18:20

together and then also you have just the

play18:24

file outputs but you also have the

play18:25

destination same for the input you have

play18:28

the data but you also have to Source

play18:29

where it's coming from and usually what

play18:32

you if you consider a full generative AI

play18:35

project it combines the elements of

play18:37

getting some information from a system

play18:40

or an input or a form there's some type

play18:42

of connection you take you take that

play18:44

data you process it you add the prompts

play18:46

you add the large language model you

play18:48

generate something new put it into

play18:50

desired format and then also put it in

play18:53

uh the desired output place whether

play18:55

that's on the website in a Content

play18:57

management system in a on a storage

play19:00

account that is generative AI projects

play19:03

101 a typical flow that the applications

play19:06

follow when you start to build these

play19:09

projects you always want to start by

play19:11

validating the ID locally meaning

play19:14

getting a proof of concept you can even

play19:16

sell this as a separate phase which is

play19:18

what we sometimes do but you should also

play19:20

be careful with that and really

play19:22

communicate with the client that if you

play19:24

do a proof of concept it's literally

play19:26

only to test and validate f validate the

play19:29

idea not something tangible that they

play19:31

can use this is a mistake that that

play19:33

we've made in the past where okay let's

play19:35

do proof concept we're going to prove

play19:37

that this works because these projects

play19:39

are very Innovative Innovative they're

play19:42

new a lot of these use cases are not

play19:44

proven yet and sometimes there's a

play19:46

little bit of ambiguity or uncertainty

play19:49

whether this will actually work uh

play19:51

mostly in terms of is the the quality of

play19:54

the output good enough can we control

play19:56

for hallucinations all of those

play19:58

questions

play19:59

but you want to start locally and you

play20:02

should really get clear on that pattern

play20:05

so get clear on the input data the

play20:07

processing step the desired output in

play20:08

the location and usually it helps to

play20:10

create a flow diagram so create a flow

play20:13

diagram you could use something like

play20:14

draw.io or figma and write out the

play20:17

applications flow think about where do

play20:19

we need to get data from what do we need

play20:21

to combine what kind of prompts do we

play20:22

maybe need and then what is that output

play20:24

uh going to look like then also take

play20:26

prompt engineering seriously so don't

play20:28

just write a prompt simply by the first

play20:30

thing that comes up uh when you're

play20:31

working on this or by asking C GPT to

play20:33

write the prompt for you don't be lazy

play20:36

here your prompt is almost as important

play20:38

maybe it is as important as the code

play20:41

that you're writing so follow proven uh

play20:44

prompt Frameworks and templates more on

play20:46

that uh in a little bit but take prompt

play20:48

engineering seriously and also consider

play20:50

the data is more important than the AI

play20:52

that you're using with when it comes to

play20:54

generative AI everyone has the same

play20:57

models almost of course there's fine

play20:59

tuning all of that but generally

play21:01

everyone is using the same intelligence

play21:04

here the difference is the data and the

play21:08

data is in terms of your your literally

play21:11

your input data your company data

play21:13

proprietary data that that you're using

play21:16

but also your prompts so combining those

play21:18

two is really what makes your

play21:20

application unique and potentially

play21:22

better than the competition for example

play21:24

all right then let's get into tools and

play21:27

techniques so we use use pure python to

play21:29

build our generative AI solution so we

play21:31

use Python backends for prompt

play21:33

engineering something that you could

play21:35

look into is we like to structure our

play21:37

prompts by first always focusing on a

play21:39

role then a task description step by

play21:42

step using Chain of Thought prompting uh

play21:44

instructions we give it context we add

play21:46

emotional impact and we also add some

play21:49

some one uh some few shot examples and

play21:51

then we tie all of this together in a

play21:53

markdown format so these are proven

play21:55

prompting techniques based on literature

play21:58

so so if you're interested in that I

play21:59

would recommend looking this up then for

play22:03

large language model provider so the

play22:05

brain the intelligence of your

play22:06

generative AI applications uh you can

play22:09

consider open AI Azure open AI or CLA we

play22:12

almost always use Azure open AI because

play22:14

we have the added check mark where

play22:18

Microsoft ensures that they're not

play22:19

sharing your data with open AI even

play22:22

though Microsoft is partnered partly

play22:24

owns open AI but for the clients that we

play22:27

work with we can say hey your data is as

play22:30

safe as the emails in your outlook inbox

play22:33

um most of our the clients that we work

play22:34

with they they are already using

play22:36

Microsoft it's a trusted partner so that

play22:39

just helps that's why we use Azure open

play22:41

AI here's a big one huge one which I

play22:44

also have a video on utilize penic and

play22:46

instructor for robust llm integration I

play22:49

will not be diving into what that is all

play22:52

about but I will link this video this is

play22:54

going to completely change the way you

play22:56

build llm application development best

play22:59

practices when it comes to large

play23:01

language models and AI be Innovative but

play23:05

minimize AI usage use llm processing as

play23:08

a last resort what do I mean by this if

play23:12

we consider the typical flow of input

play23:15

processing output Etc there are

play23:17

naturally going to be steps within your

play23:20

application where some of these

play23:23

processing steps you can just do through

play23:25

a regular function maybe through regx

play23:28

maybe through a simple IFL statement

play23:31

make sure to

play23:33

maximize that functionality first and

play23:36

using an llm should really be your last

play23:39

resort so really when you cannot

play23:41

continue in your data processing

play23:43

pipeline without AI that is where you

play23:46

introduce Ai and why do you want to do

play23:48

this because first of all it slows down

play23:51

your application it introduce cost it

play23:53

can introduce hallucination the more API

play23:55

calls you have to make the more

play23:57

challenging it's going to be to scale

play24:00

and optimize and maintain your

play24:02

application now we're talking about

play24:04

generative AI application so at some

play24:06

point you're going to have to ju

play24:08

introduce an llm or AI because otherwise

play24:11

it's not the generative AI application

play24:13

but just keep that in mind because these

play24:16

models are so good they can if if you

play24:18

really break it down to like if you

play24:22

really break problems down to the most

play24:24

fundamental small part an llm can solve

play24:27

almost any problem especially if you

play24:29

utilize penic and instructor you can let

play24:32

it analyze all kinds of problems and

play24:34

then give for example a Boolean value

play24:36

true false and use that to decide what

play24:39

the next flow of the application should

play24:41

be but really think about it can I do

play24:44

this just with code first if not only

play24:47

then resort to llms here's another just

play24:50

random tip uh we leverage Azure document

play24:53

intelligence a lot for parsing documents

play24:57

so uh rack is a common use case for

play25:00

generative AI

play25:01

applications a problem with rack is

play25:04

splitting up the data into relevant

play25:06

chunks that you can then put into your

play25:07

vector database for the retrieval we use

play25:10

Azure document intelligence to split up

play25:13

documents and let it output markdown and

play25:16

then here's the trick we take that

play25:17

markdown and we split the document based

play25:19

on the header so we split by hashtag and

play25:22

this way we can easily split any PDF

play25:25

document up into sections where all of

play25:29

the context of this section is is

play25:31

relevant versus just chunking by for

play25:33

example thousand characters and

play25:34

splitting something up Midway so use

play25:37

this to your advantage this is really a

play25:39

really great tool it's cheap it's it's

play25:41

fast it's it's great okay then for web

play25:44

applications so whenever we have to put

play25:47

something an application Live where it

play25:50

can run in the background and clients

play25:52

can access it or it can listen for for

play25:54

web hooks triggers Etc we use fast API

play25:58

so this is our framework of choice it

play26:00

works on top of the penic uh schemas so

play26:04

that helps because we also leverage that

play26:06

for our large language model so penic

play26:08

plays a core role in how we Define our

play26:11

data models and how we validate our

play26:13

applications with which overall

play26:15

increases the robustness of your

play26:16

applications all right so then next if

play26:18

the application requires scale meaning

play26:21

lots of users maybe even like multiple

play26:23

organizations multiple users you can

play26:25

Implement something like task queuing we

play26:27

use salary for that if scaling is needed

play26:30

so this lets you create a task cue where

play26:32

every time a request comes in we use a

play26:35

separate like Q to place that new

play26:38

request in and then salary just handles

play26:41

that step by step in order to make sure

play26:43

your system does not overload quick

play26:45

example if you have your application and

play26:47

all of a sudden 1,000 requests come in

play26:50

it's very likely that the surfer that

play26:52

you're uh running that on is not going

play26:53

to process that very well that's where

play26:56

something like salery can come in let's

play26:58

see where was I okay then large language

play27:01

model monitoring we use l views having a

play27:05

way to monitor your your applications in

play27:08

terms of traces meaning you can

play27:10

literally see the prompt and the data

play27:12

that was sent to the model to the large

play27:15

language model is really a must for

play27:17

debugging and monitoring at scale so we

play27:20

use Lang fuse because it's full and open

play27:22

fully open source so we self-host it uh

play27:25

it's great it's very similar to Langs

play27:27

Smith okay then front end Development A

play27:30

lot of these applications also require

play27:33

some kind of front end for the client to

play27:35

interact with not always but it it

play27:37

usually it happens so either they have

play27:39

to trigger a pipeline or they have to

play27:42

upload some documents they want to start

play27:43

a process if you want a simple UI you

play27:46

can look into streamlet which is nice

play27:48

because uh you can build front ends

play27:50

using pure python so it's a language you

play27:53

probably already know so you can get

play27:55

started with that if you want more

play27:56

complex UI you can develop something

play27:58

custom with JavaScript react nexas uh

play28:01

but this is for a lot of people probably

play28:03

watching data scientist machine learning

play28:05

Engineers uh this is not our default our

play28:08

default stack I don't know yavas script

play28:10

so I always partner up with someone if

play28:12

this is required another great tip

play28:15

sometimes air table is all you need air

play28:17

table is really awesome it's a low code

play28:19

database tool it's basically Google

play28:21

Sheets in Excel on steroids it works

play28:23

really great but the cool thing is you

play28:25

can also create forms so sometimes that

play28:28

is all you need and you can trigger web

play28:29

hooks you can trigger automation so uh

play28:32

you can even have custom code uh trigger

play28:35

in intermediate steps so sometimes that

play28:37

is all you

play28:39

need databases we personally if we need

play28:42

a structured database we use postgress

play28:44

for unstructured databases we use

play28:46

mongodb and again we just follow the

play28:49

open source principle here and then for

play28:51

Factor databases uh you can look into

play28:53

your quadrant or for simpler use cases

play28:57

uh you can look into postgress with PG

play28:59

factor which is actually what we running

play29:02

right now for a rack application that we

play29:03

have live quadrant is a factor database

play29:06

specific database so it has a little bit

play29:10

it has more functionality specifically

play29:12

for uh querying filtering Etc but

play29:15

sometimes this is all you need and it's

play29:17

really straightforward to get started

play29:19

with that all right and then let's get

play29:20

into project management so up until this

play29:23

point I've talked a lot about Frameworks

play29:25

tools techniques that we use to consider

play29:28

this core project pattern input

play29:30

processing output and tie everything

play29:32

together this is the stack that we're

play29:34

using but how do you manage these

play29:36

projects which becomes more and more

play29:38

important as you start to work with the

play29:40

team but even if it's just you this is

play29:42

also important for verion control we use

play29:44

GitHub but we're on the paid plan to

play29:46

protect our branches so for example we

play29:48

can just straight push to the main if

play29:50

that is uh the production Branch for one

play29:52

of our applications you have a little

play29:54

bit more uh control when it comes to

play29:56

security privileges all of that so we

play29:59

just use kup but you can also use what

play30:00

are the other ones you have bit bit

play30:02

bucket as your devops but make sure you

play30:05

have a solid Version Control in place

play30:08

and then when it comes to task

play30:09

management you can use a simple planning

play30:11

board so you can use giup issues you can

play30:13

use Trello jira linear or clickup that's

play30:16

what we're using and that's also the

play30:18

sponsor of today's video if you're

play30:20

anything like me juggling all your tasks

play30:22

documents and projects can be a real

play30:24

headache that's where clickup comes in a

play30:26

single app to bring all of this together

play30:29

now I personally love clickup because it

play30:31

lets me manage everything in one place

play30:33

with customized views seamless

play30:35

Integrations and realtime updates also

play30:38

as a developer I love the giup

play30:40

integration and automations to manage my

play30:42

development projects and creating and

play30:44

sharing documents for my team or for my

play30:46

clients has never been easier I've tried

play30:48

all the other apps like Trello notion

play30:51

jira or linear but to me there was

play30:53

always something missing and clickup

play30:55

just made sense so if you're looking to

play30:57

boost your product ity whether that's

play30:58

for personal projects or team

play31:00

collaborations give click up a try it's

play31:02

made a huge difference for me click the

play31:04

link in the description to get started

play31:06

so yeah clickup is awesome so even this

play31:08

document that I'm sharing right now all

play31:10

clickup so just I love how we can do

play31:12

everything in one place inside of our

play31:14

company but that concludes part two how

play31:16

to build these generative AI projects

play31:18

and again I did a high level overview of

play31:20

the stuff that we're using this is

play31:22

endlessly infinitely complex and again

play31:25

each video could have a separate part on

play31:27

its own so make sure if you want to know

play31:29

more about a certain topic framework let

play31:31

me know in the comments then let's get

play31:34

into part three and this is like I've

play31:37

said this is the most challenging of

play31:39

them all how to deliver generative AI

play31:43

projects because like I've said building

play31:45

something on your machine that runs when

play31:48

you click play or run script is

play31:50

something entirely different from

play31:52

something that you put into production

play31:54

on a server where it just runs clients

play31:57

can use it client can clients can access

play31:59

it and most importantly it adds value to

play32:03

the company on a consistent basis so

play32:06

let's talk about that and let's talk

play32:08

about some principles and things to keep

play32:10

in mind so going from local PC to

play32:12

production is hard data scientists are

play32:15

not software Engineers this is a tricky

play32:17

one so I have a data science background

play32:19

I'm learning a ton about software

play32:22

engineering so for the past year that

play32:24

I've been one and a half year already

play32:25

that I've really been diving deep into

play32:27

this I've learned a ton but I still

play32:30

wouldn't consider myself a software

play32:32

engineer so whenever it comes to the

play32:34

point where we really have to put things

play32:36

into production this is where I partner

play32:38

up with a true software engineer who

play32:41

knows this stuff and who's done it

play32:43

before next hallucinations become hard

play32:46

to Monitor and fix at scale writing your

play32:49

prompt putting in your data running a

play32:51

couple of tests locally is all great but

play32:55

when you have a system that has 100

play32:58

maybe even thousands of of queries every

play33:01

day there are going to be hallucinations

play33:04

especially in the beginning and spotting

play33:06

those

play33:07

hallucinations is pretty

play33:09

challenging and that becomes really hard

play33:11

and this is also where where tools like

play33:13

Lang views can come in and where you

play33:15

should really closely monitor your o

play33:18

system overall in order to make sure

play33:21

that it remains stable another tricky

play33:24

thing is okay what okay let's say you

play33:26

spot a hallucination

play33:28

you change the prompt now you fix the

play33:31

hallucination but changing this prompt

play33:34

by maybe adding this one sentence don't

play33:36

do this or specifically do this or that

play33:40

how is that going to affect the system

play33:42

as a whole these are all questions and

play33:45

challenges that everyone right now

play33:47

working on these type of project is

play33:49

literally figuring out how to manage

play33:51

this at scale it's challenging also

play33:54

remember software requires constant mon

play33:56

monitoring and maintenance

play33:58

many companies don't have the right

play34:00

mindset for these ongoing costs and

play34:03

improvements a lot of clients they come

play34:05

to you they have a request we want to do

play34:07

something with AI we want to build this

play34:08

automation blah blah blah cool but they

play34:11

expect you to just come in uh usually

play34:14

they they think that it's actually way

play34:16

cheaper than it actually is these

play34:18

projects are expensive so you have to

play34:20

first elaborate on that but then also

play34:22

explain them that maintaining those

play34:25

applications and improving them is also

play34:27

also going to be pricey and now this is

play34:30

going to be a tradeoff between are they

play34:32

going to self host do they already have

play34:33

the infrastructure in the Personnel in

play34:35

place for you to literally come in as a

play34:38

developer and then hand it over to them

play34:40

then of course it becomes a lot easier

play34:42

but if you have if you have to self host

play34:44

becomes more tricky let's get into that

play34:45

hosting options so you should decide

play34:48

what what's the best fit for this

play34:50

project so letting the client Host this

play34:52

or self hosting we do both sometimes our

play34:55

clients come to us and they say Dave we

play34:57

already have an measure environment uh

play34:59

we can give you access develop in our

play35:01

environment and we'll maintain it we'll

play35:03

take care of the costs that's on one

play35:06

part that's great because then the

play35:08

infrastructure is already there on the

play35:10

other hand if clients come to us like

play35:12

you ask them about infrastructure cloud

play35:14

provider they have no idea no we don't

play35:15

have that okay you can offer to self

play35:17

host so self host meaning we can offer

play35:20

that we host the solution so this is

play35:24

more work something you should be

play35:26

prepared for but but it's a chance an

play35:28

opportunity for recurring Revenue so

play35:30

that could be an interesting one but

play35:31

then you should also really consider

play35:33

this do you do you actually want to do

play35:36

this because it can be stressful and you

play35:39

need the right skills so this could also

play35:41

be the point really where you partner up

play35:43

with someone that has experience with

play35:45

that so all things to consider all right

play35:49

then let's get into Cloud platform so

play35:52

like I've said we use Microsoft Azure uh

play35:54

that is just mainly because most of the

play35:57

big companies and clients that we work

play35:59

with here in the Netherlands use

play36:00

Microsoft Azure but you can also

play36:02

consider AWS or Google cloud cloud I

play36:04

know it in the US ad ad is a lot bigger

play36:08

pick something that works for you in

play36:10

general they all high level can do the

play36:12

same stuff deployment and maintenance so

play36:16

we recommend if you put things in

play36:17

production to use Docker for consistent

play36:20

and scalable application deployment so

play36:22

take everything that you've built put it

play36:24

inside a a Docker file a Docker

play36:26

container and then the deploy it through

play36:28

that environments if it's really a big

play36:31

project affecting lots of users

play36:34

potentially I would always recommend

play36:36

splitting it up into a production and a

play36:38

test environment so you have your own

play36:39

production environment running where

play36:41

clients users are interacting with and

play36:43

you also have a test environment where

play36:45

first you develop then you push it to

play36:46

test see make sure that everything runs

play36:48

stable and then you push it to

play36:50

production then your cicd pipeline so we

play36:53

use giup automation to streamline the

play36:54

deploy deployment process so we create

play36:57

sep seate branches to push to separate

play37:00

servers for example so to the production

play37:01

or the test environment uh it's all

play37:03

automated you can set it up using giup

play37:05

automations then testing so make sure to

play37:07

implement unit test so to so you can

play37:10

sure the reliability and functionality

play37:12

of your application when you start to uh

play37:15

improve add new features then also again

play37:18

the cost transparency be clear about

play37:20

server SL API cost and maintenance fees

play37:23

so for this can sometimes be a little

play37:26

bit tricky but all these Cloud providers

play37:28

they have calculators that you can use

play37:31

to say okay what resources do you use

play37:33

what's the plan that you're using what's

play37:35

the volume and you can get a ballpark

play37:37

estimate of what this is going to cost

play37:40

first and foremost at the server level

play37:42

so just running the Surfers running the

play37:43

infrastructure and then with these AI

play37:46

applications where also a lot of the

play37:47

costs depending on the scale of the

play37:49

application come from is just the the

play37:51

API cost of doing the calls to either

play37:54

open AI to clot or to Azure open

play37:58

and for this you ideally with a client

play38:00

want to set set up a monthly payment

play38:02

structure so be clear about this uh if

play38:04

it's in their invironment you don't have

play38:06

to deal with this maybe only just a

play38:09

maintenance fee or just talk talk or

play38:12

explain to them okay if if you ever need

play38:14

me for maintenance improvements this is

play38:16

my hourly rate and you can do it a

play38:18

little bit more flexible on Dem mon or

play38:20

you could offer a set hours per month or

play38:22

per week that you are available and put

play38:24

them on a retainer different way ways to

play38:26

go about that monitoring we use Sentry

play38:30

which I've personally recently

play38:32

discovered this is great so Sentry is a

play38:35

monitoring tool and it has connectors

play38:38

templates for all kinds of Frameworks so

play38:41

it also has a framework uh or template

play38:44

whatever you would call that for fast

play38:46

API and basically the thing that Sentry

play38:48

does is uh whenever something in your

play38:51

application results in an error it's

play38:54

going to lock that and we have it hooked

play38:56

up to our slack Channel and now when we

play38:59

put this into when we have an

play39:00

application running into production and

play39:02

there is for example a new uh there is a

play39:05

new uh example where there is some data

play39:07

missing and through that a function

play39:10

errors because there is a uh what what

play39:13

really happens often is for example that

play39:15

the key is not available so you get a

play39:17

key error on some of your data uh that's

play39:19

very common with these at least

play39:21

applications that we're building it

play39:23

shows up and then we can catch that make

play39:25

it optional more of do or perform a

play39:27

check Etc so we use Sentry for that

play39:30

great tool okay finally security metrics

play39:33

and then I want to dive into an example

play39:36

architecture that we've recently worked

play39:38

on for a project and that you can

play39:40

hopefully take a little bit of

play39:41

inspiration from but security message

play39:44

also really important so first of all we

play39:46

have API keys and credentials so you're

play39:48

working with clients they're sharing

play39:49

information with you make sure to store

play39:52

and treat all of this information

play39:54

carefully we use one password to just

play39:56

store it internally so whenever a client

play39:59

shares an API Key password whatever we

play40:02

use one password then also for all the

play40:05

authentication methods or connections

play40:07

that you have on your application make

play40:09

always make sure that there is some form

play40:11

of multiactor

play40:12

authentication so this is for for

play40:14

databases for apis for web hooks uh so

play40:17

so use a header key for example uh so

play40:20

make sure that that is all safe and of

play40:22

course this gets more and more important

play40:25

dependent uh depending on on the

play40:27

sensitivity of the data that you're

play40:30

dealing with if it's really sensitive

play40:32

data you can also consider uh VPN to

play40:35

really put your application on Lock and

play40:38

block everything except for one WID

play40:41

listed IP or Azure has something where

play40:44

you can create an internal VPN Gateway

play40:46

and it can just connect to that so

play40:48

security measures really important and

play40:51

the thing is also for a lot of people

play40:53

working on these

play40:55

applications not their core Str

play40:57

I see a lot of people either data

play41:00

scientists data Engineers uh machine

play41:03

learning Engineers enter into this field

play41:05

and you really want to consider all of

play41:08

the software development best practices

play41:10

when you're going to put applications

play41:11

like this into your production also for

play41:14

people just in general new to the space

play41:16

new to AI new to software

play41:18

development these are all things that

play41:20

you you have no idea about and I say

play41:22

that because I still am like literally a

play41:25

noob to most of these things and I've

play41:26

been doing this this for 5 years already

play41:29

so make sure that you understand what

play41:32

you don't know and partner up with

play41:34

people that do know that stuff in order

play41:36

to make sure you can just provide

play41:38

reliable solutions for your clients all

play41:40

right and then let's get into an example

play41:43

architecture diagram of a project that

play41:45

we're literally working on right now and

play41:48

I don't see a lot of people talking

play41:49

about this and I'm just going to give it

play41:51

away either way but this is I think in

play41:55

my opinion such a cool such a nice

play41:58

blueprint for lots of generative AI

play42:01

solutions that that you can build right

play42:03

now this the only thing that that's

play42:05

missing is that this doesn't have a

play42:06

front end that's because we're

play42:07

integrating with another application

play42:10

that we're uh getting data from and

play42:12

sending data through but other than that

play42:14

this is a really scalable robust and

play42:17

secure framework that that we're working

play42:20

with right now so high level it's it's

play42:22

mostly just a a summary of of everything

play42:24

that that I've discussed in part two in

play42:27

part three of this video but for this

play42:29

particular solution we're connecting

play42:31

with an existing application so we're

play42:35

using their API and we're using HTTP

play42:38

triggers whenever something happens in

play42:40

the system in that application we can

play42:42

set up uh a trigger and it hits our back

play42:45

end so it sends it sends a request to

play42:48

our fast API backend and from there we

play42:50

get all the data and that is where the

play42:53

the pipeline is triggered so we use

play42:57

queuing mechanism uh in here and we use

play43:00

Reddit as an intermediate step I think

play43:02

that's the only thing that I didn't

play43:03

mention in uh in this dock in the video

play43:05

in the document and through redis we

play43:07

sent it to to celery to put it into a

play43:10

queue we can also visualize that using

play43:11

flow and then we set uh the output is

play43:16

then uh put into the database so in the

play43:18

postc SQL database so this is how we lck

play43:21

internally all of the data that we're

play43:23

using uh later for referencing for

play43:25

monitoring and we enrich the data as

play43:27

well but the the back end and the Q

play43:30

worker that is where all the the magic

play43:32

is happening in terms of the processing

play43:34

so we take that data we we take it

play43:36

through a data processing pipeline where

play43:39

we perform various data enrichments and

play43:42

also uh multiple large language model

play43:45

queries in between to then finally get a

play43:48

desired output and that output is sent

play43:51

back into the system that we're dealing

play43:54

with so this is a architecture that you

play43:59

can copy for for lots of use cases where

play44:02

you integrate AI into a current system

play44:05

which is a really good use case now next

play44:08

to that here we have a separate Resource

play44:10

Group where we also have l view so we

play44:12

talked about that then we also this is

play44:14

the production version we also have the

play44:16

test version which is a copy a literal

play44:18

copy of what we have running in

play44:20

production the only difference is that

play44:22

it cannot send information back to the

play44:25

source application and then the the cool

play44:28

thing about this where when it comes to

play44:30

security we put so we host all of this

play44:32

in Microsoft Azure and we put it into uh

play44:35

we created an Azure virtual Network and

play44:37

then created the VPN Gateway and we only

play44:41

allow incoming request through this VPN

play44:44

Gateway and that means when we as

play44:46

developers want to work with this we

play44:49

with our laptops can connect with this

play44:51

VPN Gateway and that is the only way to

play44:54

connect with this and connect with these

play44:56

resources anything else will be blocked

play44:59

by default so this is how you can

play45:01

entirely shut off your application so no

play45:04

one can reach it because uh we're

play45:06

dealing with sensitive information in

play45:09

this uh in this database in this whole

play45:11

system and we don't want uh that to be

play45:14

interrupted of course and then here we

play45:16

make sure we have the uh we have some

play45:18

some type of multip multiactor

play45:20

authentication so we send the request to

play45:22

our back end but we also have an

play45:24

authentication through a header in there

play45:26

that we implement it so this is a really

play45:29

cool in my opinion an architecture and

play45:32

not all props to me this is really the

play45:34

the software engineer that that I'm

play45:36

currently working with that has

play45:37

engineered this for the project that

play45:39

we're working on so that concludes this

play45:41

video so I hope this was helpful it's a

play45:43

pretty long one I really went in depth

play45:46

talking about how what we currently do

play45:48

inside our company data Lumina so if you

play45:51

found this video helpful please leave a

play45:53

like and also consider subscribing if

play45:55

you want to learn more about

play45:57

uh what we're doing how we're helping

play45:59

our clients or want to learn more about

play46:00

freelancing how to get started with that

play46:02

so make sure to subscribe if that sounds

play46:05

like you and then for now make sure to

play46:07

watch this video next that's the video

play46:09

where I dive deeper into the client

play46:11

acquisition part how you can find your

play46:13

own clients how you can get started with

play46:15

tools resources books that you can dive

play46:17

into so make sure to check that out next

play46:19

and then I'll see you in the next one

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
Generative AICustom SolutionsAI ProjectsData LuminaAI DevelopmentFreelancingTech InnovationAI AutomationSoftware EngineeringAI Entrepreneurship