How To Create Your Own AI Clone for Videos (No More Shooting)

100x Engineers
5 Dec 202311:50

Summary

TLDRThis video tutorial showcases the creation of a personalized AI Avatar using the tool 'haen' in just 10 minutes. The process involves recording a high-quality video of oneself, adhering to specific guidelines for optimal results. The tool then generates an AI replica that can mimic the user's gestures and voice, automating video content creation. The video also covers the importance of legal consent due to the rise of deepfakes, pricing plans, and the benefits of fine-tuning the AI model for higher quality outputs. The presenter demonstrates creating a video with the AI Avatar and emphasizes its potential, especially for those with non-Western accents, by recommending voiceover recording.

Takeaways

  • πŸ˜€ The video demonstrates how to create a personalized AI Avatar using a tool called haen.
  • ⏱️ The process of creating an AI Avatar is claimed to be quick, taking only 10 minutes.
  • 🌐 The tool automates content creation, reducing the need for manual video recording.
  • πŸ‘€ The AI Avatar mimics the user's appearance, speech, and gestures based on a short video clip.
  • πŸŽ₯ High-quality video footage is essential for the best results, with recommendations for resolution, lighting, and environment.
  • πŸ“ The tool has a learning process that understands and replicates the user's facial expressions, hand movements, and voice.
  • πŸ’¬ Users are advised to maintain eye contact and pause with closed mouths between sentences for better AI replication.
  • πŸ’‘ The video mentions the importance of avoiding footage cuts and excessive hand gestures above the chest.
  • πŸ”’ Legal consent is required to ensure the authenticity and legal use of the AI Avatar, preventing misuse.
  • πŸ’° The tool offers a free tier and various paid plans based on the number of credits for video creation.
  • 🎬 Fine-tuning the AI model is an option for users seeking higher quality and more detailed video outputs, though it's more expensive and time-consuming.

Q & A

  • What is the purpose of creating an AI Avatar as described in the video?

    -The purpose of creating an AI Avatar is to automate video content creation, reducing the need for multiple takes and allowing for the generation of videos using a script or voiceover.

  • How long does it take to create an AI Avatar using the haen tool?

    -The video claims that the AI Avatar can be created in only 10 minutes using the haen tool.

  • What are the system requirements for recording a video for the AI Avatar?

    -The video should be recorded using a high-resolution camera, in a well-lit and quiet environment, with the subject looking directly into the camera and maintaining eye contact. The subject should also pause with closed mouth between sentences and avoid hand gestures above the chest.

  • Why is it important to avoid cuts and changes in the video footage for the AI Avatar?

    -To ensure the best quality output, continuous footage without cuts is recommended because the AI needs a consistent input to learn and replicate the user's gestures, expressions, and voice accurately.

  • What is the process for creating an AI Avatar on haen.com?

    -The process involves signing into haen.com, clicking on 'instant Avatar', and then following the instructions to upload a video or record a new one, ensuring to meet the quality and recording guidelines.

  • How does haen handle the issue of unauthorized use of someone's likeness?

    -To prevent unauthorized use, haen requires users to provide legal consent by recording a statement that they authorize haen to use their footage for creating an AI Avatar.

  • What are the different pricing plans offered by haen for creating AI Avatars?

    -Haen offers a free tier with one credit for a 60-second video and one instant Avatar, as well as paid plans starting at $30 per month for 15 credits, which can be adjusted based on the user's needs.

  • How does the haen tool process the video to create an AI Avatar?

    -The tool processes the video by analyzing the user's gestures, voice, facial expressions, and hand movements to create an AI replica that can emulate the user when provided with a script or voiceover.

  • What is the recommended approach for users with non-western accents when creating an AI Avatar?

    -For users with non-western accents, it is recommended to record a voiceover in their natural voice and use that for the video, rather than relying on the tool's script-to-speech feature, which may not accurately replicate their accent.

  • What is the fine-tuning process for an AI Avatar on haen, and what are its benefits?

    -Fine-tuning involves paying extra to enhance the AI Avatar's resolution, lip-syncing, and gesture details. It requires a longer processing time and provides a more accurate and higher-quality output.

  • How long does it take to fine-tune an AI Avatar on haen?

    -Fine-tuning an AI Avatar on haen takes 8 to 12 hours, as it involves additional training on the user's footage to improve the model's accuracy.

Outlines

00:00

πŸŽ₯ Creating an AI Avatar with Haen

The speaker introduces their AI Avatar, created using Haen, a tool that automates content workflow by generating a digital replica of oneself. The process involves signing up on Haen's website, choosing the 'Instant Avatar' option, and following a series of steps to create an AI version capable of producing videos. The speaker emphasizes the importance of high-quality video footage, proper lighting, and minimal background noise for optimal results. They also discuss the tool's limitations and provide tips for achieving the best output, such as maintaining eye contact and using generic gestures.

05:02

πŸ’¬ Legal Consent and Pricing Plans for Haen

In this section, the speaker addresses the legal aspect of using AI technology by explaining the need for consent to create an avatar. They demonstrate the process of recording a legal consent using Haen's interface. Additionally, the speaker provides an overview of Haen's pricing plans, highlighting the different tiers and the benefits they offer, such as the number of credits for video creation and access to premium features. They also discuss the importance of fine-tuning the AI model for higher quality videos, which comes at an additional cost and requires a longer processing time.

10:02

πŸ” Fine-Tuning AI Avatars and Practical Applications

The speaker delves into the process of fine-tuning an AI Avatar for improved video quality and more accurate lip-syncing. They explain the benefits of using a longer video clip for fine-tuning and the expected wait time. The speaker also shares their experience using the AI Avatar on social media, highlighting its effectiveness in garnering views. The section concludes with a recommendation for those looking to create high-quality AI videos, emphasizing the potential of the technology to improve over time.

Mindmap

Keywords

πŸ’‘AI Avatar

An AI Avatar refers to a digital representation of a person that mimics their appearance, voice, and gestures. In the video, the creator uses a tool called haen to generate an AI Avatar that can be used to produce videos, automating the content creation process. The AI Avatar is created by processing a short video of the individual, capturing their facial expressions, gestures, and voice to replicate them in digital form.

πŸ’‘haen

Haen is the name of the tool mentioned in the video that allows users to create their AI Avatars. It is used to automate the video production workflow, reducing the need for physical recording sessions. The video demonstrates how to sign up, select options, and create an AI Avatar using haen, highlighting its features such as instant Avatar creation and video production capabilities.

πŸ’‘Instant Avatar

Instant Avatar is a feature within the haen tool that enables users to generate a basic AI representation quickly. As described in the script, upon signing up for haen, users receive one free instant Avatar and one free credit, which can be used to create a 60-second video. This feature is part of the tool's offering to automate and simplify the video creation process.

πŸ’‘Content Workflow

Content Workflow refers to the series of steps involved in creating and delivering content, such as videos. The video's theme revolves around automating this workflow using AI technology. By creating an AI Avatar with haen, the creator aims to eliminate the need for repeated recording sessions, streamlining content production by allowing the AI to generate videos from scripts or voiceovers.

πŸ’‘Deep Fakes

Deep Fakes are synthetic media in which a person's likeness is superimposed onto someone else's body or face with the help of AI. The video touches on the ethical considerations of AI Avatars, mentioning the potential for misuse, such as when someone's face is used without consent. The haen tool requires legal consent to create an AI Avatar, addressing concerns about authenticity and the prevention of impersonation.

πŸ’‘Finetune

Finetune, in the context of the video, refers to the process of enhancing the AI Avatar's model to capture more detailed nuances of the individual's appearance and gestures. The video explains that for a higher quality output, users can opt to fine-tune their AI Avatar model, which results in better lip-syncing and gesture recognition, albeit at an additional cost and with a longer processing time.

πŸ’‘Legal Consent

Legal Consent is the affirmation by the individual that they permit the use of their likeness to create an AI Avatar. In the video, the creator demonstrates the process of providing legal consent to haen, which is necessary to ensure that the creation of the AI Avatar is authorized and to prevent potential misuse, aligning with ethical standards and legal requirements.

πŸ’‘Gestures

Gestures are the movements of a person's hands and body that convey meaning or emotion. The video script emphasizes the importance of capturing natural gestures during the recording process to ensure that the AI Avatar accurately replicates the individual's movements. It also advises keeping hand gestures below the chest level to improve the recognition and replication of hand movements by the AI.

πŸ’‘Voiceover

A voiceover is a recording of a voice that is added to a video, typically after the video has been produced. The video suggests that for individuals with accents that the AI tool may not accurately replicate, recording a voiceover and syncing it with the AI Avatar can produce more authentic results. This approach allows the AI to focus on visual elements while the original voice is preserved.

πŸ’‘Pricing Plan

The Pricing Plan refers to the cost structure offered by the haen tool for creating and using AI Avatars. The video outlines different subscription tiers, detailing the credits and features associated with each plan. Credits are used to produce videos of varying lengths, and the plans scale according to the user's needs, with additional benefits for higher-tier subscriptions.

Highlights

Introduction to creating an AI Avatar using the tool called haen.

The AI Avatar can automate content workflow by creating videos without the need for repeated recordings.

Signing up for haen provides one free instant Avatar and one free credit for a 60-second video.

The tool processes a 2 to 5-minute video of yourself to create an AI replica for video creation.

Guidelines for recording the video include using a high-resolution camera, well-lit and quiet environment, and maintaining eye contact.

Avoiding common mistakes like frequent cuts in footage, changing positions, and excessive head or hand movements.

The importance of providing legal consent to prevent misuse of one's likeness in the age of AI and deepfakes.

Pricing plans for haen, starting from a free tier to monthly subscriptions with varying credits for video creation.

The process of uploading a video and answering questions to ensure the AI Avatar's accuracy.

The tool's limitation in capturing non-Western accents and the recommendation to use voiceover for better results.

How to create a video using the AI Avatar by writing a script or uploading an audio file.

The cost associated with different video lengths and the process of generating videos using haen.

The option to fine-tune the AI Avatar for better lip syncing, higher resolution, and capturing finer details.

The time investment required for fine-tuning a model and the suggestion to use a longer video clip for better training.

The potential of AI Avatars in social media and content creation, as demonstrated by the creator's experience with Instagram reels.

Final thoughts on the future improvement of AI Avatar technology and a call to action for subscribing to the channel for more AI content.

Transcripts

play00:00

this is my AI Avatar that looks like me

play00:02

talks like me and gestures like me and I

play00:05

made this in only 10 minutes using this

play00:07

tool called haen and in this video I'm

play00:09

going to teach you how to do the same as

play00:11

well this completely automates your

play00:13

content workflow you no longer have to

play00:15

shoot without further Ado let's get

play00:17

started all right so this is the real me

play00:19

and I'm going to walk you through the

play00:21

process of creating your own AI Avatar

play00:24

let's go so first of all I'm going to

play00:26

app. haan.com and I'm going to create an

play00:28

account now I already have account so

play00:30

I'm just going to sign in now once I'm

play00:32

signed in I can see a bunch of options

play00:34

like instant Avatar Photo Avatar

play00:36

template AI script Etc you can do a

play00:38

bunch of things with this tool but in

play00:40

this video I'm only going to teach you

play00:41

how to create your own AI Avatar that

play00:43

can do videos for you the whole point of

play00:45

this is to automate your videos in some

play00:47

way or you not having to record a

play00:50

message a thousand times and just having

play00:51

to type in a script or just do a simple

play00:53

voice over so the first thing you need

play00:54

to do is click on instant Avatar and

play00:57

once you do that you can actually see

play00:59

this option called free instant Avatar

play01:01

now whenever you sign up for haen you

play01:04

get one free instant Avatar and you get

play01:06

one free credit out of which you can

play01:08

basically make a 60-second video after

play01:10

that free version you have to basically

play01:12

buy credits I'll quickly walk you over

play01:14

through the pricing plan but let's first

play01:16

create our Avatar and then get into it

play01:18

so I'm going to click on free instant

play01:20

Avatar and this video is going to play

play01:22

hey guys I'm Joshua co-founder and CEO

play01:25

of H let me show you these two videos by

play01:28

the way this is not the actual CEO this

play01:31

is a haen version of the CEO pretty cool

play01:34

right let's get started so you can

play01:35

either take text instructions or video

play01:37

instructions I prefer text because this

play01:39

is like an instruction manual so I'm

play01:41

just going to click on that so let's

play01:42

quickly go through the rules and

play01:44

regulations or the dos and the do Nots

play01:46

so the way this tool works is that it

play01:48

takes a 2 to 5 minute video footage of

play01:50

yourself and then it kind of processes

play01:52

it it understands your gestures it

play01:54

understands your voice it kind of

play01:56

stabilizes your backgrounds it

play01:58

understands your hand movements your

play01:59

facial Expressions it understands all of

play02:01

that that's what the model does and then

play02:03

it tries to create an AI replica of you

play02:06

so that whenever you enter a script or

play02:08

you do a voice over it can actually

play02:10

emulate you in some way or the other so

play02:12

there are some rules to ensure that you

play02:13

get the best quality output I'll just

play02:15

walk you through that use a high

play02:16

resolution camera so the quality has to

play02:18

be topnotch 1080p will actually do the

play02:20

job record in a well lit quiet

play02:22

environment try to minimize background

play02:24

noise as much as possible and have good

play02:26

lighting look directly into the camera

play02:29

look directly into the camera talk

play02:30

directly into the camera and try to

play02:32

maintain eye contact pause between each

play02:34

sentence with your mouth closed it will

play02:37

only be able to correctly humanize you

play02:40

in some way if you were to actually

play02:42

close your mouth when you pause so I

play02:44

know this rule can be a little tricky

play02:45

but just try to keep this in mind use

play02:47

generic gestures and keep hands below

play02:49

your chest try to emote your hands in

play02:52

this region things to avoid stitches or

play02:54

cuts of your footage yes very important

play02:56

I did this mistake the first time I was

play02:58

trying to create my avatar don't have a

play03:00

lot of cuts the best input that you can

play03:02

actually give is a video that is

play03:04

continuously 2 minutes or 5 minutes or

play03:07

anywhere between that long no cuts at

play03:09

all talking without pauses like I said

play03:10

earlier changing positions while

play03:12

recording don't do

play03:16

this stay loud background noises D

play03:20

shadows and overexposure on your face

play03:22

get your lighting correct basically

play03:24

diverting your gaze or looking around

play03:26

not too much head movement hand gestures

play03:28

above the chest

play03:30

don't do that use of pointing gesture ah

play03:34

so you're not supposed to do this this

play03:36

this I've seen a lot of these text to

play03:38

video text to image tools screw up hands

play03:41

a lot so I think it can't really get the

play03:43

details of the hands or the fingers

play03:45

correct which is why it's asking you to

play03:46

do this all right let's move on to the

play03:49

next step so you can either upload an

play03:51

existing footage or you can actually

play03:52

record it with your webcam now obviously

play03:54

like I mentioned you need high quality

play03:56

footage and most webcams usually suck so

play03:58

I wouldn't really recommend the the

play03:59

webcam unless you're just trying to

play04:02

fiddle around with this tool and see

play04:04

what it's like so I already have a

play04:06

footage of myself I'm going to go here

play04:09

and I'm going to upload that file so

play04:11

this is the file that I have of myself

play04:14

open AI had their first ever de

play04:16

conference and Sam Alman came on stage

play04:19

to announce some massive updates to chat

play04:21

GPT and the GPT infrastructure there

play04:24

were majorly three announcements and we

play04:27

will be going into a deep dive of all

play04:29

three of them individually so that's

play04:31

basically the video but again guys I

play04:33

made a little bit of a mistake over here

play04:35

I wasn't supposed to really keep my

play04:37

hands over my chest so just keep that in

play04:39

mind put it a little down and you'll get

play04:42

much better outputs but even when I did

play04:44

this I still got like pretty decent

play04:46

outputs but if you really want the best

play04:47

output keep it below your chest so this

play04:49

is the only video I have so I'm just

play04:51

going to put it in here and they ask you

play04:53

a bunch of questions your face is

play04:55

visible at all times yes you're looking

play04:57

directly into the camera yes there are

play04:59

POS es between sentences absolutely the

play05:01

environment is well lit and quiet yes my

play05:05

footage looks good all right let's go

play05:07

now in the age of AI and deep fakes it

play05:11

is very common for someone to actually

play05:13

use someone else's face there was this

play05:14

whole Rashmi Mandana thing that happened

play05:17

on Twitter when somebody did a deep deep

play05:19

fake video and amitab batan retweeted it

play05:21

saying that they can take legal action

play05:23

in order to prevent this you are

play05:25

required to provide a legal consent that

play05:27

you are consenting h in order to create

play05:30

an avatar of yourself so that you can

play05:32

actually go ahead to record the videos

play05:33

so I'm just going to quickly record my

play05:36

consent turn on mik and cam it basically

play05:39

has a script I have to just read the

play05:41

script that's going to come on the

play05:42

[Music]

play05:45

screen hereby declare that I authorize

play05:48

haen to use the footage of me to build a

play05:50

haen avatar and use it in my haen

play05:52

account all right it has my consent so

play05:55

the tool actually validates the consent

play05:57

while we're actually uploading it so

play05:59

that's going to take about 20 seconds

play06:01

and I'm going to hit submit so it's

play06:02

going to take a little while for my

play06:04

video to get uploaded now while our

play06:06

video is getting ready it's going to

play06:07

take about 2 to 3 minutes I'm going to

play06:09

quickly take you through the pricing if

play06:11

I go in the monthly prices so in the

play06:13

free tier you can basically get one free

play06:15

credit which is a 60-second video and

play06:18

also one instant Avatar but there are

play06:20

different scale as you go plans over

play06:22

here so you can just decrease it or

play06:24

increase it I've opted for the lowest

play06:26

plan which cost me $30 a month with $30

play06:29

a month I get 15 credits a month which

play06:31

means I can make 15 1 minute videos

play06:33

which is more than enough for me and the

play06:35

video can actually be 5 minutes long you

play06:38

also get three instant avatars which

play06:40

means you can either film yourself

play06:42

wearing three different outfits or you

play06:44

can film three different creators as

play06:46

well or three different backgrounds

play06:47

whatever that works for you uh and you

play06:49

get access to a bunch of other premium

play06:50

features that I'm not going to talk

play06:52

about in this video so this is a pretty

play06:54

decent plan and if you want more you can

play06:55

basically just click on the slider and

play06:57

basically the only thing that changes is

play06:59

the number of credits everything else is

play07:01

the same all right let's see if our

play07:03

video is ready yeah I'll actually have

play07:04

to wait for

play07:08

this the most annoying part about AI is

play07:10

that it makes you wait for so long one

play07:14

eternity later all right it's done let's

play07:17

see our model congrats your AI Avatar is

play07:20

ready all right hey shev raish your

play07:24

instant Avatar is ready feel free to

play07:26

create videos with it also click the

play07:28

feedback button to share what you think

play07:30

hope you enjoyed wow it actually caught

play07:32

all my gestures right my voice is right

play07:35

but my accent is really weird so one

play07:37

problem that I've actually encountered

play07:38

with this tool is that it doesn't really

play07:40

get Indian accents really well so what I

play07:43

essentially would recommend is that if

play07:44

you don't have like an American or

play07:46

British or you know like a western

play07:48

accent what you should do is just record

play07:50

a voiceover and turn that into a video

play07:52

rather than write a script and turn that

play07:54

into a video you can write a script but

play07:56

you'll sound fake so it's better to just

play07:58

record it in your normal voice so that

play08:00

you don't have to shoot in front of a

play08:01

camera all the time and set up a studio

play08:03

and things like that you basically can

play08:05

just record a voice over and convert

play08:06

that into a video so I'm going to show

play08:08

you what that's actually like let's hit

play08:11

on create video so here you see the haen

play08:13

editor so this is the model that has

play08:15

been selected and here I can basically

play08:17

write whatever script I want now you

play08:19

guys actually saw how bad the audio

play08:21

quality in the script was so let's

play08:23

actually try going for an audio script

play08:25

you can upload a local audio file you

play08:27

can probably record it on your audio

play08:29

software or on your iPhone or something

play08:31

or you can directly just record it from

play08:32

here I'm just going to directly record

play08:34

it from here hey guys this is an

play08:36

official tutorial of haen I'm going to

play08:39

teach you how to make your own AI Avatar

play08:41

and this is available on 100x if you

play08:43

like this video subscribe to 100ex right

play08:46

now all right let's see how that sounds

play08:49

like hey guys this is an official

play08:51

tutorial of hen I'm going to teach you

play08:54

how to make your own AI Avatar and this

play08:56

is available on 100x if you like this

play08:58

video subscribe to 100x right now right

play09:01

I'm going to hit submit so essentially

play09:03

it's going to cost me 0.5 credits so

play09:06

anything below 30 seconds is 0.5 credits

play09:08

anything between 30 seconds to 60

play09:10

seconds is one credit that's the math

play09:13

hey guys this is an official tutorial of

play09:16

haen I'm going to teach you how to make

play09:18

your own AI Avatar and this is available

play09:21

on 100x if you like this video subscribe

play09:23

100x right now right now what I've

play09:25

created here is a pretty basic Avatar

play09:27

which does a pretty decent job of

play09:28

emulating your gestures etc etc but one

play09:32

important thing that I want to cover

play09:33

over here is fine-tuning if you're

play09:36

really serious about putting out videos

play09:38

with haen what I would recommend is to

play09:39

fine-tune your models it understands the

play09:41

small nuances it is a much better output

play09:45

than the normal version and it

play09:48

definitely costs a little more in order

play09:49

to generate a fine tune model but if

play09:51

you're serious about this it's actually

play09:53

worth it all right now in order to fine

play09:55

tune it you can just click the video and

play09:57

you can see this button called finetune

play09:59

I'm going to click on fine tune and like

play10:02

I said you have to pay a little bit

play10:04

extra in order to finetune your model

play10:06

now you can only choose to F tune your

play10:07

video or you can choose to fine tune

play10:09

your video plus your audio but since

play10:11

this tool is not that great at capturing

play10:13

Indian accents I wouldn't want to

play10:15

actually fine tune my voice so I'm going

play10:18

to click no on The Voice which means

play10:20

it's basically going to cost me $40 in

play10:22

order to F tune my actual video fine

play10:24

tuning what it basically does is your

play10:26

video basically gets a higher resolution

play10:28

which means you can get your videos in

play10:30

4k you also get a better lip syncing in

play10:33

the fine tune model and it gets all the

play10:34

finer details of your gestures the thing

play10:36

about fine tuning is that it actually

play10:39

takes 8 to 12 hours which means there is

play10:42

actual training from your 2 to 5 minute

play10:44

footage video that is being done so one

play10:47

suggestion is if you want to fine tune

play10:49

your video model I would suggest put in

play10:51

a 5 minute clip instead of just a

play10:53

2minute clip because that's bigger data

play10:55

so you can keep it overnight you can

play10:57

leave it overnight next morning you come

play10:59

your finetune model will be ready now

play11:01

I've already finetuned my model so I'm

play11:03

just going to close this and I'm going

play11:04

to show you what my finetune model looks

play11:06

like the audio still sucks the accent

play11:09

still sucks and that's just the normal

play11:11

problem that's there with haen but let's

play11:13

actually create a video with this and

play11:15

see so here's what I'm going to do I'm

play11:17

going to get my AI Avatar in order to

play11:19

record the ending of this video and that

play11:22

is how you create your own personalized

play11:25

AI Avatar you no longer have to shoot

play11:27

I've actually used my AI Avatar and my

play11:30

Instagram reels and those reels have

play11:32

pulled over a million views it actually

play11:35

works if people have seen you for long

play11:37

enough they'll understand that it's not

play11:38

you but in about a few months this is

play11:41

only going to get better and better

play11:43

thank you for watching and if you want

play11:45

more AI stuff subscribe to

play11:48

100x

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
AI AvatarContent AutomationH PlatformVideo TutorialDigital PresenceTech InnovationVideo CreationOnline MarketingAvatar TechnologyDigital Content