How to Transcribe and Translate Audio or Video to Any Language Using AI

Howfinity
3 Jul 202305:54

Summary

TLDRThe video demonstrates using two AI tools, Descript and DeepL, to easily transcribe and translate video and audio files. Descript provides automated transcription and editing capabilities, exporting transcripts and captions. DeepL then instantly translates the English transcript into over 30 languages. Together these tools enable creating accessible, multilingual content and subtitles to reach broader audiences. The value and accessibility of AI is highlighted, mentioning a website cataloguing AI courses and tutorials on platforms like ChatGPT and Midjourney for those wanting to further leverage AI.

Takeaways

  • 😀 The video demonstrates how to use Descript and DeepL to transcribe and translate video and audio files
  • 📹 Descript provides accurate AI-powered transcription of video and audio files
  • 📝 Descript transcripts can be edited and exported to various formats like Word docs
  • 🌎 DeepL translates transcripts and captions to over 30 languages quickly
  • 🇪🇸 Translated files can be saved as SRT/VTT captions for subtitling videos
  • 🔭 Analytics can identify top visitor countries to translate captions accordingly
  • 🚀 Skill Leap AI has a catalog of 200+ AI tutorials including ChatGPT prompts
  • ⏱ Using Descript and DeepL saves money and time on transcription/translation
  • 💡 Integrating the two tools makes content accessible to more people globally
  • 🎥 The workflow enables creating multi-language versions of videos easily

Q & A

  • What two AI tools does the narrator use to transcribe and translate video and audio files?

    -The narrator uses Descript for transcription and deepl.com translator for translation.

  • What file formats can you export the transcript to from Descript?

    -From Descript you can export the transcript to plain text, Microsoft Word doc, SRT captions file, and VTT captions file.

  • How accurate is the AI transcription in Descript?

    -The narrator says the AI transcription in Descript is incredibly accurate after testing it on 12 hours of video.

  • What languages can you translate to with deepl.com translator?

    -Deepl.com translator supports over 30 languages including Spanish, Chinese, Portuguese, Japanese and Italian.

  • How can translated caption files be used?

    -The translated caption files can be used to add subtitles in different languages to videos on YouTube or on the narrator's own platform.

  • How can you use Google Analytics with translated captions?

    -Google Analytics shows the top visitor countries. Captions can be translated to the top 10 languages to make the platform accessible globally.

  • What website does the narrator mention that has AI courses?

    -The website is skillleap.ai which has hundreds of tutorials on AI tools like ChatGPT and content creation platforms.

  • What editing can you do to transcripts in Descript?

    -In Descript you can highlight text to correct errors and also add punctuation. Edits to transcripts propagate to associated video/audio.

  • What is the benefit of the AI voice overdub feature in Descript?

    -The voice overdub feature lets you train the AI with your own voice so it can overdub parts of the audio that have mistakes.

  • What is the limitation on translating long text with the free version of deepl.com?

    -The free version of deepl.com has a character limit per translation. For long text files an upgraded account is recommended.

Outlines

00:00

😀 Transcribing and Translating Videos with AI

The first paragraph introduces two AI tools for transcribing and translating video and audio files to save time and money. It focuses on Descript, an AI-powered transcription tool that can also edit transcripts to alter videos and audio. The transcript is exported to make edits in Word and generate caption files for subtitles.

05:01

😃 Translating to Multiple Languages

The second paragraph shows how to use the DeepL translator to quickly translate the English transcript into over 30 languages like Spanish, Chinese and Portuguese. Translated caption files can then provide subtitles in different languages based on website visitor locations tracked in Google Analytics.

Mindmap

Keywords

💡transcription

Transcription refers to the process of converting speech (either from audio or video files) into text format. In the video, the narrator shows how the Descript software can automatically transcribe video and audio files with high accuracy using AI. This allows efficient conversion of large volumes of speech content into text, saving huge amounts of time and money. The transcribed text can then be edited, translated, exported, or used to generate subtitles.

💡translation

Translation refers to converting text content from one language into another language. The narrator demonstrates the DeepL translator website, which can rapidly translate the English transcript into over 30 languages like Spanish, Chinese, Portuguese etc. This allows the content to be made accessible to global audiences in their native languages. Combined with transcription, it enables automated multilingual subtitling.

💡editing

Editing refers to making changes and corrections in the automatically generated transcript text to improve accuracy. The Descript software allows the user to simply highlight and correct any errors in the transcript via a simple text editor. This saves time compared to manually transcribing or editing from scratch.

💡export

The narrator shows how both the edited transcript text and the translated versions can be exported and saved in various file formats like Word, Plain text, SubRip subtitle file etc. These output files can then be used for other downstream needs - upload subtitles, put transcript on website, analyze text data.

💡accessibility

A key benefit highlighted in the video is making content accessible to wider audiences across languages through subtitling and translation. The described workflows enable automated multilingual translations and subtitles at scale. The narrator gives the example of identifying top visitor languages using Google Analytics and creating matching subtitle files to increase accessibility.

💡automation

A core theme emphasized throughout the video is utilizing AI to automate manual and repetitive tasks like transcription, translation and subtitling. Humans no longer need to carry out this time-intensive work. This massively boosts productivity and efficiency for processing large volumes of content.

💡accuracy

The narrator points out the high accuracy in transcription and translation achieved by AI tools like Descript and DeepL. For transcripts, any minor errors can also be easily edited and corrected. So automation does not mean completely sacrificing quality.

💡workflows

The video focuses on step-by-step workflows combining multiple AI tools to produce transcriptions, translations and subtitles with great efficiency. Understanding and implementing such human-AI collaborative workflows is key to unlocking the power of existing technologies.

💡cost saving

A major benefit highlighted in the beginning is the significant time and money savings enabled by AI transcription and translation. Instead of expensive manual effort, content can be processed automatically at scale, with no restriction on volume.

💡multilinguality

The automated translation capability shown in DeepL to convert transcripts into over 30 languages highlights how AI can make content truly multilingual and global. This removes language barriers faced by various international audiences.

Highlights

Uses Descript to transcribe audio and video files automatically with AI

Descript allows editing transcripts to correct any mistakes

Exports transcripts to Word docs, subtitles, captions

Uses DeepL to translate transcripts instantly into over 30 languages

DeepL translations are incredibly fast, just a few seconds

Can create translated subtitles/captions for videos

Translations allow reaching wider international audiences

Skill Leap AI provides many AI tutorials and courses

Descript has tons of advanced audio/video editing features

Fixes to transcripts propagate to associated video/audio

Large volumes of content can be processed

Analytics show top visitor countries to focus translations

Creates accessible, multi-language platforms

Saves money compared to human translation services

Saves time compared to manual transcription

Transcripts

play00:00

with the power of two different AI tools

play00:02

that I want to share with you in this

play00:03

video I was able to transcribe and

play00:06

translate most of my video files right

play00:08

now and also my audio files too I want

play00:10

to show you my exact workflow it will

play00:13

save you ton of money and ton of time so

play00:15

the first one is for transcription and

play00:18

they have ton of free minutes that you

play00:20

could use this is called descript I'll

play00:22

go ahead and Link it on screen and Below

play00:24

but this option basically does a lot

play00:26

more than transcribing I'm only gonna

play00:29

show you just a brief transcription

play00:30

version and then we'll use the

play00:33

translation AI tool after this one so

play00:36

once you get dscript installed all I

play00:38

have to do is click on new project here

play00:40

and here I chose the video option but

play00:42

you could choose the audio option so now

play00:44

all I have to do is take my file from my

play00:46

computer and drop it right here so

play00:48

here's a file so I'm gonna grab it right

play00:50

here and I'm gonna drag and drop it

play00:52

right here and usually takes just a few

play00:55

seconds this is a two minute video file

play00:57

and right here I'm gonna go ahead and

play00:59

choose my name one option this has is it

play01:03

basically lets you do this AI voice

play01:05

overdub so you could train it with your

play01:07

own voice and overdub any part of it

play01:09

where you make a mistake again I'm not

play01:11

going to show you that in this video but

play01:12

dscript has ton of options like that

play01:14

including editing a text file which

play01:16

edits then the video file too and the

play01:19

audio file then over here you could

play01:21

transcribe this to ton of different

play01:23

languages so it will translate and

play01:25

transcribe but I'm going to show you a

play01:27

tool for that part in a second right now

play01:29

I'm just going to choose English for the

play01:31

English transcription this is in English

play01:33

the video and as you can see over here

play01:35

is the entire text I'm going to put it

play01:37

on mute here but basically if I just

play01:39

press play it's going to follow along

play01:41

with the video word by word and it is

play01:44

incredibly accurate this is not done by

play01:48

any humans right now this is just Ai and

play01:50

as I've been watching this I've done it

play01:52

with close to 12 hours worth of videos

play01:55

just in the last week okay but all I

play01:57

have to do is if I wanna correct

play02:00

something I just highlighted and then

play02:02

the correct option here is right here

play02:04

and I'm going to I'm just going to

play02:07

select that and press enter and it's

play02:09

going to change that for me so I could

play02:10

quickly go ahead and change anything

play02:13

that I see as I go through it so right

play02:15

here I could add a period here and I

play02:18

could go ahead and capitalize this again

play02:20

real quickly I could make changes and

play02:22

then it's going to follow along with

play02:24

everything I'm saying as I go through

play02:26

this whole process okay now I'm going to

play02:28

go to publish here and I'm going to go

play02:31

to export I could actually export the

play02:33

video or audio file if I alter this in

play02:35

any way so if I edited sections out it's

play02:38

going to let me just edit the fixed

play02:40

version of the video or audio but here I

play02:43

just want the transcript and I have lots

play02:45

of different file formats just plain

play02:47

text or Microsoft doc here I'm just

play02:49

going to use the Microsoft Word doc here

play02:51

and I'm going to save this file and then

play02:54

if I want the caption file this lets you

play02:56

download the SRT file or the vtt file

play02:58

this is how I subtitled my video so I

play03:01

usually go ahead and Export one of those

play03:02

as well I could use those on YouTube for

play03:05

example or on my own platform for a

play03:07

caption subtitle file in addition to the

play03:09

transcript now let me go ahead and open

play03:11

the transfer script I'll go to edit over

play03:13

here and select all and then I'll go

play03:16

ahead and right click over here and copy

play03:18

everything now here's the second

play03:20

platform that I'm going to show you for

play03:22

the translation portion so all you have

play03:24

to do is go to deepl.com translator and

play03:29

they do have a lot of free credits here

play03:31

as well but I did get the upgraded

play03:33

version because I usually have much

play03:35

longer text files that I'm going to

play03:36

import here they do have a limit on how

play03:39

much you could do at one time if you

play03:40

want to keep using it for free you will

play03:42

hit that limit so I'm gonna paste over

play03:44

here and this is my English detected and

play03:47

did you see over here the Spanish

play03:49

version just appeared just like that I

play03:51

got the Spanish version it took not even

play03:54

one second for this to translate from

play03:57

English to Spanish

play03:59

and here's the most interesting part I

play04:02

could select this option and I could

play04:04

click any other language I believe they

play04:06

have over 30 right now and look at this

play04:08

just like this two seconds later I got

play04:11

the Chinese version

play04:13

if I want the Portuguese version two

play04:15

seconds later I got the Portuguese

play04:17

version all these I could go ahead and

play04:20

press command or control a to select I

play04:22

could go ahead and right click and copy

play04:24

and then I could save it in the word doc

play04:26

or any type of format that I want to

play04:28

save it as I could also translate files

play04:31

from PDFs docs and PowerPoint but I

play04:33

could also do this with my caption file

play04:35

if I copy and paste that caption file I

play04:37

downloaded from the script this is

play04:39

English this is Portuguese or if I want

play04:42

Japanese or Italian here's Japanese for

play04:45

example is going to take this and turn

play04:47

it into this type of format if I save

play04:49

this as a plain text

play04:51

and then add dot SRT or dot vtt at the

play04:54

end of the file type I got myself a

play04:57

whole different language so on my

play04:59

website when I do this you could go over

play05:00

and press the subtitle option now you

play05:03

have English Portuguese Spanish Chinese

play05:04

then I could use a free platform like

play05:07

Google analytics to see where my

play05:09

visitors are coming from all the

play05:10

different countries are listed here I

play05:12

could take the top ten I could use deep

play05:14

ell I could do the caption file that way

play05:16

I could do the transcription that way

play05:18

and then I could basically make my

play05:20

platform accessible to a whole lot more

play05:22

people and the platform that I mentioned

play05:24

that I'm using this for this website

play05:25

skill leap AI is basically an entire

play05:28

catalog of AI courses and content so you

play05:31

could learn everything in the world of

play05:33

AI including how to use chat GPT

play05:35

correctly with hundreds and hundreds of

play05:37

prompts included ton of different

play05:39

tutorials on content creation platforms

play05:41

like mid-journey Runway Adobe there are

play05:45

nearly 200 tutorials if you're

play05:47

interested I'll put a link in the

play05:49

description so you can learn more I hope

play05:51

you found this video useful and I'll see

play05:53

you next time