TTSOpenAI - Is it Really Free and Unlimited Text-to-Speech?

Excelerator
20 Jul 202407:44

Summary

TLDRThe video script explores TTS Open AI, a new entrant in the text-to-speech market, which some speculate is connected to Open AI. The narrator tests the platform's features, including various voices and a 'Story Maker' mode, noting the quality and limitations. They also discuss the free and premium plans, highlighting the need for a premium subscription for advanced features like creating stories from text. The script concludes with the narrator's positive impression of the platform's potential and the suggestion to monitor its development.

Takeaways

  • 😀 The video introduces a new text-to-speech platform called TTS open AI, which some speculate may be unrelated to Open AI but uses its technology.
  • 🔍 The presenter suggests TTS open AI might be a front-end interface connecting to Open AI's text-to-speech application, but this is not confirmed.
  • 🎙️ The platform offers a variety of voices, including 'Alloy', 'Echo', 'Fable', 'Nova', 'Shimmer', and 'Buck', with the possibility of more system voices.
  • 📚 The platform has a 'Story Maker' mode that allows users to create conversations with different speakers and settings, though some features like emotion are 'coming soon'.
  • 📈 There's a free plan and a premium plan; the free plan allows text-to-speech conversion with ads, while the premium plan offers additional features and no ads.
  • 💰 The premium plan is priced at six dollars a month and includes unlimited document-to-speech conversion, HD quality voice, and other benefits.
  • 📈 The platform uses a credit system where 1000 credits equate to 1000 characters with high-quality voice or 500 characters with HD voice.
  • 📑 The 'Document' feature allows users to upload documents in various formats for text-to-speech conversion, with an email notification upon completion.
  • 🎨 The 'Voice Library' section seems to be where users can create new voices, either by cloning their own or designing synthetic ones, but this is a premium feature.
  • 📝 The input text box has a character limit based on user status; free users can enter up to 500 characters, while logged-in users have access to 3000 characters.
  • 📝 The platform is still in development, with features like emotion addition in the 'Story Maker' mode not yet available, indicating potential for future growth.
  • 👍 The presenter personally finds the voice quality good, with a preference for casual-sounding voices, and believes the platform shows promise and is worth monitoring.

Q & A

  • What is the new player in the Texas speech game called?

    -The new player in the Texas speech game is called TTS open AI.

  • What is the relationship between TTS open AI and Open AI according to the speaker?

    -The speaker suggests that TTS open AI may not be directly related to Open AI, but uses the Open AI text-to-speech app, possibly as a front-end user interface that connects to Open AI through the app.

  • What feature does TTS open AI offer for creating a conversation?

    -TTS open AI offers a 'Story Maker' mode that allows users to create a conversation by importing an SRT file or adding a conversation manually.

  • How many voices are available to work with in the TTS open AI platform?

    -There are six voices available to work with under the Open AI voices tab, and a few more under the system voices.

  • What is the difference between the free and premium plans on TTS open AI?

    -The free plan allows speech from text, access to the voice library, and downloading audio files. The premium plan includes unlimited documents to speech, the ability to create your own voice, no ads, and up to 200 megabytes per document.

  • What is the cost of the premium plan on TTS open AI?

    -The premium plan on TTS open AI costs six dollars a month.

  • What is the character limit for the input text box on the free plan?

    -On the free plan, if you are not logged in, you can enter up to 500 characters in the input text box. Once you have created an account, you can have up to 3,000 characters.

  • What is the limitation of the 'Create Story' feature in terms of language?

    -The 'Create Story' feature only works in English.

  • What is the difference between high quality and HD quality in terms of character credits on TTS open AI?

    -High quality voice uses 1,000 credits for 1,000 characters, whereas HD quality voice uses 1,000 credits for only 500 characters, making HD voices double the character credits.

  • What additional feature is mentioned as 'coming soon' for the 'Story Maker' mode?

    -The ability to add emotions to the 'Story Maker' mode is mentioned as 'coming soon'.

  • What is the speaker's opinion on the potential of TTS open AI?

    -The speaker believes that TTS open AI is pretty new but shows potential and is worth keeping an eye on to see how it develops.

Outlines

00:00

🤖 Exploring TTS Open AI's Features and Voice Options

The script introduces TTS Open AI, a new player in the text-to-speech (TTS) market in Texas. It discusses the speculation about the company's connection to Open AI and explores the possibility of it being a front-end interface for Open AI's TTS app. The narrator tests various voices available on the platform, including 'Alloy,' 'Echo,' 'Fable,' 'Nova,' 'Teresa,' 'Buck,' 'Lisa,' 'Mia,' and 'Ella.' The script also covers the platform's limitations, such as the presence of ads and the need for a subscription to access certain features like 'Story Maker Mode.' It highlights the platform's potential and the narrator's experience with creating a conversation and selecting voices for different parts of the script.

05:01

💬 Reviewing TTS Open AI's Pricing and Free Plan Limitations

This paragraph delves into the pricing structure of TTS Open AI, contrasting the free and premium plans. The free plan allows for unlimited text-to-speech conversion with ads, access to a voice library, and basic features, while the premium plan offers additional capabilities such as creating stories from text, uploading documents for speech conversion, and accessing a larger voice library. The narrator expresses a desire for clearer communication about premium features and shares their experience with the platform's limitations, particularly the inability to create stories in HD quality without a subscription. The script concludes with the narrator's positive impression of the platform's potential and the anticipation of its future developments.

Mindmap

Keywords

💡TTS open AI

TTS stands for Text-to-Speech, a technology that converts written text into spoken words. In the context of the video, TTS open AI refers to a new service that uses this technology, possibly connected to the well-known AI company Open AI. The script suggests that this service might be a front-end user interface for Open AI's text-to-speech application, indicating a focus on converting text into natural-sounding speech.

💡Text-to-Speech

Text-to-Speech (TTS) technology is a form of speech synthesis that converts written text into spoken language. In the video, TTS is central to the service being discussed, as it allows users to generate spoken content from text inputs. The script demonstrates how different voices can be used to read various texts, showcasing the versatility of TTS in creating audio content.

💡Voices

In the context of the video, 'voices' refers to the different audio outputs available in the TTS service. The script mentions several voice options, such as 'Alloy', 'Echo', 'Fable', 'Nova', 'Shimmer', and 'Ella', each with distinct characteristics. These voices can be selected to read text, adding a personal touch to the audio content generated by the TTS service.

💡Story Maker

The 'Story Maker' mode is a feature in the TTS service that allows users to create conversations or narratives by importing text or adding dialogues. The script describes how users can set the timing, speed, and speakers for each part of the conversation, emphasizing the creative potential of the service in crafting engaging audio stories.

💡Emotion

The script mentions 'emotion' in the context of the 'Story Maker' mode, suggesting that the service might eventually allow users to add emotional nuances to the voices. This feature would enhance the realism and expressiveness of the audio content, making the conversations sound more natural and engaging.

💡Premium Plan

The 'Premium Plan' is a subscription option in the TTS service that offers additional features and benefits. According to the script, the premium plan removes ads, allows for unlimited documents to speech conversion, and provides access to creating a custom voice. It also increases the character limit for HD quality voice, indicating a more comprehensive experience for paying users.

💡High Quality vs. HD Quality

The script differentiates between 'High Quality' and 'HD Quality' in terms of the audio output. High quality voice uses fewer character credits compared to HD quality, suggesting that HD offers a higher fidelity or more natural-sounding speech. This distinction is important for users who require different levels of audio quality for their projects.

💡Free Plan

The 'Free Plan' is the basic access level in the TTS service, allowing users to generate text-to-speech content without a subscription. The script notes that free users can enter up to 500 characters in the input text box, and once logged in, up to 3,000 characters. This plan is suitable for users who want to test the service or have limited needs.

💡Document to Speech

The 'Document to Speech' feature, as mentioned in the script, allows users to upload documents in various formats to be converted into spoken audio. This feature is particularly useful for creating audiobooks, educational materials, or accessibility content, making written documents more accessible to a wider audience.

💡Voice Library

The 'Voice Library' is a collection of voice options available in the TTS service. The script describes the library as containing six Open AI voices and additional system voices, offering a range of choices for users to personalize their audio content. The ability to create a new voice or design a synthetic voice is also mentioned, highlighting the service's flexibility.

💡Character Credits

In the context of the TTS service, 'Character Credits' refer to the number of characters a user can convert into speech within a certain plan. The script explains that free users have unlimited credits for standard quality, while premium users have up to 200,000 credits, with HD quality voices requiring double the credits, indicating a costlier but higher-quality option.

Highlights

Introduction of a new player in the Texas speech game called TTS open AI.

Debate over whether TTS open AI is affiliated with Open AI or just using the name.

The possibility that TTS open AI is a front-end user interface connecting to Open AI through an app.

Demonstration of the text-to-speech process with various voice options.

Six different voice options available under the Open AI voices tab.

Additional system voices beyond the six Open AI voices.

Premium plan removes ads and offers additional features.

Story maker mode allows creating conversations with different voices and timings.

Emotion feature in story maker is 'coming soon', indicating potential future updates.

HD quality voice option requires a login and is a premium feature.

Free plan limitations and premium plan benefits detailed, including credits system.

HD voices consume double the character credits compared to standard voices.

Document to speech feature allows uploading documents in various formats for conversion.

Voice library includes options to create new voices, a premium feature.

Free account allows 500 characters input, while a created account allows up to 3000 characters.

A paragraph generated by Chat GPT to test the text-to-speech generator.

Personal preference for a casual sounding voice in the text-to-speech output.

Potential for growth and addition of more voices to the platform.

The website seems new with ongoing development and updates.

Final thoughts on the value and potential of TTS open AI, worth monitoring for future developments.

Transcripts

play00:00

we've got a new player in the Texas

play00:01

speech game and it's called TTS open AI

play00:06

some say they have nothing to do with

play00:07

open Ai and just use the name others say

play00:10

it's open ai's website I don't believe

play00:12

either of those is true what I think's

play00:14

going on here is that TTS

play00:17

open.com has created a website they use

play00:20

the open AI text to speech appy and so

play00:23

this is a front-end user interface that

play00:26

is not open AI but they connect to open

play00:29

AI through the appy maybe I'm wrong

play00:32

that's my guess let's see how it works

play00:34

already pasted some texts in here it's

play00:36

just a a little welcome from my fake

play00:38

podcast take a look at what voices we

play00:41

have the sun rises in the East and sets

play00:43

in the west this simple fact has been

play00:46

observed by humans for that is alloy

play00:48

here's echo in the Heart of the City

play00:50

there is a large Park where people go to

play00:53

relax and enjoy nature the park has a

play00:56

sounded the same to me the sun rises in

play00:58

the East and in the Heart of the City

play01:01

those are very similar we have Fable the

play01:04

library is a quiet and peaceful place

play01:06

where people go to read study and learn

play01:09

little bit of a British accent there the

play01:11

train chugged along the tracks carrying

play01:13

passengers to their destinations now

play01:16

Nova in the kitchen the aroma of freshly

play01:18

baked bread filled the air the loaves

play01:21

were golden brown and Shimmer the beach

play01:24

was a popular spot on a hot summer day

play01:26

people were swimming in the ocean so

play01:28

we've got six voices to work with there

play01:30

under the open AI voices tab over on

play01:33

system voices we have a few more Buck

play01:36

did not read the newspaper or he would

play01:38

have known that trouble was brewing Al

play01:40

and over this great demen buck ruled

play01:42

Lisa there he lay for the remainder of

play01:44

the weary night Mia for two days and

play01:47

nights this Express Car Teresa and buck

play01:50

was truly a redyed devil last but not

play01:52

least Ella as he spoke he fearlessly

play01:55

patted the head he had so mercilessly

play01:57

pounded we're going to go back to the AI

play02:00

voices and I think we'll just use the

play02:02

first one here we use alloy so we have

play02:04

that selected if we wanted to select a

play02:06

different voice for this piece of text

play02:08

we just kind of click on the speaker

play02:09

card and it gives it a bright green

play02:11

border now you do have all these ads

play02:13

here to dodge around apparently if you

play02:16

subscribe to a Premium plan the ads go

play02:19

away I can live with them as long as

play02:20

it's free click create speech welcome to

play02:23

the ridiculous rants podcast I'm David

play02:26

now we've also got this story maker mode

play02:28

and if we switch over to that we can

play02:30

create a conversation you can do that

play02:32

either by importing an SRT file which is

play02:35

like your timed caption file or we can

play02:38

click add conversation we'll start with

play02:40

this welcome to the ridiculous rant

play02:42

podcast I'm David and we have some

play02:45

options here we can have a silence

play02:46

before we can set the speed anywhere

play02:49

from uh quarter speed to 4X we need to

play02:52

click the button here in this row if we

play02:54

want to change the speaker click that

play02:57

and we can say Echo might as well do

play02:59

something a little different emotion it

play03:01

says coming soon so that's interesting

play03:04

it might give us the ability to add

play03:06

emotions there let's add a co-host so

play03:08

I'll come down I'll click this plus

play03:10

button for our co-host here we're going

play03:12

to make her female I'll pick Shimmer and

play03:14

then here in block three we're back to

play03:18

David and I used Echo for that voice

play03:20

make sure we got that all squared away I

play03:22

see we have a drop down over here that

play03:24

says high quality we can have high or HD

play03:27

quality so we click HD and click create

play03:30

story and we have to log in if we want

play03:32

to use HD so let's go back get back to

play03:35

our story here and we'll drop that we'll

play03:38

switch that back to high quality create

play03:40

story oh we have to log in to do that

play03:43

I've created an account now see if we

play03:46

can get it in HD optionally it says we

play03:48

can give the story a name I just called

play03:49

it rant test one the output format it

play03:52

says wave is coming soon so MP3 is our

play03:54

only option it's calculated the audio

play03:57

duration the voices that we're using and

play03:59

tells us that some limitations of open

play04:02

AI this function only works in English

play04:04

and by clicking the button below we

play04:07

understand the risks regarding the

play04:09

quality of the resulting

play04:11

audio risks oh that took us to the

play04:14

pricing page so I'm guessing we're

play04:15

trying to do something that isn't free

play04:17

free gives us speech from text can

play04:20

download audio files access to the voice

play04:22

library and the Premium plan create

play04:24

speech from text unlimited documents to

play04:27

speech can download of course create

play04:29

your own voice voice Library create a

play04:32

story from text that must be the issue

play04:34

that's a premium thing no ads and up to

play04:37

200 megabytes per document and on the

play04:39

free you have unlimited credits on the

play04:42

premium you have 200,000 credits down at

play04:45

the bottom it reveals that a th000

play04:47

credits is a th000 characters with high

play04:49

quality voice or 500 characters with HD

play04:53

quality voice so HD voices are double

play04:56

the character credits so it looks like

play04:58

the deal is on the free plan you can do

play05:00

as much as you want just generating text

play05:03

one at a time on that first tab but if

play05:05

you want to create a story or do a

play05:07

document to speech you're going to need

play05:09

to be on the premium at six bucks a

play05:11

month I don't think that's a bad deal I

play05:14

do wish it would have told us up here

play05:16

hey this is a premium feature and saved

play05:18

us the trouble rather than building the

play05:20

whole thing and then finding out that

play05:22

it's not a free feature the story maker

play05:24

looks promising it's similar to what we

play05:27

have in 11 Labs now with the voiceover

play05:29

Studio where we can lay out the whole

play05:31

conversation there on one screen the

play05:34

document feature here allows you to

play05:36

upload a document in a multitude of

play05:38

different formats once you upload the

play05:40

document select your speaker it'll get

play05:42

to work and send you an email when it's

play05:44

been converted so that you can download

play05:47

your file the voice library right now it

play05:49

looks like just these six open AI voices

play05:52

we do have quite a few more system

play05:54

voices again the rim Ice broke away

play05:56

before and behind there is an ecstasy

play05:59

that marks the summit of life he months

play06:01

came and went in the my voices tab it

play06:03

looks like this is where we would create

play06:04

a new voice either by cloning our own

play06:06

voice or by designing a new synthetic

play06:09

voice that's a premium feature so I

play06:10

won't be exploring that one with you all

play06:12

right let's go back and look at this

play06:13

input text box since this is the only

play06:15

place we can work really with a free

play06:18

plan if you're not logged in you're

play06:19

using a free account you can enter up to

play06:22

500 characters in this input text box

play06:25

that's going to be about a 100 words

play06:27

once you have created an account you you

play06:29

can have up to 3,000 characters in this

play06:32

box roughly 600 words I just went over

play06:35

to chat GPT and said write me a

play06:37

paragraph that I can throw in a text to

play06:39

speech generator to see how it sounds

play06:41

and this is what it came up with let's

play06:43

go pick Shimmer for this one I'm not

play06:45

sure how echko is still selected but

play06:49

echko stays selected I we'll try it

play06:51

anyway the quick brown fox jumps over

play06:53

the lazy dog this sentence contains all

play06:56

the letters of the English alphabet it

play06:59

is a beautiful day outside with clear

play07:01

blue skies and a gentle breeze rustling

play07:04

through the leaves the town is bustling

play07:06

with people going about their daily

play07:08

routines from shopping at the market to

play07:10

enjoying coffee at the local cafe birds

play07:13

chirp happily in the background adding

play07:16

to the peaceful Ambiance of the scene I

play07:18

think the voice sounded really good

play07:20

personally I prefer a casual sounding

play07:22

voice but that's my preference I imagine

play07:25

they will grow and add additional voices

play07:27

it looks like they're working on under

play07:29

that story maker being able to add

play07:31

emotion I get the feeling that this

play07:33

website is pretty new and it looks like

play07:35

there's some potential here definitely

play07:36

worth keeping an eye on and seeing how

play07:38

it goes I hope you enjoyed our little

play07:40

test drive here and I look forward to

play07:41

seeing you in the next video

Rate This

5.0 / 5 (0 votes)

Ähnliche Tags
TTSOpen AIText-to-SpeechAudioVoicesAPIContent CreationPremium FeaturesOnline ToolTech Review
Benötigen Sie eine Zusammenfassung auf Englisch?