Mr. Bongo Makes a GPT

Testing Ollama on Hard Questions

AI Can do That?? Silver Medal in Pure Math

KONČÍM CESTU NA OLYMPII A ZÁVODNÍ KARIÉRU

Co to ti kluci zase vymýšlí?🤭😅

Touching Act of Kindness Brings Hope to the Homeless #shorts

AI Speech Gets Real: BASE TTS

AI Master Group

zhlédnutí 997

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 6. 09. 2024
Amazon has introduced an amazing new model called BASE TTS (TTS = text-to-speech). These are the models that accept written text as an input, and then speak that text for us, which is what we use to create talking avatars and chatbots, among many other use cases.
BASE stands for Big Adaptive Streamable Emergent.
The top TTS models until now have been YourTTS, Bark and Tortoise-TTS. They’ve all been pushing speech synthesis closer and closer to human-like speech, so BASE from Amazon set out to beat them by training on more data than they did. It’s a billion-parameter model trained on 100,000 hours of audio data.
The video covers seven areas where text-to-speech is known to stumble sometimes. In ascending order of difficulty, those are:
1. Compound nouns
2. Syntactically-complex sentences
3. Foreign words
4. Unusual punctuation
5. Questions
6. Paralinguistics (things like groans, laughs, and whispers),
and - most difficult of all . . .
7. Emotions.
The video then presents 8 audio samples created by BASE TTS, each of which illustrates BASE TTS attempting to perform one of those especially-difficult tasks described above.
The results are quite impressive. Give a listen and see what you think!

Komentáře • 1

@AIMasterGroup Před 4 měsíci ⁺¹
As promised in the video, here's a link to the original paper with technical details about BASE TTS.
assets.amazon.science/6e/82/1d037a4243c9a6cf4169895482d5/base-tts-lessons-from-building-a-billion-parameter-text-to-speech-model-on-100k-hours-of-data.pdf
Best wishes!

Další v pořadí

Automatické přehrávání

Mr. Bongo Makes a GPT

Mr. Bongo Makes a GPT

Testing Ollama on Hard Questions

Testing Ollama on Hard Questions

AI Can do That?? Silver Medal in Pure Math

AI Can do That?? Silver Medal in Pure Math

KONČÍM CESTU NA OLYMPII A ZÁVODNÍ KARIÉRU

KONČÍM CESTU NA OLYMPII A ZÁVODNÍ KARIÉRU

Co to ti kluci zase vymýšlí?🤭😅

Co to ti kluci zase vymýšlí?🤭😅

Touching Act of Kindness Brings Hope to the Homeless #shorts

Touching Act of Kindness Brings Hope to the Homeless #shorts

TOHODLE JSTE SI V AVENGERS NEVŠIMLI #zajimavosti #avengers

TOHODLE JSTE SI V AVENGERS NEVŠIMLI #zajimavosti #avengers

Nemotron-4 is BIG in More Ways than One

Nemotron-4 is BIG in More Ways than One

(FREE) Jersey Club Type Beat x Sexy Drill Type Beat - "In Your Eyes"

(FREE) Jersey Club Type Beat x Sexy Drill Type Beat - "In Your Eyes"

Segment of One - Now it’s Real

Segment of One – Now it’s Real

Can Robots Win at Table Tennis? Take a Look!

Can Robots Win at Table Tennis? Take a Look!

Hacking Passwords with ChatGPT?

Hacking Passwords with ChatGPT?

Behind the Curtain of Figma AI

Behind the Curtain of Figma AI

What is AGI? --the Ultimate Test!

What is AGI? --the Ultimate Test!

How a Language Model Aced a Top Leaderboard

How a Language Model Aced a Top Leaderboard

Will Open-Source Llama Beat GPT-4o?

Will Open-Source Llama Beat GPT-4o?

Секрет фокусника! #shorts

Секрет фокусника! #shorts

Wait for it… 😱 #shorts

Wait for it… 😱 #shorts

ZPOVĚĎ POLICISTY: "Mrtvoly skoro každý den, jednou s námi komunikoval muž s ustřelenou půlkou hlavy"

ZPOVĚĎ POLICISTY: "Mrtvoly skoro každý den, jednou s námi komunikoval muž s ustřelenou půlkou hlavy"

KONČÍM CESTU NA OLYMPII A ZÁVODNÍ KARIÉRU

KONČÍM CESTU NA OLYMPII A ZÁVODNÍ KARIÉRU

Cola + Mentos = Exploze

Cola + Mentos = Exploze

Gli occhiali da sole non mi hanno coperto! 😎

Gli occhiali da sole non mi hanno coperto! 😎

I play this like Cristiano Ronaldo⚽❓

I play this like Cristiano Ronaldo⚽❓