Open AI creates PERFECT Voice Clones - Incredibly Emotive!

Sdílet
Vložit
  • čas přidán 29. 03. 2024
  • Use code MATTVIDPROAI at the link below to get an exclusive 60% off an annual Incogni plan: incogni.com/mattvidproai
    Thank you Incogni for sponsoring this video.
    ▼ Link(s) From Today’s Video:
    Open AI Voices: openai.com/blog/navigating-th...
    Grok 1.5: x.ai/blog/grok-1.5
    Elon's boast about Grok 2: / 1773655245769330757
    Universal Claude 3 Jailbreak: / 1773455789056745782
    Amazon invests in Anthropic: / 1773030824927015369
    ► MattVidPro Discord: / discord
    ► Follow Me on Twitter: / mattvidpro
    -------------------------------------------------
    ▼ Extra Links of Interest:
    ✩ AI LINKS MASTER LIST: www.futurepedia.io/
    ✩ General AI Playlist: • General MattVidPro AI ...
    ✩ AI I use to edit videos: www.descript.com/?lmref=nA4fDg
    ✩ Instagram: mattvidpro
    ✩ Tiktok: tiktok.com/@mattvidpro
    ✩ Second Channel: / @matt_pie
    -------------------------------------------------
    Thanks for watching Matt Video Productions! I make all sorts of videos here on CZcams! Technology, Tutorials, and Reviews! Enjoy Your stay here, and subscribe!
    All Suggestions, Thoughts And Comments Are Greatly Appreciated… Because I Actually Read Them.
    -------------------------------------------------
    ► Business Contact: MattVidProSecond@gmail.com
  • Věda a technologie

Komentáře • 300

  • @MattVidPro
    @MattVidPro  Před měsícem +5

    What do you guys think of Open AI's Voice tech? Use code MATTVIDPROAI at the link below to get an exclusive 60% off an annual Incogni plan: incogni.com/mattvidproai Thank you Incogni for sponsoring this video.

    • @HiddenPalm
      @HiddenPalm Před měsícem

      I believe Elon Musk being a staunch supporter of the most horrid genocide of your lifetime, should be a warning your audience should get before marketing or reviewing his properties and assets to the public.
      Over 32,000 Gazans have been mass murdered in a US-sponsored genocide, 2/3rds of which are women and children. This genocide is still ongoing.

    • @seakyle8320
      @seakyle8320 Před měsícem +3

      german has a really really strong US accent.

    • @shinniehildebrand
      @shinniehildebrand Před měsícem +2

      Can confirm about the German... very strong US accent, especially on how it pronounces the Ls

    • @GaryJr530
      @GaryJr530 Před měsícem +2

      bro so close to 250k!

    • @NotThatVinny
      @NotThatVinny Před měsícem +1

      Looking forward to it.
      I can say that it's better at translation than I can muddle through it.
      I would like to see a feature where you can adjust the tone and gradient on certain parts of the audio though.

  • @huni_19
    @huni_19 Před měsícem +36

    I'm from Kenya. Swahili is our native language while "Sheng" is popular slang. They did a good job. I'm impressed!

  • @FunIsGoingOn
    @FunIsGoingOn Před měsícem +63

    Well as a german I was impressed by the spanish one and disappointed by the german one, that sounded like a dutch trying german.

    • @cyroc6705
      @cyroc6705 Před měsícem +16

      The German one has a good natural rhythm to it, but the voice has a distinct accent which makes it sound non-native

    • @highdefinist9697
      @highdefinist9697 Před měsícem +10

      Yeah the German was pretty bad... it sounded like some weird, inconsistent English accent: Some words basically fine, some were only slightly off, and a few, for example "Kulturen" and "alle" had a really strong American accent, and the accent was the same every time she said the same word. (Also, I believe I have seen enough Anime to judge that the Japanese voice likely suffers from the same problem...)

    • @googleSux
      @googleSux Před měsícem +3

      Bin deiner Meinung.

    • @seakyle8320
      @seakyle8320 Před měsícem +1

      Meddl loide

    • @fodiographer
      @fodiographer Před měsícem +1

      Ich spreche Deutch sehr gut mein freund

  • @AISpeculator
    @AISpeculator Před měsícem +93

    Clearly NOBODY is reading the blog post... when Voice Engine does translation it RETAINS the accent of the original speaker from their native language. Feature, not bug.

    • @user-vj5fb3ig4z
      @user-vj5fb3ig4z Před měsícem +12

      yeah, but why would anybody want that "feature". not me. I don't see any use for it.

    • @RackerTheRascalMashup
      @RackerTheRascalMashup Před měsícem

      @user-vj5fb3ig4z then Don't use it lol

    • @justinwescott8125
      @justinwescott8125 Před měsícem +19

      I like hearing people with accents. I don't want every person I talk to, to have the exact same accent.

    • @thepermman
      @thepermman Před měsícem +6

      @@user-vj5fb3ig4z Imagine Mr. Miyagi sounding like Arnold from Happy Days. Accents are charming.

    • @Cradien
      @Cradien Před měsícem +5

      @@user-vj5fb3ig4z
      I can imagine it could be useful for dubbing. For example MrBeast dubs his videos into Spanish and Portuguese and posts them on different channels, with something like this he could have those dubbed video be in HIS voice

  • @tylerchambliss8379
    @tylerchambliss8379 Před měsícem +11

    Hey Matt. As far as the audio of the voice engine sounding low quality if you listen to the audio they're feeding it that's why. That audio sounds like some teacher recording in a room on a crappy laptop mic. That's actually the impressive part not only is it very emotionally and phonetically accurate to how the guy in the source recording sounded but it's also mimicking the sort of edited sound of the audio and the conditions of the recording. As an audio engineer I find this insane.

  • @WeissM89
    @WeissM89 Před měsícem +4

    I'm surprised no one's talking about how cool this is for patients with speech impairments.

  • @DarkandTwisted
    @DarkandTwisted Před měsícem +6

    Too many safety concerns with OpenAI. That's the only reason I am not too excited.

  • @amkire65
    @amkire65 Před měsícem +3

    In some cases, where the generated audio sounded low-quality, the original didn't sound like a studio recording, either. I guess, it was doing as good as it could with what it had to work with. Amazed by the fact that you can "give someone back their voice" using such a small amount of audio content, and the way people are always recording themselves these days, we probably all have at least 18 seconds of audio... if not, put some aside as an archive for the future, just in case.

  • @ahmedkagabo
    @ahmedkagabo Před měsícem +18

    The German and French versions were not good. I was a bit surprised by the Swahili version, which was a bit better. Open AI still has a lot of work to do on non-English languages.

  • @blackestjake
    @blackestjake Před měsícem +6

    🎵 Everyone together, sing it with me! 🎵
    🎵This is the worst it’s ever gonna be! 🎵

  • @draken5379
    @draken5379 Před měsícem +4

    The thing is, Eleven Labs, is a product, not research, if that makes sense.
    Those samples from OpenAI, are the raw outputs from the model. Where as something like Eleven Labs, you can be sure they have a insane pipeline to take the raw outputs from their models, and clean them up. You could even create custom neural networks for this task etc.
    Also, you can try Voice Engine. You can use it via OpenAIs APIs, but you dont get to provide a reference, you can only pick from a selection of provided voices. Its what powers chatGPT voice.

  • @TheSopk
    @TheSopk Před měsícem +4

    The most important aspect is the input; if you have an emotional voice in the input, the output would sound amazing. I'd like an AI that enhances voice input to make it more emotional. The input at 7:15 sounds monotone.

  • @vainezaiven6677
    @vainezaiven6677 Před měsícem +2

    I mean, if they're going to wait until all of their "conditions" are met before they release this voice engine, then they're never actually going to release it.

  • @AkariTheImmortal
    @AkariTheImmortal Před měsícem +2

    The German one, while I do like the intonation and all, it definitely has a strong accent. Without that accent, this could've been the best AI generated voice translation, I've heard.

  • @gerkim3046
    @gerkim3046 Před měsícem +3

    that swahili one is amazing! one can tell it is ai generated but it is still so good.

  • @surreal_dreams
    @surreal_dreams Před měsícem +29

    French here. I confirm that the french voice does have a weird accent, but that's honestly still very good.

    • @dahozabich
      @dahozabich Před měsícem +6

      The french had a hint of an american english person trying to speak canadian french. I am fluent in both.

    • @DeGandalf
      @DeGandalf Před měsícem +1

      Same for german

    • @MarcusBuer
      @MarcusBuer Před měsícem +1

      Same for portuguese. It sounds right, but has stops at the wrong places.

    • @testboga5991
      @testboga5991 Před měsícem +1

      German has the same weird American accent

    • @cagnazzo82
      @cagnazzo82 Před měsícem +2

      @@testboga5991 I'm leaning towards the native accents being intentional. In a way it sounds more authentic.

  • @3rdeyesociety
    @3rdeyesociety Před měsícem +1

    @MattVidPro my boy done sauced up in that sponsor message, chain looks good bro 💪

  • @esuus
    @esuus Před měsícem +2

    German: light accent, like an American who's lived in Germany and spoken German fluently for 3+ years. This is also what chatGPT sounds like when it speaks German.
    French: light accent, maybe a tiny bit thicker than German.
    I don't speak Spanish but that accent sounded very very heavy to me.
    Is this because they were trained with American voice actors speaking other languages, or does this happen naturally when an english trained model speaks another language? That would be fascinating.

  • @MadsterV
    @MadsterV Před měsícem +1

    Spanish reference sounded stilted and unnatural, while the generated audio sounded VERY natural. Weird.
    Spanish from English reference had a slight English accent, which is very interesting and I hope it keeps doing that.

    • @claudioestevez61
      @claudioestevez61 Před měsícem

      I confirm this. The first Spanish already sounds generated and the AI sounds more natural in comparison. In the second sample, the voice has an English accent.

  • @sarahhardwig2765
    @sarahhardwig2765 Před měsícem +17

    As a regular ChatGPT voice chat user, I can definitely tell that the quality of the audio generated from the reference audio is very reminiscent of GPT voice chats. It doesn't necessarily have the best quality, but I know it can be better, as proven by companies like ElevenLabs. And another thing. I think ElevenLabs translation feature can be a little bit iffy when it comes to how natural a person's voice sounds once it's translated to another language. However, for Voice Engine in particular, I was very shocked to hear how natural a voice still sounded after being Used to translate something else into another language. I also found the Americanized pronunciation of some words in other languages (German, Chinese, Spanish, and others) to be particularly funny, but I think AI can definitely progress past that point.

    • @justinwescott8125
      @justinwescott8125 Před měsícem +4

      Everyone is missing this, but the accent is on purpose. The blog post says that the languages will retain the accent of the original speaker.

    • @michaelsimonsen2017
      @michaelsimonsen2017 Před měsícem +2

      @@justinwescott8125 I'm really happy about that. Retain accents so people can express there backgrounds.

    • @kuromiLayfe
      @kuromiLayfe Před měsícem +1

      Heard many multilangual people speak and their accents and tone of voice tend to differ between languages.. if the voice has the same accent and tone, it most likely is AI generated and not a recording

    • @brexitgreens
      @brexitgreens Před měsícem

      ​@@kuromiLayfe Only in Japanese. Sounding like an idiot is mandatory in that language. Joking aside, you are right. However I strive to maintain the same voice and prosody across all languages without hurting pronunciation. I just steer clear from Japanese.

    • @brexitgreens
      @brexitgreens Před měsícem

      ​@@kuromiLayfe Only in Japanese. Sounding like an idiot is mandatory in that language. Joking aside, you are right. However I strive to maintain the same voice and prosody across all languages without hurting pronunciation. I just steer clear of Japanese.

  • @ananthakrishnank3208
    @ananthakrishnank3208 Před měsícem

    A small dedicated segment covering the overall flow of the model architecture would be great.
    If you have the domain knowledge, it would be even greater to discuss the "why"s regarding the working of the model.
    The demos were amazing!

  • @marwangs686
    @marwangs686 Před měsícem +14

    Open Ai writing it name in the history of the beginning of artificial intelligence

    • @treudden
      @treudden Před měsícem +2

      OpenAI has been in the lead for 2 years

    • @ryzikx
      @ryzikx Před měsícem +3

      her 😂

    • @helix8847
      @helix8847 Před měsícem +3

      @@treudden Not anymore... ElevenLabs shits on OpenAI Voice and Claude 3 shits on GPT 4. While Gemini has 1 million token count and is also very good. ClosedAI better hurry up otherwise they will be left behind.

    • @helix8847
      @helix8847 Před měsícem

      How dare you assume its gender..!!

    • @bigglyguy8429
      @bigglyguy8429 Před měsícem

      @@helix8847 Geminis it terrible.

  • @matters-and-facts
    @matters-and-facts Před měsícem

    I've been using the voice generator in the Simplified app. and it sounds like me but it does have a bit of difficulty with emoting, but it's not a huge problem, so it works for me.

  • @renan777
    @renan777 Před měsícem +5

    I speak English and portuguese, and man, English with Portuguese accent is amazingly good!

    • @32rq
      @32rq Před měsícem

      I thought the one I understand might seem worst, but the Portuguese was great!

  • @JhonataCosmo
    @JhonataCosmo Před měsícem +3

    I'm Brazilian and the Portuguese part wasn't as good as ElevenLabs.

  • @Black-Re4per
    @Black-Re4per Před měsícem +4

    The German sounds like someone with a very heavy American accent, but otherwise it was correct.

  • @Jay33721
    @Jay33721 Před měsícem

    Man, you should really get the DarkReader extension. This video was very very bright lol.

  • @SebSenseGreen
    @SebSenseGreen Před měsícem +2

    French one has an accent but it's really good, like a non-native with a high level French.

  • @Dron008
    @Dron008 Před měsícem

    That's great that we have a competition here. We'll see soon what Meta and Apple show.

  • @angelgarcia3410
    @angelgarcia3410 Před 20 dny

    *whispers* Did my phone start imitating people?

  • @tehPlacebow
    @tehPlacebow Před měsícem

    Yo matt! Im curious what your opinion is on the best local TTS software? :D

  • @tanahirygallardohuizar3981

    Wild, is Alexa's new gig gonna be a voiceover actor?

  •  Před měsícem

    🇫🇷🇪🇸 For French and Spanish, there's a strong American accent while speaking these languages, hope trained data gets broader to improve audio generation !

  • @Glowbox3D
    @Glowbox3D Před měsícem +1

    I love how Anthropic pulls ahead, great competition all around, we all win. I've had GPT4 for some time now, I've loved it's abilities, but Dalle being added to the package is the clincher. If Opus added an image gen to their product, I would definitely move over to them. That is...until SORA comes out...see? What do we do?

    • @AkikoAika
      @AkikoAika Před měsícem

      Worth adding also: Not that I'm super into benchmarks (I feel a bit guilty nitpicking on this): When mentioning the domination of Claude 3 Opus even in comparison to GPT-4, this is in comparison to GPT-4's original paper back in early 2023. From what I understand, GPT-4 Turbo is much better, e.g. on HumanEval & others (can search up "EvalPlus Benchmark", which also has the original HumanEval benchmark).

    • @brexitgreens
      @brexitgreens Před měsícem

      Do we all win by Stability AI collapsing under the weight of competition? I'm not sure about that.

    • @Glowbox3D
      @Glowbox3D Před měsícem +1

      @@brexitgreens bit bummed, my buddy works over there, and I'm rooting for 'em!

    • @brexitgreens
      @brexitgreens Před měsícem +1

      @@Glowbox3D Only bad guys are _not_ rooting for them.

    • @brexitgreens
      @brexitgreens Před měsícem +1

      Speaking of Anthropic getting their own image generator - they are allied with Amazon and Amazon already has their own named Titan. Not many people know. In terms of quality, it's between DALL·E 2 and 3. Comparable to SD XL.

  • @sunnywest28
    @sunnywest28 Před měsícem +14

    As someone fluent in Japanese, the Japanese audio you showed sounded very foreigner sounding and not Japanese. Not good quality 😭

    • @justinwescott8125
      @justinwescott8125 Před měsícem +8

      That's on purpose. The blog post specifically says that the accent of the original speaker is maintained period it's supposed to sound like an American speaking Japanese.

    • @southcoastinventors6583
      @southcoastinventors6583 Před měsícem

      Japanese is 100% pure 外人 or put another way 日本語上手. Was waiting for it to say さようなら at the end.

    • @brexitgreens
      @brexitgreens Před měsícem

      "The Japanese audio didn't sound Japanese as someone fluent in Japanese"? Maybe try to learn English first.

  • @cesarsantos854
    @cesarsantos854 Před měsícem

    I can confirm Portuguese sounds natural.

  • @MoDs_3
    @MoDs_3 Před měsícem

    Looks like we've all been successfully SHOCKED! 😅
    Amazing!

  • @leandro3710
    @leandro3710 Před měsícem +2

    Brazilian here, brazilian portuguese is sounding very good!

    • @Kiiush
      @Kiiush Před měsícem +1

      It seemed robotic to me, I mean, without emotion, IDK, it was a bit weird the way he was finishing the sentences

    • @goldenhok
      @goldenhok Před měsícem +3

      ​@@Kiiush Brazilian here, normally people talk more robotic in a studio setting even the reference audio is not that normal sounding, in a studio you normally try to be very formal and say every syllable in this monotone way, which is not how people talk irl

  • @NirvanaFan5000
    @NirvanaFan5000 Před měsícem +1

    the voices have a good cadence but low overall clarity quality... still very impressive

    • @NirvanaFan5000
      @NirvanaFan5000 Před měsícem +1

      p.s. once we get good translation and audio for all the world languages, it's going to have a huge impact. e.g. I work with immigrants from east africa. many barely speak english and may have never used a computer in their life. it is very difficult for them to learn to use. having a computer they can just talk to in their native language can mean the difference between computer usage or none at all.
      Right now we have good auto-translate for around 100 languages (which do represent the majority of the planet), but researchers are now working on the next 1,000. (Then there are still a lot of tiny, local languages.)

  • @Scott-Zakarin
    @Scott-Zakarin Před měsícem

    To me, the voice generator sound like a voice generator for it's emotive capabilities. Far from human.

  • @IceMetalPunk
    @IceMetalPunk Před měsícem

    That is the most realistic TTS I've heard so far! How much do you want to bet *this* is the model being used in Figure 01?

  • @hitmusicworldwide
    @hitmusicworldwide Před měsícem +1

    The Chinese is a hair better than the French, Japanese and German accent wise. The original English model occasionally overcomes the actual Chinese weights. A- for the Chinese. B+ for the others to me. Portuguese is an A match to the initial voice. Forget emotive I'm happy about diversity. Pi P8 is the standard for diversity in English.

  • @RobinRehmann
    @RobinRehmann Před měsícem +18

    The german wasn‘t realy good

    • @AiVaultGuy
      @AiVaultGuy Před měsícem

      im a spanish and portuguese native and the voice pronunciation sounds horrible, not natural at all

    • @1isaacmusic
      @1isaacmusic Před měsícem

      I'm kinda wondering how this would handle speech when it comes to text like a list of ingredients off a cereal box. Would sound odd being emotive

    • @smartduck904
      @smartduck904 Před měsícem

      I could tell too they sound very robotic

    • @alexkaa
      @alexkaa Před měsícem +1

      True. German was bad...

    • @resumindo857
      @resumindo857 Před měsícem +2

      Spanish neither

  •  Před měsícem

    The Portuguese had inflections in the wrong places, it's pretty good, tho.

  • @wenhanzhou5826
    @wenhanzhou5826 Před měsícem +2

    The mandarin version sounds like an English speaker who got into university studying Chinese for couple of years.

    • @bastienpetit5161
      @bastienpetit5161 Před měsícem +1

      Did the ai nailed the tones at least ?

    • @ruizhao5057
      @ruizhao5057 Před měsícem +1

      @@bastienpetit5161 It did nail the tones, that's a low standard for ai though.

    • @ChristianIce
      @ChristianIce Před měsícem

      I guess that was the point.

  • @gizmomismo7071
    @gizmomismo7071 Před měsícem

    In Spanish, it clearly has an English accent, but it sounds very natural... love it!

  • @nachod9772
    @nachod9772 Před měsícem +3

    as a native speaker i can tell spanish version is so fcking good

    • @brexitgreens
      @brexitgreens Před měsícem

      Finally someone using the "as" construction correctly: with the subject ("I") agreeing in both clauses. Very rare in 2024.

  • @XetXetable
    @XetXetable Před měsícem

    When they say "preset voices", I'm pretty sure they're referring to the built-in TTS voices that all OSs come with by default. You know, Microsoft Sam and friends; the light-weight handcrafted roboty voice that screen readers default to.

  • @ThomasJDavis
    @ThomasJDavis Před měsícem

    Killer App: voice cloning for texting.

  • @blindstreet
    @blindstreet Před měsícem

    The audio quality depends on the source quality being fed into it.

  • @Jossie_188
    @Jossie_188 Před měsícem

    In Chinese style, this situation is called "Million Model Warfare".

  • @damondragon324
    @damondragon324 Před měsícem +1

    The german one has a strong accent. But it's understandable.

  • @xponentialdesign
    @xponentialdesign Před měsícem

    the french voice sounds like its read by an english locutor

  • @Youtuber-lh3ky
    @Youtuber-lh3ky Před měsícem

    The Spanish translation has a very strong accent.

  • @CozyChalet
    @CozyChalet Před měsícem

    I am waiting for a time that I could listen to the text part of my ebooks with ease. I use Apple’s screen reader but it’s painful.

  • @Joe-SoftwareEngineer
    @Joe-SoftwareEngineer Před měsícem

    7:24 my native language is spanish, and I understand what she's saying but at times it sounds like an american who is learning spanish and hasn't fully mastered the "r" sounds. When she says "aporta" and "importar", all 3 letter "r" sound like an english "r" rather than spanish.

  • @youtube_moderator
    @youtube_moderator Před měsícem

    Parity-wise, Elevenlabs is better at most of the multilingual voice cloning, although I was especially impressed by the quality of intonation and pauses in the first English example.
    On a side note, voice recovery is not new - it's just voice cloning from old footage but it unfortunately retains the bad audio qualities from the same footage. It would have been more impressive to have just cloned the woman from her post brain-damaged voice in this particular case. Or even better blended them both together but maybe using EQ matching.

  • @HumanGamer
    @HumanGamer Před měsícem

    did I win? I didn't sign up for the contest so I must be the person who won. clearly. :P

  • @kenrock2
    @kenrock2 Před měsícem +1

    I wish Stephen hawking was alive to use this voice box

  • @rishabhsingh1406
    @rishabhsingh1406 Před měsícem

    What are you opinion on Emads leaving Stablility AI. Do you think with time Open Source will have less and less competitive.

  • @Jossie_188
    @Jossie_188 Před měsícem

    Now I know, Musk's Grok-1 is fairly early.

  • @alexanderalcantara3932

    Creepy, or the future of virtual assistants?

  • @JREinaNutshell331
    @JREinaNutshell331 Před měsícem

    I still miss the option to give a prompt besides the information i want it to voice. Something like "sound angry, sound drunk, make long pauses, etc"
    Btw: The German generated text was horrible, it sounded like an american trying to speak german.

  • @bgill7475
    @bgill7475 Před měsícem +2

    The Mandarin one was good but it sounds kinda American...

  • @aaronanimations9527
    @aaronanimations9527 Před měsícem +1

    Who do you think is still ahead of the competition matt?

  • @ChannelName227
    @ChannelName227 Před měsícem

    I can only comment on the English audio. It was surprising that the text didn't have punctuation other than periods, and it still knew where to short or long pause.

  • @Siree-bro
    @Siree-bro Před měsícem

    The second Spanish chick (AI) speaks better than the human lol sounds like a Mexican children's author. The translation from English to Spanish, has brutal pronunciation.

  • @StoreHouseApp
    @StoreHouseApp Před měsícem

    You didn't even talk about one of the most impressive features, the translated language has an accent!

  • @budekins542
    @budekins542 Před měsícem

    This will be the "SORA" of A.I voice generation😂

  • @Wasaia
    @Wasaia Před měsícem

    Just wanted to compliment you on your audio quality using the RE20. Really good clarity and not boomy.

  • @cagnazzo82
    @cagnazzo82 Před měsícem +2

    As a french and english speaker the french that they spoke was with the woman's american accent... therefore making it impressive.
    It intentionally tries to mimic the original speaker's intonations, making it sound more like them but pretty much transmitting their native tongue accent to different languages.

    • @panomaniac5399
      @panomaniac5399 Před měsícem

      Agree, it sounds like a American woman who speaks very good French, but with an accent. Not an overly strong accent, but definitely identifiable a North American English speaker.

    • @cobb8613
      @cobb8613 Před 12 dny

      Where can we try Chat Gpt voice? I’m French, and i want to try if the english accent dissapear…

  • @markmuller7962
    @markmuller7962 Před měsícem

    "On mars by next week" Elon in a nutshell

  • @ykles24
    @ykles24 Před měsícem +1

    French one is an american talking french.

  • @mralbertteacheralbert8619
    @mralbertteacheralbert8619 Před měsícem

    @7:25 It's spanish but sounds very much like a white person speaking spanish (it has an accent). The same way a foreigner sounds when they try speaking English. I have been teaching ESL 3 years, believe me I can hear the accent.

  • @fredthomson2384
    @fredthomson2384 Před měsícem

    Audible is in big trouble.

  • @alvaroluffy1
    @alvaroluffy1 Před měsícem

    its good but specially in the translation theres some englishness that filters to the translated languages, you can notice it in all languages actually

  • @martianingreen
    @martianingreen Před měsícem

    8:30 The German has a really tick accent (it basically sounds like an american one). Doesn't sound fantastic tbh, at least not if the goal is like very good dubbing / translation. But it sounds good as in AI voices go

  • @STONJAUS_FILMS
    @STONJAUS_FILMS Před měsícem

    I speak spanish and chinese and they sound like an American accent speaking those languages … but in a way thats very understandable, as if they learned the language very well but did not manage to get rid of the accent …. Did not robotic to me if thats the concern… the accent might bother some natives

  • @WesTheWizard
    @WesTheWizard Před měsícem

    I agree with a lot of the other commenters here. The voices seem to have an American accent. I'm only fluent in Japanese but I picked up an accent in most of them. You can understand the Japanese just fine, but the accent is a little cringe.

  • @robertsousasantos6766
    @robertsousasantos6766 Před měsícem

    O Português ficou perfeito, idêntico a uma pessoa real falando numa gravação real, ficou realmente perfeito.

  • @notBeggingMattandLissy2PlayRE4

    As someone who knows Spanish, that's pretty bad lol. It sounds like a strong American accent trying to speak Spanish.

  • @microcontrolledbot
    @microcontrolledbot Před měsícem

    Bro have you not been talking to OpenAI in the app. Their voices have been around for like 6months.

  • @silencedandshadowbanned7277

    Ham sandwich here I can confirm the Swahili is a weird accent

  • @Flizyx
    @Flizyx Před měsícem

    the english to spanish one is tricky, it sounds like spanish but with english accent, so not full spanish

  • @noeaguilar5945
    @noeaguilar5945 Před měsícem

    The Spanish cloned voice sounds amazing 😍, the best one I've ever heard, edit: la traducción es bastante mala

  • @claudioagmfilho
    @claudioagmfilho Před měsícem +1

    🇧🇷🇧🇷🇧🇷🇧🇷👏🏻, Wow, the song's are fricking awesome on v3, voice still needs some work tho!

  • @guerric
    @guerric Před měsícem +1

    The French one is weird. It sounds robotic and as a French I don't see what accent it is, doesn't sound like France French, doesn't sound like Canadian French, just weird
    Japanese seems to have a very heavy accent too that I can't categorize
    Spanish sounds very bad imo
    The translation keeps a weird accent that makes it sound very weird. I don't think it's the woman's American accent that is kept but a weird mix that is quite uncanny

  • @manzell
    @manzell Před měsícem

    I want to see more foreign language stuff translated into English so I can evaluate it.

  • @pierre-samuelgreau-hamard6379

    French is still a bit robotic, and the voice seems to speak with a slight english accent.

  • @nonetrix3066
    @nonetrix3066 Před měsícem +7

    I am learning Japanese but to me at least with someone that has listened to it a lot seems really strange accent wise

    • @brexitgreens
      @brexitgreens Před měsícem

      Your user image leaves no doubt about it.

  • @devlisandro
    @devlisandro Před měsícem

    GCP has something similar but not that emotional

  • @justinwhite2725
    @justinwhite2725 Před měsícem +4

    I speak French. Sounds like an Anglephone (English speaker) who has learned French (which the boiler plate on that video says is the intent)
    ... Though it still rolls the rs better than I do.

  • @muzy8768
    @muzy8768 Před měsícem

    I think the translated audio keeps the original accent and sounds a bit weird and not perfect

  • @reezlaw
    @reezlaw Před měsícem

    Weirdly, the Spanish voice had an English accent

  • @StefanSchmidtRegensburg
    @StefanSchmidtRegensburg Před měsícem

    German has a strong american accent. Just like the voice out from ChatGPT

  • @IM2awsme
    @IM2awsme Před měsícem

    I just wish I could set the playback speed 😅 I use text to speak because I read slow, I shouldn't be outpacing the ai.

  • @EQORIA
    @EQORIA Před měsícem

    What do you think about Singularity Intelligence for New Earth (QORA*) for EQORIA, United Earth? It is the first introduction of the vision and more to come... check it on youtube channel. EQORIA will begin promotion to mass media beginning December 12, 2024 on 12 year anniversary.

  • @PuppiesAreNice.
    @PuppiesAreNice. Před měsícem

    It is so strange to hear ai generated voices with an accent. like im german and the german one sounded like an american tried to read out german text

  • @jumbleblue
    @jumbleblue Před měsícem

    German has English accent. Slight. French too.

  • @AlexanderWeixelbaumer
    @AlexanderWeixelbaumer Před měsícem

    In the german example the "r" was pronounced like it was an english word. Germans pronounce the r much harder.