ChatTTS - Best Quality Open Source Text-to-Speech Model? | Tutorial + Ollama Setup

Sdílet
Vložit
  • čas přidán 25. 06. 2024
  • ChatTTS - Best Quality Open Source Text-to-Speech Model? | Tutorial + Ollama Setup
    👊 Become a member and get access to GitHub and Code:
    / allaboutai
    🤖 Great AI Engineer Course:
    scrimba.com/learn/aiengineer?...
    🔥 Open GitHub Repos:
    github.com/AllAboutAI-YT/easy...
    📧 Join the newsletter:
    www.allabtai.com/newsletter/
    🌐 My website:
    www.allabtai.com
    Code in Video:
    github.com/AllAboutAI-YT/chatTTS
    ChatTTS GitHub: github.com/2noise/ChatTTS
    Tutorial on how to install and use the Open Source TTS model ChatTTS. Very fun to play around with and a great addition to the OS TTS landscape.
    00:00 ChatTTS Intro
    00:57 Install / Setup ChatTTS
    03:30 Testing
    06:28 ChatTTS with Ollama (Llama 3)
    08:58 Conclusion
  • Věda a technologie

Komentáře • 72

  • @neolynxer
    @neolynxer Před 20 dny +13

    Great stuff. Please implement a "a... yeah" counter in your videos! :D should be fun.

  • @john5s
    @john5s Před 19 dny +14

    I found that by setting a seed I can keep the voice sounding the same.
    chat = ChatTTS.Chat()
    chat.load_models(compile=True) # Set to True for better performance
    torch.manual_seed(seedNumber)

    • @2099EK
      @2099EK Před 18 dny

      You're a hero! I wish more people would check out this comment. Thank you for posting this!

  • @Ms.Robot.
    @Ms.Robot. Před 21 dnem +4

    Totally nailed this tut.This was very well explained. It was organized in order, easy to understand step-by-step instructions, and it addressed important points in case we run into obstacles.❤
    (Still waiting for gpt4o voice too. I hope customizable voices are an option.)

    • @AllAboutAI
      @AllAboutAI  Před 20 dny

      thansk a lot :D really appericiate it

  • @DaCashRap
    @DaCashRap Před 18 dny

    Uh yeah, great video overall!

  • @darcwader
    @darcwader Před 21 dnem +1

    very nice

  • @joachimschoder
    @joachimschoder Před 19 hodinami

    The project libukai/Awesome-ChatTTS has a more extensive documentation. It is in Chinese but Google Chrome can automatically translate at least the text elements. It doesn't replace a good english documentation but it is a good starting point.

  • @micbab-vg2mu
    @micbab-vg2mu Před 21 dnem +1

    the quality is great - I have to try it

  • @luisvictorf
    @luisvictorf Před 21 dnem +1

    cool story about the underwater cats, would've liked to hear a bit more; it would make a good kids story! =D

  • @EricB1
    @EricB1 Před 20 dny +2

    Great find. One suggestion: remove all [yeah] from your script.

  • @ArseniyPotapov
    @ArseniyPotapov Před 20 dny

    XTTSv2 is also very good model, highly recommended

  • @mendthedivide
    @mendthedivide Před 21 dnem +6

    works well, but only with smaller wording/sentences. almost sounds real at times, nice find!

    • @AllAboutAI
      @AllAboutAI  Před 20 dny +1

      thnx :) yeah i think the token limit is like 380

  • @legendarystuff6971
    @legendarystuff6971 Před 20 dny +2

    By default it generates a voice randomly from gaussian noise, you can definitely chose the voice somehow, look over the example they have on their repo and ask Opus for help. Their materials combine English and Chinese which makes it a bit annoying. On they're bilibili video, whatever bilibili is, they even clone Steve job's voice and Taylor swift. Nice find, thank you

    • @2099EK
      @2099EK Před 18 dny

      You'd've been a hero if you just put that here.

  • @korni5149
    @korni5149 Před 20 dny +1

    How can I achieve this gradient text color and text appearing animation in Windows Terminal?

  • @user-um2uq9nh4z
    @user-um2uq9nh4z Před 21 dnem +1

    Hi! If english is still experimental then french or german...

  • @elgodric
    @elgodric Před 19 dny

    Is there a webui or something for non coders

  • @mrpro7737
    @mrpro7737 Před 21 dnem +1

    very nice base voices i can voice chnge them to any model i want

    • @_zproxy
      @_zproxy Před 20 dny

      how

    • @AllAboutAI
      @AllAboutAI  Před 20 dny

      do you know how?

    • @mrpro7737
      @mrpro7737 Před 20 dny

      @@AllAboutAI
      i am using this open source project
      h§ttps§://youtu.§be/nXpBlC6OBw4?si=UOQ86u97CLp0BgJN
      for training models and voice change because its only support audio to audio
      and applio§.org for text to speech because it support so many accents and languages
      remove the § from urls , i post this comment 3 times and yt keep deleting it 😑

    • @mrpro7737
      @mrpro7737 Před 20 dny

      @AllAboutAI
      i am using this open source project
      h§ttps§://youtu.§be/nXpBlC6OBw4?si=UOQ86u97CLp0BgJN
      for training models and voice change because its only support audio to audio
      and applio§.org for text to speech because it support so many accents and languages
      remove the § from urls , i post this comment 3 times and yt keep deleting it 😑

  • @limebulls
    @limebulls Před 21 dnem +6

    Can it speak german etc as well?

  • @Edward_ZS
    @Edward_ZS Před 20 dny

    Did anyone find a way to,select the voice?

  • @glikoz
    @glikoz Před 14 dny +1

    How about speech-to-text ?

  • @MLDQ
    @MLDQ Před 20 dny +1

    Is it better than realtimeTTS XTTS with coqui?

    • @AllAboutAI
      @AllAboutAI  Před 20 dny

      like i said in the vid, i dont think I could use this for real time s t s because of the compute time

  • @Tofu3435
    @Tofu3435 Před 20 dny

    I want to make an audiobook from light novels. My phones built in reader are too robotic, natural reader are needs internet access and too expensive. Maybe a program built on this can help.

  • @snatvb
    @snatvb Před 21 dnem +1

    similar to "bark" from suno
    interesting which is better and perfomance

    • @AllAboutAI
      @AllAboutAI  Před 20 dny +1

      noted, tnx :)

    • @christiandarkin
      @christiandarkin Před 20 dny

      bark's problem - when i tested it at least - was that it didn't always say what you told it to say. often it just made stuff up.

    • @snatvb
      @snatvb Před 20 dny

      @@christiandarkin I faced with this if I passed to long text

  • @NirmalEleQtra
    @NirmalEleQtra Před 19 dny +4

    Can we have an Indian English accent or any other Indian language dialect here? If yes, how can we do it?

  • @VaibhavShewale
    @VaibhavShewale Před 21 dnem +3

    cant set the specific speaker

  • @OliNorwell
    @OliNorwell Před 19 dny

    It's interesting how the best quality settings aren't used in the basic advanced demo... I thought those later samples were very good.
    Not sure how it got so many stars so quick on Github though, I mean, there are alternatives that are excellent that didn't grow so quick at all

    • @zachary3603
      @zachary3603 Před 19 dny

      What are the better alternatives everyone is using? Using Elevenlabs atm, but the speech comes out so robotic half the time.

  • @Project_SaveTheWorld
    @Project_SaveTheWorld Před 21 dnem +1

    It be nice if you could have it read Spanish. If it could, you'd pretty much have a translator.

  • @AEnoob
    @AEnoob Před 21 dnem +1

    i think there is a Text Seed that lets you choose your voice

  • @nexuslux
    @nexuslux Před 21 dnem

    What’s the best real time equivalent of this?

    • @Resursator
      @Resursator Před 21 dnem

      GPT 4o, probably.

    • @BORCHLEO
      @BORCHLEO Před 21 dnem +6

      coqui/tts with the xtts model

    • @NathanChambers
      @NathanChambers Před 21 dnem +1

      I've been really happy with alltalk_tts with deepspeed enabled. It's api worked good for my personal needs too. I'm no pro with tts stuff but it's been great for me. I have it reading/speaking in English, German, and Russian and does great. The only issues I've ever really had with it is if it's set in "ru" but reading "en" text.... punctuation can become demonic sounds :P It's Russian accent speaking English words is a real bonus to me too. :P

    • @AllAboutAI
      @AllAboutAI  Před 20 dny +1

      yeah xtts is great

  • @DihelsonMendonca
    @DihelsonMendonca Před 21 dnem +2

    No Brazilian Portuguese, unfortunately 😮😮

    • @AllAboutAI
      @AllAboutAI  Před 20 dny +2

      yeah only en and ch for now i think

  • @agatitytube
    @agatitytube Před 21 dnem +1

    I assume, it is available only in english, right?

  • @stanTrX
    @stanTrX Před 20 dny +1

    Does it speak Turkish?

    • @AllAboutAI
      @AllAboutAI  Před 20 dny

      i think its only english and chineese atm

  • @8eck
    @8eck Před 20 dny +1

    Don't see any weights on huggingface, so don't understand all the hype... But the code looks real.

  • @beliebigerusername
    @beliebigerusername Před 20 dny

    how can i train it a different language? or connect it to another model?

  • @kamalkamals
    @kamalkamals Před 20 dny

    but not support most popular languages

  • @dumbol8126
    @dumbol8126 Před 21 dnem +1

    chinese ml engineers are goated

    • @gustavheinrich5565
      @gustavheinrich5565 Před 16 dny

      Just make sure you're not falling for llms with Chinese propaganda and false information baked in, when using that stuff.

  • @threepe0
    @threepe0 Před 14 dny

    Turtle is better I think

  • @Ginto_O
    @Ginto_O Před 21 dnem

    Not good

  • @Sanguen666
    @Sanguen666 Před 10 dny

    tortoise is still better

  • @JNET_Reloaded
    @JNET_Reloaded Před 21 dnem +1

    nice can you add the requirements.txt file
    ChatTTS
    IPython
    requests
    openai
    torchaudio
    torch
    numpy
    omegaconf
    vocos
    vector_quantize_pytorch
    transformers

  • @kobe81
    @kobe81 Před 14 dny

    ChatTTS.model.gpt:Incomplete result. hit max_new_token: 384 too bad...