Updated AI Audiobook Maker Installation and Bug Fixes

Sdílet
Vložit
  • čas přidán 6. 09. 2024
  • Links referenced in the video:
    Audiobook Maker - github.com/Jar...
    AI Voice Cloning V3 - • AI Voice Cloning v3 Pa...
    Get RVC Voices - • How to Get AI Voice Mo...
    RVC Playlist - • RVC (Retrieval-based V...
    Hardware for my PC:
    Graphics Card - amzn.to/3pcREux
    CPU - amzn.to/43O66Ir
    Cooler - amzn.to/3p98TwX
    RAM - amzn.to/3NBAsIq
    SSD Storage - amzn.to/42NgMFR
    Power Supply (PSU) - amzn.to/430bIhy
    PC Case - amzn.to/447499T
    Mother Board - amzn.to/3CziMXI
    Alternative prebuilds to my PC:
    Corsair Vengeance i7400 - amzn.to/3p64r22
    MSI MPG Velox - amzn.to/42MnJHl
    Cheapest and PC recommended:
    Cyberpower 3060 - amzn.to/3XjtZoP
    Come join The Learning Journey!
    Discord - / discord
    Github - github.com/Jar...
    TikTok - / jarodsjourney
    If you found anything helpful, please consider supporting me and the content I am trying to produce!
    www.buymeacoff...

Komentáře • 50

  • @Dreamcatcher659
    @Dreamcatcher659 Před 2 měsíci +7

    This guy has single handedly allowed me to work on projects and progress my life and hobbies in a way I NEVER would have been able to do without. Thank you sir!

    • @user-ff2ed6dh8x
      @user-ff2ed6dh8x Před měsícem +1

      great to know. but what is your hobby?, what do you do?

  • @Public-Freakout
    @Public-Freakout Před 2 měsíci +4

    Is there a way to download pretrained tortois tts models so that we can just plug them in ?

  • @Mowgi
    @Mowgi Před 2 měsíci +2

    Love your work, Jarod ❤

  • @BryanFaustino-hy8om
    @BryanFaustino-hy8om Před 2 měsíci

    Hiya! And thank you for your wonderful work!
    Just wanted to say, I upgraded to this version and noticed an odd bug where the generated json will remove all periods except those next to a quotation mark. This causes about half the generated audio to hang at the end of a sentence or mispronounce the final word and wasn't an issue in the previous version. The solution so far is to just manually add back in the missing periods if the audio is bad and regenerate.

  • @Paul_Wasson
    @Paul_Wasson Před 2 měsíci +1

    The hero we don't deserve!

  • @daryladhityahenry
    @daryladhityahenry Před 2 měsíci +1

    Hi Jarod. I want to ask something. Why you choose to still use TortoiseTTS + RVC instead of StyleTTS2?
    From performance and quality wise, isn't StyleTTS2 better? Or is there anything else you consider?
    Thankss

  • @TheFlexxoo
    @TheFlexxoo Před 2 měsíci

    @Jarods_Journey Have you considered working on the code to enable it to run in the background or multithreaded? That way, it won't freeze the program when you regenerate the audio.

    • @Jarods_Journey
      @Jarods_Journey  Před 2 měsíci +1

      I left that in there as you technically shouldn't be doing anything while it's generating audio, it's similar to how I grey out buttons when the main generation block is running. Never went back to make it unblocking

    • @TheFlexxoo
      @TheFlexxoo Před 2 měsíci

      @Jarods_Journey is never a good practice to have a gui freeze like that, it will be better disable the rest of the buttons or add a please wait message box, by the way good work in all you do, just an opinion, thanks.

  • @MiniFireball
    @MiniFireball Před měsícem

    What files do I need to move around to use a voice I trained on the web guide for the audio books?

  • @Melike-oh1ir
    @Melike-oh1ir Před 23 dny

    The paus doesn't seem to work, everything else is fine. Any clue as to why?

  • @mauricio9581
    @mauricio9581 Před měsícem

    How do you create an audiobook with multiple different voices?

  • @aa-xn5hc
    @aa-xn5hc Před měsícem

    Why not combining to M4B ? thank you very much for the update!

  • @donmarshal2070
    @donmarshal2070 Před 2 měsíci

    Can you integrate pronounciation correction in this (like Balabolka).🧐
    If this frontend utility become utilisable like balabolka, [per word realtime pronounciation check, IPA pronounciation/editing, assign shortforms for sentence (AI for Artificial Intelligence)] and inbuilt text editor with pronounciation dictionary.
    Man, No kidding, I'll become member for lifetime 🙏🏻

  • @Mowgi
    @Mowgi Před 2 měsíci

    I've been having a lot of fun with udio lately. Tried it? I've put in some old unfinished music of mine and it's really surprised me with ways to move the song forward.

  • @elliotnewton4937
    @elliotnewton4937 Před 2 měsíci

    Thanks for this Jarod :). Will there ever be a working model of your tortoise build set up in Google Colab? I really like the way it runs, but I simply don't have the processing power. Many thanks for everything you do.

  • @NFawc
    @NFawc Před 14 dny

    RVC makes things sound worse??
    Just having a play with this, and the voice pitches are raised/messed around with from the source version from Tortoise? How can you stop the output from being manipulated by Audiobook from what Tortoise generates? ie: The WAV file in the Tortoise results folder sounds fine. But in the Audiobooks output folder, the WAV has been manipulated and sounds worse :(
    Is it RVC messing it up?

    • @NFawc
      @NFawc Před 14 dny

      Yes! The RVC manipulation of the WAV files makes them sound worse. I've stopped it from working by overwriting the RVC output with the original file and it sounds much better:-
      orig_audio_path = audio_path
      audio_path = rvc_convert(model_path=voice_model_path,
      f0_up_key=f0_pitch,
      resample_sr=0,
      file_index=voice_index_path,
      index_rate=index_rate,
      input_path=audio_path)
      shutil.copy2(orig_audio_path, audio_path) # LOSE RVC

  • @09jake12
    @09jake12 Před 2 měsíci

    how do I make tort stick with one voice instead of doing it randomly? I don't want to make my own voice.

  • @dolboeb-tz4bw
    @dolboeb-tz4bw Před 2 měsíci

    Please review IMS Toucan

  • @giovannif2567
    @giovannif2567 Před 2 měsíci

    So useful! Thank you ❤🎉

  • @wnrandom98
    @wnrandom98 Před 2 měsíci

    this is awesome! thank you

  • @dougmaisner
    @dougmaisner Před 2 měsíci

    crazy good update!

  • @yoru2530
    @yoru2530 Před 2 měsíci

    It work but a bit hard on my machine. I wonder if there's a way to run my own audio through SVC and record it. Like playing an audio and run it through svc like you do with mic and record it.

  • @keisaboru1155
    @keisaboru1155 Před 2 měsíci +2

    wait is this compatible with rvc ?

    • @Jarods_Journey
      @Jarods_Journey  Před 2 měsíci +3

      Yes, it uses RVC models to convert the TTS output from tortoise.

  • @rickyparker2943
    @rickyparker2943 Před 2 měsíci

    What do I do with this error?
    UnicodeDecodeError: 'utf-8' codec can't decode byte 0x93 in position 69: invalid start byte

  • @duahauvn
    @duahauvn Před 2 měsíci

    How to create subtitle for converted audio as .srt file please? Is there any ways to convert audio + subtitle files? Thank you.

  • @Artholos
    @Artholos Před 2 měsíci

    Absolute hero 🎉

  • @vidneypopples
    @vidneypopples Před měsícem

    I'm getting
    Error: [WinError 10061] No connection could be made because the target machine actively refused it, retrying... (1/3)
    Error: [WinError 10061] No connection could be made because the target machine actively refused it, retrying... (2/3)
    Error: [WinError 10061] No connection could be made because the target machine actively refused it, retrying... (3/3)
    When trying to select the text file & then start audiobook generation

  • @tylerchambliss8379
    @tylerchambliss8379 Před 2 měsíci

    Is there a way you can improve the UI to be accessible with screen readers for the blind? The web based gradio interfaces are great but this is almost unusable. As of now if I wanted to make a book I'd have to manually record each sentence from Tortoise in my digital audio workstation sentence by sentence because Tortoise has voice glitches you are all too aware of when using fine tuned models. I hope Style TTS 2 will finally be able to replace Tortoise and sound just as good. Obviously I'd rather something local be as good as Chat GPT4O's voice or 11 labs but I know that's going to be a few years down the line. You know Tortoise would be really great if it wasn't susceptible to the voice glitches.

  • @tempertephra
    @tempertephra Před 2 měsíci

    well done!

  • @nggaknyangka7688
    @nggaknyangka7688 Před 2 měsíci

    hello brother, I always get info can't access localhost :7860 (HTTP 1.1 404 Not Found), when activating start_package so I can't access data in the audio book maker.

  • @zonas7915
    @zonas7915 Před 2 měsíci

    Hello Jarod, did you take a look at seed tts?

    • @Jarods_Journey
      @Jarods_Journey  Před 2 měsíci +2

      Yes, it's really really good. However, ain't no way bytedance is releasing their models 😂

  • @JonnyD000
    @JonnyD000 Před 11 dny

    Would love a linux version

  • @titolindj
    @titolindj Před 2 měsíci

    Will it work with other languages?

  • @tnewstamil
    @tnewstamil Před 2 měsíci

    levenlabs altrenative create sir please

  • @heckyes
    @heckyes Před 2 měsíci

    Windows only?

  • @bill4320
    @bill4320 Před 2 měsíci +1

    Linux please