AniPortrait - AI Audio-Driven Synthesis of Portrait Animations - Local Install!

  • Added on 29 March 2024
  • Using Python and AI to animate images is always something I find fun, and there's a new kid on the block: AniPortrait! Using the power of Stable Diffusion, this repo can create animated avatars with fairly minimal effort. There have been quite a few research advances over the years, and this brings us a step closer. Want to install some of the latest cutting-edge AI research at home? Well, now you can, as this will run on consumer hardware too 😃 This step-by-step tutorial will guide you through getting it running locally, meaning you too can be on the cutting edge! Crack that Anaconda terminal open once again to begin your journey…
    Want to support the channel?
    / nerdyrodent
    AniPortrait - github.com/Zejun-Yang/AniPort...
    AniPortrait paper - arxiv.org/abs/2403.17694
    Installing Anaconda for MS Windows Beginners - • Anaconda - Python Inst...
    RVC WebUI - • RVC Web UI - FREE, Ope...
  • Science & Technology

Comments • 37

  • @vi6ddarkking 1 month ago +19

    Ok, so hear me out.
    This with SD3 models + Tavern AI text-to-speech with Llama 3 or Grok-derived models.
    2024 is shaping up to be a truly fun year.

    • @jackrabbit1704 1 month ago

      It'd likely take a NASA quantum supercomputer to run Stable Diffusion image prompts, a moderate LLM, RVC/Applio and then this, all at the same time. But yes, yes please.

    • @vi6ddarkking 1 month ago

      @jackrabbit1704 Not really. Anything above an RTX 5070 should be able to handle it comfortably, considering the Blackwell architecture and the improvements in AI efficiency we're seeing.

  • @kariannecrysler640 1 month ago +5

    Here comes Peter Cottontail, hopping down the bunny trail. Hippity hoppity, Nerdy's on his way! 🐭🐇

    • @NerdyRodent 1 month ago +1

      😀

    • @kariannecrysler640 1 month ago +2

      @NerdyRodent 🤘😉💕

    • @marilynlucas5128 1 month ago

      @NerdyRodent My suggestion is to try experimenting with Fourier transforms to eliminate the flickering in Stable Diffusion videos at the frequency level. Do you understand what I mean?

  • @Sol-zp6kc 1 month ago +4

    I wonder if we'll ever get something like LucidSonicDreams but with SD, that would be incredible!

  • @PrincessSleepyTV 1 month ago +2

    This is so cool!

    • @NerdyRodent 1 month ago +1

      Can’t wait for a few more papers down the line! 😉

  • @RhapsHayden 1 day ago

    Ugh, I forgot to create an environment and screwed up ComfyUI. Took me all day to fix it 😂. I'll try it again tomorrow because I need this.

  • @DeconvertedMan 1 month ago +4

    +1 points.

  • @UnchartedWorlds 1 month ago +5

    Fix those eyes to look directly into the camera and this is great!

    • @leavemealoneandgoaway 1 month ago +2

      there is software for that too

    • @marilynlucas5128 1 month ago

      @leavemealoneandgoaway What is that software? Descript?

    • @MrRaja 5 days ago +2

      If you have an Nvidia GPU, you can use Nvidia Broadcast on your camera; it has an AI function that locks your eyes to the camera. Even when you are reading something on your monitor or looking down at your keyboard to type, your video feed will show you staring at the camera at all times, as long as your eyes are reasonably visible (not hidden by bangs or too dark).

    • @marilynlucas5128 4 days ago

      @MrRaja Oh? How nice.

  • @user-rt6nk9sc4y 1 month ago

    Now this tool can make two hands, mouth, natural movements and realistic performances.

  • @pfbeast 1 month ago

    How do I use "InstructPix2Pix" and "SDXS" in ComfyUI?

  • @smtabatabaie 1 month ago +1

    How's the inference time with audio-driven mode? Is it near real time?

    • @NerdyRodent 1 month ago +2

      Not even close 😉

    • @smtabatabaie 1 month ago

      @NerdyRodent Thanks. Do you know of any talking-head tool with near real-time inference? Something like D-ID real-time avatars.

    • @NerdyRodent 1 month ago

      Where you can do your own custom stuff easily, locally and for free… not that I can think of 🫤

  • @wsx256 1 month ago +1

    Strabismus attack

  • @user-rt6nk9sc4y 1 month ago

    How long can the output video be? 5 minutes, or 10 or 30 minutes if we want?

    • @NerdyRodent 1 month ago

      Each video can be as long as you have the hardware for!

  • @user-rt6nk9sc4y 1 month ago

    And is there any tool that can clone a voice and train a voice? Thanks.

    • @NerdyRodent 1 month ago

      Do you mean like the example in this video?

  • @LouisGedo 1 month ago +1

    Hi

  • @sinayagubi8805 1 month ago

    We want a speaking rodent in the corner of your videos. Ahaha.

  • @el-_-grando-_-_-scabandri 1 month ago +1

    creepy

  • @LilShepherdBoy 1 month ago +3

    Now this is actually very cool. Must be great for people that want to do VTuber content but don't want to go through the whole rigmarole of setting one up.
    -
    Jesus Christ loves you 💙
    He has a plan and a purpose for your life, plans to prosper you and not to harm you, plans to give you hope and a future.
    Jesus Christ loves you 💙

  • @fernandodiaz8231 1 month ago

    Thank you for the information. I would like to ask if you know of any Colab options or a Kaggle notebook for AniPortrait.