From real to anime (with IPAdapter and ComfyUI)

  • Added 16 June 2024
  • Easy and effective way to apply the IPAdapter models to get a "realism" slider from cartoonish to ultra-realistic with everything in between. We talk about prompt engineering, upscaling and of course IPAdapter.
    IPAdapter: github.com/cubiq/ComfyUI_IPAd...
    CASharpening is part of: github.com/cubiq/ComfyUI_esse...
    You can download the workflow from here: pastebin.com/tppqDVUp
    00:00 Introduction
    00:24 Prompt Engineering
    03:34 IPAdapter
    07:16 Upscaling
    12:54 Sharpening
    14:25 SDXL
    17:24 AnimateDiff
    🎵 Background Music
    -- "Part A" by Alexander Nakarada (www.serpentsoundstudios.com)
    Licensed under Creative Commons BY Attribution 4.0 License
    -- "Nightmare City" Synthwave by Karl Casey @ White Bat Audio (whitebataudio.com/)
    -- "Menace" Synthwave by Karl Casey @ White Bat Audio

Comments • 71

  • @Danbearpig
    @Danbearpig 7 months ago +33

    These have been by far the most helpful and informative videos on Stable Diffusion I have found, please keep them coming! I appreciate that you don’t just show what to do, but explain why you are making specific choices and what happens if you choose differently. Thank you!

  • @rawkeh
    @rawkeh 7 months ago +7

    Your work with IPAdapter is astounding, man! Great job and keep it up!

  • @pedxing
    @pedxing 7 months ago +2

    the final animations are really quite charming. nice work!

  • @johnmcaleer6917
    @johnmcaleer6917 7 months ago +3

    Wonderfully clear and informative, please keep doing what you are doing...I love IPAdapter and Comfy and yours are the best vids on it I've found..👍🏻

  • @mr2ti41
    @mr2ti41 7 months ago +1

    Man keep up the great work! I switched over from A1111 and your videos are helping way more than what I learned from any A1111 videos!!

  • @ericren5390
    @ericren5390 3 months ago +2

    Another great video, as always it gave me new know-how to give me a deeper understanding of the drawing process. This is definitely worth watching again. Thank you a lot.

  • @Gabriecielo
    @Gabriecielo 6 months ago +2

    Great video explaining IPAdapter in more depth, you guys did a great job!!

  • @jepperisum
    @jepperisum 7 months ago +3

    Thanks for awesome tutorials! Like other people say, it is great to kind of watch over the shoulders of a real pro. So yes.. Please keep'em coming! :)

  • @LucasNoce
    @LucasNoce 7 months ago

    Thank you for your tutorials and AnimateDiff nodes! Cheers from South America!

  • @bobchang325
    @bobchang325 7 months ago

    Your tutorial inspired me so much, Thank you for these awesome works!

  • @ronnykhalil
    @ronnykhalil 7 months ago +1

    another gem! thank you. Looking forward to the next one :)

  • @enigmatic_e
    @enigmatic_e 7 months ago

    This is so awesome! Thank you for creating this!

  • @maxehrlich
    @maxehrlich 5 months ago

    Amazing! I can only hope to someday come close to this level of control and knowledge.

  • @Inner-Reflections-AI
    @Inner-Reflections-AI 7 months ago

    Loving your stuff as always!

  • @comfyuiadrian
    @comfyuiadrian 4 months ago

    Wonderful work again and I thank you!

  • @hakandurgut
    @hakandurgut 7 months ago +1

    Great video, please keep posting videos like this one

  • @vizsumit
    @vizsumit 6 months ago +1

    You are legend 💪
    Thanks for quality tuts

  • @PradeepKumar6
    @PradeepKumar6 4 months ago +1

    Again Excellent Video. Thanks very much.

  • @elan4912
    @elan4912 6 months ago

    Lots of things I have learned from your video. Thanks a lot.

  • @swannschilling474
    @swannschilling474 5 months ago

    Thanks a lot for this one!!! Your channel is awesome!! 🤩

  • @vinh-le
    @vinh-le 7 months ago

    Awesome tutorials!!!! Thank you very much.

  • @angelotsk3173
    @angelotsk3173 7 months ago

    As always, there is something to learn from your videos. Today it was Conditioning (concat). Thanks

  • @ysy69
    @ysy69 7 months ago

    Thank you for a show and tell tutorial.

  • @ALatentPlace
    @ALatentPlace 7 months ago +2

    Nice one. I really like the small details of how you work. Tweaking the CFG here, manipulating weights there, jumping between the schedulers and samplers... reminds me of trying not to stick to the kind of workflow you can get used to very fast because of your "daily work".
    If I wanna see more videos like this? Sorry, I don't get the question. 😉

  • @impactframes
    @impactframes 7 months ago

    fantastic channel learning a lot here

  • @reapicus557
    @reapicus557 2 months ago

    Your playlist of videos has helped me so much! I'm going to be coming back to these for reference for quite a while, and it'll be easy to recommend your videos to others. Getting into this SD and ComfyUI is so exciting. (>'.')👍

  • @hleet
    @hleet 7 months ago +1

    nice choice of music background

  • @aleksanderk5695
    @aleksanderk5695 7 months ago

    Great job, thank You!

  • @aa-xn5hc
    @aa-xn5hc 7 months ago

    Yes, thank you, more videos like this, please!

  • @Chad-xd3vr
    @Chad-xd3vr 4 months ago

    Yes please, more videos like this

  • @lbf5984
    @lbf5984 4 months ago

    I listened to this the first time and thought he was saying muse-killer... i had really bought in to it by the end. Muss-queue-lar.

  • @alexgilseg
    @alexgilseg 4 months ago

    thank you 🙏🙏

  •  2 months ago

    2 videos for the price of one!

  • @Iron-Painter
    @Iron-Painter 7 months ago

    Master

  • @_VFX_Ninja
    @_VFX_Ninja 6 months ago

    Your videos are fantastic! Question: how do I install your ComfyUI_essentials?

  • @drviolet396
    @drviolet396 4 months ago

    If you were to use KSamplerRave + AnimateDiff, would you connect the IPAdapter to the RAVE sampler or still to AnimateDiff?

  • @FieldMajor76
    @FieldMajor76 7 months ago +1

    Is there a node for multiple LoRAs with weights? Right now I have my Load Checkpoint going into a Load LoRA, but that handles just one at a time. Thank you

    • @latentvision
      @latentvision 7 months ago +1

      you can daisy chain loras

    • @hindihits9260
      @hindihits9260 7 months ago +1

      Use the Efficiency Nodes, they include a LoRA stacker node

    • @FieldMajor76
      @FieldMajor76 7 months ago

      @hindihits9260 Trying to install the motion LoRA that plugs into the AnimateDiff node today, for tilt, zoom and pan.

    • @nocturne3455
      @nocturne3455 7 months ago +1

      @hindihits9260 Note that the Efficient Loader seems to start on the wrong clip skip setting. The default is -1, but -2 is the actual baseline. You can test this yourself by removing the loader and generating a result with the same settings: your image will differ until you run the loader at -2. But otherwise yes, Efficient Loader is awesome

  • @kikoking5009
    @kikoking5009 16 days ago

    Hi,
    can I ask which use cases this workflow is for, exactly?
    You said it's for creating anime-like realistic illustrations, something in between.
    As far as I know, I can use a checkpoint to create those.
    I am not sure. Do you know any use cases that are only possible with this workflow?

  • @kakochka1
    @kakochka1 7 months ago

    Great tutorial as always! Could you explain what's the difference between muting and bypassing nodes? And what's the difference between concat and combine prompt nodes?

    • @latentvision
      @latentvision 7 months ago

      Muting stops the execution and the following nodes are just ignored. Bypassing just ignores the bypassed nodes and keeps going to the next ones.
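
The difference described in this reply can be illustrated with a toy chain of steps. This is only a sketch of the stated behaviour, assuming a simple linear chain; `run_chain` and the mode names are hypothetical and are not ComfyUI's real executor or API.

```python
# Toy model of the behaviour described above; not ComfyUI's real executor.

def run_chain(nodes, value):
    """Run (name, func, mode) steps in order; mode is 'active', 'bypass' or 'mute'."""
    executed = []
    for name, func, mode in nodes:
        if mode == "mute":
            # A muted node stops execution; everything after it is ignored.
            return None, executed
        if mode == "bypass":
            # A bypassed node is skipped and its input passes straight through.
            continue
        value = func(value)
        executed.append(name)
    return value, executed


def double(x):
    return x * 2


def add_one(x):
    return x + 1


bypassed = [("double", double, "active"),
            ("add_one", add_one, "bypass"),
            ("double_again", double, "active")]
muted = [("double", double, "active"),
         ("add_one", add_one, "mute"),
         ("double_again", double, "active")]

bypass_result, bypass_ran = run_chain(bypassed, 3)
mute_result, mute_ran = run_chain(muted, 3)
```

With the bypassed step the chain still reaches `double_again`; with the muted step nothing after it runs and there is no final result.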

    • @kakochka1
      @kakochka1 7 months ago

      @latentvision Thanks for the answer! And what about ConditioningConcat vs ConditioningCombine? When do you use one vs another?

    • @latentvision
      @latentvision 7 months ago

      @kakochka1 Combine merges the prompts in one vector. Concat keeps them separate. If you want less "bleeding" between the tokens use concat.

  • @alecubudulecu
    @alecubudulecu 7 months ago

    where did you get the image_encoder clip vision? mine doesn't automatically have it, and I couldn't find it on the git repo.....

  • @sairampv1
    @sairampv1 7 months ago

    can you create landscapes and buildings as well with this method?

  • @matyourin
    @matyourin 4 months ago

    Can you provide download links to the files you need? I cannot find them all via the Manager. On the IPAdapter page I found the models for the "Load IPAdapter Model" box, and I was also able to find the deliberate_v3 checkpoint model.
    But what is the VAE you are using to decode after the first KSampler? And in the "Load CLIP Vision" box, what model is that? Where can I get it? On

  • @bender203
    @bender203 3 months ago

    can you post the workflow as a json file? I don't know how to use the text download you linked to.

  • @aa-xn5hc
    @aa-xn5hc 7 months ago

    Amazing, thank you. What about a tutorial with only images as inputs and very little weight for the text? An image for the face, one for style, one for the body/content/background?

    • @latentvision
      @latentvision 7 months ago +1

      I'm prepping a video about that (not the next one, the one after)

  • @nodewizard
    @nodewizard 6 months ago

    Matteo, can you do a video tutorial on using IPA with multiple LoRAs? Thank you sir. Ciao

    • @latentvision
      @latentvision 6 months ago

      You can daisy chain LoRAs: Checkpoint > LoRA > LoRA > IPAdapter > KSampler. Simple as that :smile:
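
The daisy chain in this reply can be sketched as a ComfyUI API-format graph, where each node's inputs point at the previous node's outputs. This is a minimal sketch covering only the Checkpoint > LoRA > LoRA part; the helper `chain_loras`, the file names and the strengths are assumptions for illustration, not taken from the video or its workflow.

```python
# Sketch of an API-format node graph: Checkpoint > LoRA > LoRA.
# File names and strengths are hypothetical placeholders.

def chain_loras(loras, ckpt_name="checkpoint.safetensors"):
    """Return (graph, last_node_id) where each LoraLoader feeds the next one."""
    graph = {
        "1": {
            "class_type": "CheckpointLoaderSimple",
            "inputs": {"ckpt_name": ckpt_name},
        }
    }
    prev_id = "1"
    for offset, (lora_name, strength) in enumerate(loras, start=2):
        node_id = str(offset)
        graph[node_id] = {
            "class_type": "LoraLoader",
            "inputs": {
                "lora_name": lora_name,
                "strength_model": strength,
                "strength_clip": strength,
                # Output 0 of the previous node is MODEL, output 1 is CLIP.
                "model": [prev_id, 0],
                "clip": [prev_id, 1],
            },
        }
        prev_id = node_id
    return graph, prev_id


graph, last_id = chain_loras([
    ("style_a.safetensors", 0.8),
    ("style_b.safetensors", 0.5),
])
```

Following the order given in the reply, the IPAdapter node would then take its model input from `last_id`, and the KSampler would take its model from the IPAdapter node.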

  • @luiswebdev8292
    @luiswebdev8292 6 months ago

    What are your computer specs? It runs so much faster than mine...
    Also, I don't understand why you use a 4x upscaler and then downscale by 0.5. Why not simply use a 2x upscaler to begin with?

    • @latentvision
      @latentvision 6 months ago +1

      The video is sped up, sometimes even 10x; otherwise it would become a 5-hour video and would be terribly boring to watch :)
      Regarding the upscaler, I use the 4x because the one I like is better than many 2x models. But you are free to use whatever upscaler you want.
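
The arithmetic behind the question is simple: a 4x model followed by a 0.5 resize is a net 2x. A small sketch, where the 512x768 starting size is only an assumed example:

```python
def final_size(width, height, model_scale=4, resize_factor=0.5):
    """Size after running an upscale model and then resizing its output."""
    up_w, up_h = width * model_scale, height * model_scale
    return int(up_w * resize_factor), int(up_h * resize_factor)


# Hypothetical starting resolution.
size = final_size(512, 768)
```

The final size matches what a 2x model would produce; the difference the reply points to is in the quality of the upscale model itself, not in the output resolution.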

  • @sc0peAI
    @sc0peAI 7 months ago

    Hi, I am getting this error: "size mismatch for proj.weight: copying a param with shape torch.Size([3072, 1024]) from checkpoint, the shape in current model is torch.Size([3072, 1280])." Please help

    • @latentvision
      @latentvision 6 months ago

      Check the IPAdapter repository, there's a troubleshooting section at the bottom of the main page

  • @alpaykasal2902
    @alpaykasal2902 6 months ago

    Before you did a concat of the positive clips, you did not have a comma separating "perfect piercing blue eyes wasteland background highly detailed". I am pretty sure your carriage return does not separate those terms. Maybe I'm wrong. It might explain why your terms bled.

    • @latentvision
      @latentvision 6 months ago +1

      The concatenation happens at the tensor level. The two texts are concatenated after they are converted into a vector
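
A rough sketch of what "at the tensor level" means here: each prompt is encoded on its own, and only the resulting sequences of vectors are joined, so punctuation between the two texts plays no role. The encoder below is a hypothetical stand-in (one small vector per word), not a real text encoder, and the prompt split is assumed from the comment above.

```python
# Stand-in "embeddings": each prompt becomes a list of token vectors.

def fake_encode(text, dim=4):
    """Hypothetical encoder: one small vector per whitespace-separated word."""
    return [[float(len(word))] * dim for word in text.split()]


def concat_conditioning(a, b):
    """Join two already-encoded prompts along the token axis."""
    return a + b


first = fake_encode("perfect piercing blue eyes")
second = fake_encode("wasteland background highly detailed")
joined = concat_conditioning(first, second)
```

Each prompt is encoded without seeing the other one; the join only makes the sequence longer (here 4 + 4 token vectors) and leaves the vector width unchanged.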

  • @samsiteone
    @samsiteone 7 months ago

    How do you use that workflow text file?

    • @latentvision
      @latentvision 7 months ago +1

      the workflows are text files in json format. Just download (or copy/paste) the text and name the file: whatever.json
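
The step in this reply can also be scripted. A minimal sketch, assuming the workflow text has already been copied into a string; `save_workflow` is a hypothetical helper, and the pasted text is a placeholder rather than the actual workflow from the video.

```python
import json
import tempfile
from pathlib import Path


def save_workflow(text, path):
    """Check that the pasted text is valid JSON, then write it to a .json file."""
    data = json.loads(text)  # raises json.JSONDecodeError if the paste is incomplete
    path = Path(path)
    path.write_text(json.dumps(data, indent=2), encoding="utf-8")
    return path


# Placeholder for the text copied from the download page.
pasted = '{"nodes": [], "links": []}'
saved = save_workflow(pasted, Path(tempfile.gettempdir()) / "whatever.json")
```

Parsing before saving catches a truncated copy/paste early; the resulting file can then be loaded like any other workflow file.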

  • @user-rk3wy7bz8h
    @user-rk3wy7bz8h 2 months ago

    I need help, I get an error when working with an SDXL checkpoint. RuntimeError 3 K Sampler.
    It shows: Expected query, key, and value to have the same dtype, but got query.dtype: struct c10::Half key.dtype: float and value.dtype: float instead.
    I have a GTX 1080 and use the GPU. I want to continue using the GPU without getting the error.
    On CPU it works without error but takes hours to generate a simple IPAdapter workflow.
    I really need help, please.

    • @latentvision
      @latentvision 2 months ago

      Run ComfyUI with the --force-fp16 option

    • @user-rk3wy7bz8h
      @user-rk3wy7bz8h 2 months ago

      @latentvision Yes, it works. Thank you very much. You are very helpful, keep it up :)

  • @user-hv3ey2zg4i
    @user-hv3ey2zg4i 6 months ago

    What does it mean when an image is "burnt"?

    • @latentvision
      @latentvision 6 months ago

      the checkpoints are often "over trained", that leads the image to look over-saturated at medium/high CFG scale. When that happens we sometimes refer to that as "burnt image"
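
For context on why the CFG scale matters here, the usual classifier-free guidance formula pushes the prediction away from the unconditional result by a factor equal to the scale. The sketch below uses that standard formula with made-up two-element values; it is not taken from the video and does not model a real sampler or explain over-training itself.

```python
def guided(uncond, cond, cfg):
    """Classifier-free guidance: uncond + cfg * (cond - uncond), element-wise."""
    return [u + cfg * (c - u) for u, c in zip(uncond, cond)]


# Made-up values standing in for the two model predictions.
uncond = [0.0, 0.5]
cond = [0.5, 0.0]

low = guided(uncond, cond, 1.0)
high = guided(uncond, cond, 8.0)
```

At a scale of 1 the result equals the conditional prediction; at 8 the values land well outside the range of either input, which gives a feel for how a high scale can exaggerate an already strong bias in a checkpoint.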

    • @user-hv3ey2zg4i
      @user-hv3ey2zg4i 6 months ago

      @latentvision Thanks!

  • @grahamastor4194
    @grahamastor4194 2 months ago

    Great video, thank you. Looking for a strong female actress... Sigourney Weaver?

  • @luiswebdev8292
    @luiswebdev8292 6 months ago

    suggestion: remove the music for the tutorials as I might want to have my own music in the background at low volume

    • @latentvision
      @latentvision 6 months ago

      That's something I didn't think about. I do a lot of cuts in the audio and the voice goes a bit up and down; the background music helps merge it all together and adds a little consistency.