The Open Source KING is BACK. Stability's NEW AI Image Generator!
- Added June 3, 2024
- Stable Cascade is an image generation model that can create new variations of an image while maintaining its style and composition. It is a text-to-image model that is fast and high quality.
▼ Link(s) From Today’s Video:
Stable Cascade Github: github.com/Stability-AI/Stabl...
Thibaud's Twitter post: / 1757370745900937441
Stable Cascade 1 Click Launcher: / 1757457604781978091
Try Stable Cascade for free: t.co/eychPLlXNS
► MattVidPro Discord: / discord
► Follow Me on Twitter: / mattvidpro
-------------------------------------------------
▼ Extra Links of Interest:
✩ AI LINKS MASTER LIST: www.futurepedia.io/
✩ General AI Playlist: • General MattVidPro AI ...
✩ AI I use to edit videos: www.descript.com/?lmref=nA4fDg
✩ Instagram: mattvidpro
✩ Tiktok: tiktok.com/@mattvidpro
✩ Second Channel: / @matt_pie
-------------------------------------------------
Thanks for watching Matt Video Productions! I make all sorts of videos here on YouTube! Technology, tutorials, and reviews! Enjoy your stay here, and subscribe!
All Suggestions, Thoughts And Comments Are Greatly Appreciated… Because I Actually Read Them.
-------------------------------------------------
► Business Contact: MattVidProSecond@gmail.com - Science & Technology
Thanks for covering our work, thrilled to see how our research gets adopted this way. Also, I still find it hilarious that "Würstchen" stuck as the name of our architecture. Sorry in advance for all non-German speakers who break their tongues while trying to pronounce it.
Wow, never knew a tongue could be broken... must be a bony tongue
I'll just call it Worse Ten.
Small sausage?
@@CryptoTonight9393 Yeah, small sausage... that's the translation. It's actually hard to find an English sequence of characters that sounds remotely like "Würstchen"... the "ü" being an umlaut of "u" which isn't used in English, and the "ch" is a single phoneme as well... imagine "k" but speaking it softly... in a way "ch" would be to "k" what "f" is to "p".
The biggest problem I have is duplicating a face I've created in different poses. It's infuriating.
Great video as always, Matt. Very happy to see this new model. I got my first job using stable diffusion and video diffusion 1.1 last week. Very happy to see the new model.
Trying to wrap my head around how it can get a 1024x1024 image from 24x24 o_o
I really REALLY want to see Stability's models pull ahead of the competition soon! I hope the (supposedly) easier training times can allow Stable Cascade to reach Midjourney's level of detail somehow.
It probably can, this is only the base model, it is very general so it can probably do a lot better than SDXL when finetuned, and SDXL can achieve Midjourney level of detail in some circumstances (like in Fooocus using certain styles and settings).
Reminds one of the quants.
Just wait till they figure out how to encode the image in subpixels 😂
1024x1024 encoded to 0.2 x 0.2 pixels
@@kuromiLayfe You can actually escape the pigeonhole limit by just setting the font size to 0.
Lol
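Since the thread above is puzzling over the 1024x1024 → 24x24 figure, here is a quick back-of-envelope sanity check. The numbers assume the 24x24 Stage C latent quoted for Stable Cascade and the 8x-per-side VAE downsampling used by SD 1.5/SDXL; note the latent also carries more channels than RGB, so the detail isn't literally squeezed into 576 pixels — the prior plus the decoder stages reconstruct it.

```python
# Back-of-envelope numbers for the 1024x1024 -> 24x24 compression
# discussed above. The ~42x figure is per spatial side.

side_px, side_latent = 1024, 24
spatial_factor = side_px / side_latent      # ~42.7x per side
area_factor = spatial_factor ** 2           # ~1820x fewer spatial positions

# For comparison, SD 1.5 / SDXL VAEs downsample 8x per side (1024 -> 128).
sdxl_factor = 1024 / 128                    # 8.0

print(f"per-side compression: {spatial_factor:.1f}x (vs {sdxl_factor:.0f}x for SDXL)")
print(f"latent positions: {side_latent ** 2} vs {128 ** 2} for SDXL")
```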
"Würstchen" is german and the translation could be "small sausage" 😂
ah..
Always funny to hear ü, ä and ö in English. In Poland it's easier to make a "smaller" version of a word, like wódka is small woda (water)
I watched this whole thing mainly because of Matt saying "Würstchen" multiple times throughout this video 😁
Haha
I translated it from german, and it translated to 'hot dog'
15:20 even though no mustache, there's something about the quality that's really soothingly satisfying I think!
My boy!!!!!! What's good Matt! Just been sick recently and I've been away from YT as usual. I'm here now though; it's looking like an amazing video and I can't wait to get my popcorn and watch
Even the text kerning was basically perfect. 😯
As a German speaker that's a really funny architecture name, literally just means sausage 😅
Sausage AI™
Haha
*The trivialization of sausage, to be more precise.
I used to work at a german pub called Wurst. Closed during the pandemic.
King's back.
Emperor Pigeon is back (me)
@@SW-fh7heit's subjective
@@SW-fh7hestop boofing monster energy
@@hipjoeroflmto4764 what do you mean?
The king never left. 😅
Thanks for these videos! I learn so much from them, keep it up!
Würstchen is pronounced Vürst-yen. V as in view, ü like the u in lurk, st as in stash and yen like the currency.
Americans never give a fuck about how names and words from other languages are pronounced.
I think this is more focused on efficiency and speed, which means things like animation and video (using similar methods) is going to be much more realistic. As currently the static models are being sort of shoehorned into animation workflows.
Their video is insanely realistic. Been beta testing it for a few days already.
Anyone else get the feeling that we're hitting diminishing returns with what's possible using the current NN architectures?
Yes. But I think there is a clear movement of capital and intelligence towards advancement in other areas of AI
Other archs have been researched, code released, work is happening on them. Transformers may get left behind eventually, this ride still has a long way to go.
@@blakecasimir Right, I agree. It's just a bummer that we may see another protracted plateau before getting something genuinely revolutionary to use within a commercial context (i.e better than humans). The Transformer arch is so close and yet so far away.
@@GearForTheYear You are right in terms of image fidelity/aesthetics. It won't get any better than Midjourney v6. However, prompt understanding and following is still not optimal. DALL-E 3 shows that it can be much better still. The problem is the training data. They lack more concepts than they provide. You can't create truly creative images because, for example, there is no training example of a horse riding a human, so it can't do it at all.
It's not just a limitation of the architecture. A lot of it stems from the limitations of our language itself. We train and guide these models using natural language, but words aren't sufficient for pinpointing the exact image you're looking for. One picture is worth more than a thousand words, and using just a few sentences as a prompt will only get you a general image that could look okay, but not exactly what you want down to the nuance. Even if AI becomes smarter than humans, it still can't read your mind and has only your words to go off of. Words carry too low a bandwidth of information, and the only breakthrough I can think of is when we're able to upload our minds and thoughts directly to the AI.
Sick! Been hoping they'd come out with something to compete with Midjourney and Dall-E. I love Dall-E 3, but I get so tired of getting "prompt blocked" with prompts that have nothing offensive or copyrighted in them. Wasn't aware of Pinokio either, so I'm excited to give that a try. Thank you!
Good Job Matt!! Truly Exciting... We need more competition so, the subscription price will go Lower! ✨✨🤟✨✨
I just did a quick text test. Wow, perfect on the first one, but then not so great on the follow ups.
Just awesome.
I kinda lost interest in text-to-image for a while. It isn't reliable enough to use in commercial applications yet (imo), and it didn't feel as competitive as text gen where almost every week there was news.
Nice to see open source text-to-image making progress towards catching up to the state of the art in this field.
Open-source isn't catching up with gpt-4, gpt-4 is still costly, gpt-5 tier doesn't exist. Overall, pretty meh too.
Matt I just had or still have covid need to retest but this video made me feel good
Perhaps, but image generators use convolutional neural networks, while Transformers are for sequential data such as text. So I assume huge improvements will be realized with both types of models and whatever improvements are made to them. It may seem more subtle because they are already great, but they will be faster, more controllable, more efficient, and integrated into useful apps.
I was starting to lose hope, but here they are! And with a focus on cost efficiency too! I hope it has backwards compatibility with 1.5. I have way too many loras of it stored up.
All loras are tightly coupled with base models, nothing will be compatible with sd 1.5 ever.
i would love to make consistent 16-bit style video game character sprite sheets
This model is non-commercial but if you want to make free games...
Nah, I don't care for non-commercial; it's more of a personal project to achieve. Go have a look at the WWF Royal Rumble sprite sheets, for example: one sheet that's of one character, walking, running, jumping, punching, kicking, etc.
Awesome, glad to see SD keeping up. 1.5 is still relevant thanks to the community; I hope to see something like this treated the same way.
This came at a time we needed it most
It's an interesting concern, especially with the rapid evolution in AI. While Transformers have indeed been groundbreaking, the tech field's nature is to innovate continuously. Who knows, the next big breakthrough could be just around the corner, rendering today's limitations a thing of the past.
This was something I looked for a few days ago, since I am tired of SDXL being pretty bad compared to Dalle and Midjourney. Especially SDXL's extremely deformed hands and feet. So I checked Stability for news and saw nothing. Then your news dropped. Thanks. I just got excited about open source AI again.
Don't get your hopes up. This is not the model that will rival MJ. The next one probably will (but MJ will have already released v7 by then).
I can’t wait to see what the trained models of Cascade end up producing later. Heck I say later but someone will probably have trained model by end of week or something with the current pace of things lol.
Mage and Leonardo will probably implement this model soon as possible.
I have actually been beta testing their video generation, which is absolutely amazing compared to anybody else even Pika. I also was able to ask for extra credits and they gave them to me because of the project that I’m doing with their video so I’m super excited.
Updated Forge UI is out too!!!
Well, idk what that is, so yes, Matt should make a video
I thought you meant a new update, with the ControlNet fixes but it's the one that's been out a few days. 😞
Which one is forge? Hard to keep up. Not sure i have used it.
@@abandonedmuse Search for SD Webui Forge.
Matt, I absolutely adore all your videos, but 42 is not orders and orders of magnitude greater than 8, it is barely half an order of magnitude!
That's more than two orders of magnitude in binary though.
This comment is half an order of magnitude more accurate than the subject matter!
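For anyone wanting to check the thread's arithmetic, here is the order-of-magnitude comparison of the per-side compression figures mentioned above (42x for Stable Cascade vs 8x for the SDXL VAE):

```python
import math

# 42x vs 8x per-side compression, expressed in orders of magnitude.
ratio = 42 / 8                        # 5.25x

decimal_orders = math.log10(ratio)    # ~0.72 -> under one order in base 10
binary_orders = math.log2(ratio)      # ~2.39 -> just over two "orders" in base 2

print(f"base-10: {decimal_orders:.2f} orders of magnitude")
print(f"base-2:  {binary_orders:.2f} orders of magnitude")
```

So the difference is about 0.72 of an order of magnitude in base 10 (a bit more than the "barely half" above), and the base-2 reply checks out at just over two.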
11:50 it's easier to finetune this way than starting from a model biased towards photorealism
Does this mean it will require less VRAM to use? My 3070 struggles with SDXL without setting up various parameters and such to make it work and then it takes a pretty long time to generate an image.
I've read something on reddit about needing more instead
I think it's pretty much the same amount. The concept is similar to running a workflow in Comfy that generates an image at 256x256, then does image-to-image with an upscale to 1024x1024, and then runs once more to detail the final sampler output.
Are the images commercial free to use?
Thanks for sharing!
With the same prompting, you can get better images (not definitive testing, just a couple of tests) than SDXL (NightVision XL), the images have a HDR midjourney look to them.
It will be better than Midjourney. 16x training performance + open source = magic
Wow, and in Pinokio already??? Love that!
Wow. Never heard of this before.
@@jeffwads I think he made a video about it... pinokio allows you to run AI tools on your PC without the hassle of installing complicated stuff, it's truly gamechanging. But you'll need a good GPU with a lot of vram (I went "cheap" by buying a used 1080ti, and 11gb of vram seems to be enough for what I do... for now).
Nice
Got it running on Windows (command line). It has to be possible to make it run in Comfy, but it would take some work.
Elon needs to take over !
"Robin Rombach, Andreas Blattmann, and Dominik Lorenz essentially created Stable Diffusion while at a German university. Stability AI got involved after the publication of their research and offered them the company’s computing resources. According to Forbes, all three have now left Stability AI which is also experiencing cash flow problems."
- Petapixel
What specs should a PC have to run an SD model relatively fast? Is it all about the graphics card?
Happy Valentines day 💓
bro this is crazy, looks like it'll blow midjourney out of the water once it gets in the hands of opensource trainers for a few more months down the line.
Exciting news!!
I'm sorry to say, but with the endless possibilities now available with Midjourney's --sref feature, I think they ran away with the crown. What's possible now is absolutely mindblowing.
Can it handle compound nouns yet? How about magnet fishing, for example?
Looks a lot better.
Hey, Matt. Do you know any A.I. that makes Cinemagraphs?
I think "Imagen 2" can do that.
I am curious, why did it take so long to implement the Würstchen tech? The actual people behind Würstchen showed this last year.
There's a way easier way to do this. You just loop a clip the length of each notes phase. You do this and extend the loop out till it merges back in and you do this for all of the notes then you ctr+j to consolidate it.
From my testing, SDXL Turbo is utter garbage 💩 🤮.
I'm looking forward to Cascade
I didn't like it either, although I really tried.
Garbage how? It just needs tweaking to reach its potential.
@@aouyiu The quality of the images is like that of Midjourney 2 based on my testing... utter garbage
Hi Matt, you can test the LLaVA 1.6 34B demo, an LLM vision assistant.
Not sure if I'm just spoiled by community-finetuned SDXL models and Fooocus, but I'm not terribly impressed by what I've seen so far. But then again I was initially underwhelmed by SDXL as well.
What keeps me interested is the possibility of much more efficient finetuning compared to SDXL, but it might take a while for tooling and fine-tuned models to become available/usable.
Of course when I just uninstalled Pinokio to make room for more checkpoint models! lol Hope someone ports it to Comfy in the next few days!
Interestingly, at 11:07 when the picture of Barack Obama comes together, at times it looks a bit like Alfred E. Neuman from the Mad magazine.
I think you don't realize: this means open source totally won today. Just need to do this with language models too.
You haven’t seen anything yet :)
Meta might get us that, maybe sooner than you think now that Gemini is officially competing with ChatGPT.
Miqu is getting there... It's not gpt4 level but it's definitely better than 3.5 all around, nearly as good as Gemini Ultra... And it's 70B 😂 It's coming!
I really hope that playground AI picks this up.
I also feel mppy inside lol
The Stable Zero123 model still has, and Stable Video Diffusion had, the same limited licence during its experimental phase.
So nothing new here.
Still being vigilant is always the way to go.
Do we have any idea based on past experience how long that licence will be limited? Are we talking weeks? Months? Over a year? 😮
@@starblaiz1986 Once Version 1.0 releases usually it bounces to the new fully open source licence.
This video makes me happy for the future.
I am hyped!
Can this model be used in Automatic1111?
How many free prompts in a day do you get in the free plan of stable cascade?
Always appreciate your being on the cutting edge of OS reporting, Matt.
Yoo, this is so exciting, I love open source :D
unfortunately it takes like 30 minutes to generate a photo locally on my 3060 with pinokio
xD
Updated pinokio now it takes like 15 minutes
Why does it take Stable Cascade several minutes to generate an image with my RTX 3060 12GB? No problems with Stable Diffusion etc.
Tried it, but idk, DALL-E 3 gives me a lot more specific and better results
My honest reaction was: "Oh no..." 🤣
I'm really trying to catch up with everything, but oh boy, it's hard
Soon, in SD5: "For my kids, remake this folder of movies to take out all the non-wholesome parts."
For example, in Bambi the mother doesn't die, no one is in mortal danger, and they all meet happily in the end. In The Lion King, Mufasa and Scar are good friends and Simba is raised with his dad. Ariel doesn't lose her voice. Remove the nightmare fuel from Pinocchio and Dumbo, etc. etc.
Generate new wholesome scenes, keep characters and style as the originals, voice with 11Labs.
We will actually be able to give nice content to our kids, without passing any horror from the hydra studios.
wow, just wow
You mention Krea, and Krea uses SDXL under the hood, so I wonder if you have found a way to get Krea or Magnific results but for free using comfy or a1111? I actually wonder how come no one is even trying to do it……anyways, great video!
Where can we use this?
The question I have, is, as always, how does it handle censorship? What happens if you give it a prompt that many AIs will label as NSFW, and will not render?
It seems to just ignore those parts of the prompt. I couldn’t even get two mechs to shoot at each other.
God: "walter white eating a big mac inside of mcdonalds, there are blue crystals in the big mac burger, walter white is dressed in a yellow hazmat suit"
Dall-E: "Even though I am just a tool and don't have a soul; I will pretend I have one. Therefore, I cannot do what my master commanded me to create, even though I'm fully capable of doing the job."
God: "Kicks Dall-E from the heavens; Downloads Stable Cascade!"
OH my god...
Talk about seeing something unexpected when opening YouTube
Curious why they didn't show benchmarks against MJ
Just tried it. Not a full test, but generating text seems OK
try photo taken on Fujifilm XT3
Honestly, I have a really interesting Question @mattvidpro. What is the relation between You and Lemon?
stability ai are the best!
Würstchen? Um… little sausage? Hot dog?
It's a bit slow: one minute 40 seconds on a 3060 Ti. But as you said, it's FREE.
Can this run in Forge WebUI?
2:39 the images have been (image: UD, LR (UD 3, LR - { } 2, 5),
Close, but not even 1 order of magnitude, 1 if we round up.
People: 1980: we will have flying ca-
*literally 2024:*
niccee, going to check right now!
Stability AI is cool....
Open Source FTW.
Open Source means everyone is a winner.
Is it available to use right now ?
Yes!
Nightshade is coming.
I'm running it locally and it's far slower than sdxl for some reason, the web demo works better. Also the results are clearly inferior to dall e 3 so there must be some setting I'm missing. I'd say one can skip it until it's in the hands of someone that can run it to satisfactory levels
sadly not as good as dall-e 3 but... it's a huge improvement. prompting is so manual compared to DALL-E 3 lol
It's over for Midjourney and OpenAI.
It's crazy though, OpenAI just released Sora yesterday, way ahead of anyone else on AI video
Requires 20gigs of VRAM though. That will eliminate most people.
As a german, I have to admit, they did y'all dirty by calling an international used software (or at least part of it) "Würstchen" 😂😂😂 ... It means small sausage if someone is wondering.
Creating something from nothing with spells... is it Harry Potter in real life? It's magic!
I tried Stable Video Diffusion and it blew chunks... I went back to using Pika.
And Pika is really... not... great.
All of AI video is still in the early stages, like ChatGPT 1 stages. It will be where images are now, in a few years. Maybe sooner.
What are the hardware requirements for running it locally?
coo 💀