NEW WizardCoder 15b - The Best Open-Source Coding Model?
- Added 19 Jun 2023
- In this video, we review WizardLM's WizardCoder, a new model specifically trained to be a coding assistant. It's completely open-source and can be installed locally. Let's test it out!
Enjoy :)
Join My Newsletter for Regular AI Updates 👇🏼
www.matthewberman.com
Need AI Consulting? ✅
forwardfuture.ai/
Rent a GPU (MassedCompute) 🚀
bit.ly/matthew-berman-youtube
USE CODE "MatthewBerman" for 50% discount
My Links 🔗
👉🏻 Subscribe: / @matthew_berman
👉🏻 Twitter: / matthewberman
👉🏻 Discord: / discord
👉🏻 Patreon: / matthewberman
Media/Sponsorship Inquiries 📈
bit.ly/44TC45V
Links:
TheBloke's Model - huggingface.co/TheBloke/Wizar...
WizardCoder HF - huggingface.co/WizardLM/Wizar...
TextGen WebUI - github.com/oobabooga/text-gen...
How to Install TextGen - • How To Install TextGen...
It's channels like this that make paying for YouTube Premium worth it! All of this amazing content without a commercial, and you still get paid in the process. Mad love from New Orleans
It will generate a snake game if you specify that it should use the pygame library. I noticed a lot of models default to using turtle, and even ChatGPT often fails when trying to use turtle.
As always: Outstanding! Thanks for sharing it!
Great information as usual!
It's impressive how much more performant it is at coding compared to a normal 15b model.
Imagine how incredible it would be if GPT 3.5 or GPT 4 (which are already great at coding) were finetuned like this.
If such very large models could be fine-tuned for coding with a similar relative jump in performance, I think it would create enormous economic value even if the result didn't generalize.
For real though. When they open the GPT-4 Code Interpreter to all GPT Plus users, it's gonna be massive.
I'm excited to see what IBM has... A leak there would be amazing.
Isn't that what Copilot does? GPT-4 fine-tuned for coding.
"Imagine how incredible it would be if GPT 3.5 or GPT 4 (which are already great at coding) were finetuned like this."
They very well may be. For all we know, ChatGPT automatically switches between multiple fine-tuned models based on the type of query the user is asking. Seems like a no-brainer type of method to increase performance.
@@MoeShlomo GPT-4 is too slow for that to be true. However, that's a very economical approach: model balancing, lol, the equivalent of load balancing but for AI models.
Great Stuff. Thank you!
Let's see Paul Allen's Coding LLM
What?
@@matthew_berman It's just a meme lol
@@SinanAkkoyun i got it dw
@@matthew_berman You should be highlighting that guanaco 33B is the best open-source coding LLM
Look at that subtle, off-white colouring…
I would love to see this against Orca whenever it's released.
dude these tools are coming out faster than I can install them on my computer!!!
hey brother - love ur videos; just curious which mic u use
The great thing about local models is, you can work without an internet connection, which is really helpful in those situations where your provider is having issues.
Sadly people don't seem to get how beneficial and POSSIBLE it is to have local models. Everyone's all about the "google colab." I just skip colab tuts now. It's just not what I'm looking for/what I want from AI...
Imagine not even NEEDING or WANTING internet... Doubt that is what the media companies want.
@@VioFax I don't have a good PC, so Google Colab is my only choice; not everyone has an RTX
@@VioFax Thing is, you move the cost of buying and operating hardware to the cloud computing provider.
So as much as I would love to have a personal, private AI, I can't come close to the performance and possibilities that huge multi-billion-dollar companies can offer me for pennies in comparison.
It might change, however, with the new AI GPU boom that is taking over Nvidia and AMD as well. Hopefully we get to see some affordable AI hardware to run private (offline) LLMs.
@@Greenmarty I really wonder what GPUs will be like in the not so distant future, like imagine in 2026 we get 1TB GPU cards and then the price starts to drop for other cards. Like a 24GB GPU card might be really cheap then.
@@ZeroIQ2 For now, an RTX 4090, RTX 3090 Ti, RTX 3090, or Titan RTX can do for 7B-13B models, but they are still expensive. Having seen expensive hardware become affordable many times over the years, I don't see why this wouldn't happen again. And there's always the aftermarket.
The im_sep is a token from the prompt template; you can change the prompt template to match the model in your chat app. I prefer LM Studio myself.
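To illustrate the point about matching the template to the model: WizardCoder's model card uses an Alpaca-style instruction template, sketched below. (The exact wording should be taken from the model card; stray tokens like im_sep in the output usually mean your chat app is applying a template from a different chat format.)

```python
# Sketch: wrapping a user request in an Alpaca-style prompt template,
# the format WizardCoder-style models are trained on. If the template
# doesn't match the model, you get stray separator tokens in the output.

TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)

def build_prompt(instruction: str) -> str:
    """Fill the template with the user's request."""
    return TEMPLATE.format(instruction=instruction)

print(build_prompt("Write a Python function that reverses a string."))
```

Most local front-ends (text-generation-webui, LM Studio) expose this template as a setting, so the fix is a configuration change, not a different model.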
The 2048-token limit is baked into the model. text-generation-webui has a config file you can edit, but that wouldn't change the fact that the model can't go beyond it. So asking it for a snake game is silly; the code would never fit. It isn't a limitation of the model's "intelligence" but of the architecture.
You could try asking for minimised code, meaning it doesn't use full names for values, doesn't use newlines, etc.
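A rough back-of-the-envelope check of that context-limit argument, assuming ~4 characters per token (a common rule of thumb; the real count depends on the model's tokenizer):

```python
# Rough check of whether a prompt plus a reply budget fits in a model's
# 2048-token context window. Assumes ~4 characters per token, a common
# rule of thumb; the real count depends on the model's tokenizer.

CONTEXT_LIMIT = 2048   # tokens (WizardCoder's window)
CHARS_PER_TOKEN = 4    # rough heuristic, not exact

def estimate_tokens(text: str) -> int:
    """Crude token estimate from character count."""
    return max(1, len(text) // CHARS_PER_TOKEN)

def fits_in_context(prompt: str, reply_budget: int) -> bool:
    """True if the prompt plus the reply budget stays within the window."""
    return estimate_tokens(prompt) + reply_budget <= CONTEXT_LIMIT

prompt = "Write the game snake in Python using pygame."
# A ~300-line game at roughly 10 tokens per line needs ~3000 tokens,
# well over the 2048-token window:
print(fits_in_context(prompt, 3000))  # False
```

This is why "minimised" code (short names, no blank lines) helps: it lowers the reply budget, not the model's ability.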
When will we get to play with that shiny new Orca model?
3:55 What LLM coding tool operates directly in VS Code (for example) so you DON'T have to "go back and forth" as you mention?
Copilot?
Looking for tools that work better than ChatGPT for Unity/C#
Would this be a good candidate?
Given a natural language description, it can generate code. But can it do the opposite - given a piece of code, can it describe in layman terms what that code does? What are the best LLMs for this code explaining task?
That's a very good question which is definitely worth exploring with some tests.
Which software were you using to access open-source models? I'm using GPT4All, and I can see code models on Hugging Face, but they don't seem to be in the correct format; it requires .bin files.
Any good models that actually execute Python code and can self-correct?
Is it better than plain, clean GPT-4?
How about using it with GPT Engineer?
Why?
What's the difference between textgen web ui and oobabooga?
Is this better than guanaco 33b or Falcon?
What sort of consumer GPU is really needed to run a model like this? Do I need a 24GB beast, or can I run on something a little more practical for home/hobby use?
It's an AI with 15B parameters quantized to 4 bits per weight, so it can fit on a 12 GB GPU.
@@rom100main Thanks that's helpful.
I just installed it and it is giving me terrible answers, like complete nonsense. I think I messed something up, or it doesn't run properly on a GTX 1080.
Is Wizard coder suited for C/C++, too?
I have an RTX 2060 (6 GB) and an i7-10700 with 32 GB of RAM. How large a model can I run on my PC?
What is GPTQ by TheBloke?
Has anyone tried Tree of thought prompts with this?
People like me
AMD GPUs and GGML don't run on text-generation-webui, and it's not running on GPT4All either.
Any chance AMD GPUs will even be supported in the future? 😢
Yea, I keep reading that AMD GPUs aren't supported. Sorry :(
so gpt engineer is still better?
Is it better than Replit's AI coder?
I wonder how it performs at competitive programming.
Test it out and release a blog post or video on the topic.
"Just a Little Bit Longer" in the Training, and ChatGPT 4 and 5 might be taking the Back Seat with this type of progress.
What a Time for Tech !!!
You must be exhausted from the ....
"Thrill of the Chase"
Just when you thought you caught something, you let it go to Chase after something even Better.
It's a Marathon at this point on who can bring the most Powerful tools to the Social Market and have the most up to date solution for the future. We gotta be getting close to the Grand Finale right ? 😅
What are the local hardware requirements?
A 15-billion-parameter model at 4 bits per parameter works out to 6.98 GiB of VRAM for the weights (the formula is parameters × bits per parameter ÷ 8 bytes).
If it isn't better than GPT 3.5 it's really not that good. An improvement yes, but what's the point?
I don't think it's better than guanaco 33B at coding.
Unfortunately I don't have the right hardware for the situation... otherwise it would be my daily "pleasure" to test new AIs.
TheBloke rocks; he is orders of magnitude more important for AI progress than Ilya, who gets all the praise now. Can't stand that, because he is just a footnote in the grand scheme. Without Hinton, Google, and M$ he is nothing, while TheBloke needs no one.
Sorry, but that `format_number` (the final code test) is nowhere near a 7/10; for one, it's just a function. I'd give it a 2/10. The real problem I find LLMs having is compound logic, or making a complete project (like snake). Nothing against you, I love your videos and info! In my experience, using AI as a code assistant is frustrating and more pain than it's worth. That said, I do find it's very good at assisted code reviews, summarizing, planning (steps), static unit tests, and boilerplate stuff. It'll be awesome to see where it goes. Best!
If it's not better than GPT-3.5, then it's garbage, because GPT-3.5 is garbage for coding. GPT-4 is very useful for coding though; it's night and day compared to GPT-3.5.
A more powerful computer is needed and training must be of better quality.
Get some sleep.
That's just how I look lol
This is clickbait. You confirmed in your video GPT3.5 and above is much better. So watch it with your titles dude! You should also declare you were approached by Python Principles to inject a useless piece of information in this video. Not related at all. You are too sneaky!