Deepseek Coder vs CodeLlama vs Claude vs OpenAI

  • Published 12. 06. 2024
  • Trelis.com - for inference, fine-tuning and function-calling scripts.
    * Deepseek Inference *
    One-click template: runpod.io/gsc?template=51tpe9...
    TheBloke AWQ: huggingface.co/TheBloke/deeps...
    * Deepseek 1.3B, 6.7B and 33B function-calling models *
    huggingface.co/Trelis/deepsee...
    * Inference Guide *
    trelis.com/enterprise-server-...
    * Fine-tuning Scripts *
    trelis.com/advanced-fine-tuni...
    Full Repo Includes:
    - LLM Comparison Notebook from this video
    - Supervised fine-tuning
    - Unsupervised fine-tuning
    - Quantization scripts
    - Function calling / Structured response fine-tuning
    - Embeddings notebook
    OR buy only the LLM Comparison Notebook here: buy.stripe.com/5kAcNy8G12Hxg7...
    Chapters:
    0:00 Deepseek coder
    0:24 Agenda
    1:03 Model sizes and license
    2:02 Prompt format
    3:36 Inference on Runpod
    5:05 Performance vs CodeLlama, OpenAI, Claude
    7:04 Returning a sequence in reverse
    10:40 Passkey retrieval
    16:23 Website generation
    24:20 Function calling
    25:22 Resources
  • Science & Technology

Comments • 12

  • @enzocalzone5298 · 4 months ago

    Awesome! Thanks for the template

  • @vishalgoklani · 7 months ago +2

    I enjoyed the video, thanks for sharing. I'm curious: do you think the NF4 quantization tripped up some of your results? How do the results differ when using AWQ? What about running the full model in bfloat16?

    • @TrelisResearch · 7 months ago +1

      NF4 probably has some effect, although I used it for both models, so the relative effect is perhaps similar and the comparison is OK.
      AWQ typically performs a bit worse than NF4, but better than GPTQ. I think I discuss that a little in the AWQ video.
      Running in bfloat16 is definitely best; you can check out this paper for relative performance: arxiv.org/pdf/2305.14314.pdf
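      For reference, NF4 quantization in 🤗 Transformers is configured via bitsandbytes. A minimal sketch (the model id is illustrative, not necessarily the one used in the video; requires the `bitsandbytes` package and a GPU):

      ```python
      # Sketch: loading a causal LM with NF4 4-bit quantization via bitsandbytes.
      # Model id below is an example, not confirmed from the video.
      import torch
      from transformers import AutoModelForCausalLM, BitsAndBytesConfig

      nf4_config = BitsAndBytesConfig(
          load_in_4bit=True,
          bnb_4bit_quant_type="nf4",              # NF4 data type from the QLoRA paper
          bnb_4bit_compute_dtype=torch.bfloat16,  # compute in bfloat16
          bnb_4bit_use_double_quant=True,         # quantize the quantization constants
      )

      model = AutoModelForCausalLM.from_pretrained(
          "deepseek-ai/deepseek-coder-6.7b-instruct",
          quantization_config=nf4_config,
          device_map="auto",
      )
      ```

      Dropping `quantization_config` and passing `torch_dtype=torch.bfloat16` instead gives the full-precision baseline the reply recommends.
      
      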

  • @romanweilguny3415 · 7 months ago +1

    Interesting insights, thank you! At the moment it seems those big models are still very weak at certain tasks that look rather simple. I tried extracting info from table-like content in the prompt with GPT-4 and it mostly failed, even when the prompt wasn't long. This disappointed me, as I really like GPT-4's performance on many other tasks.

  • @othmanaljbory3649 · 7 months ago +1

    Can you help me solve a two-level programming model with constraints?

  • @MW-ez1mw · 3 months ago

    Hi @Trelis Research, thank you for the great video. Would you consider making a video on how to fine-tune Copilot-style coding LLMs? Thanks!

    • @TrelisResearch · 3 months ago

      Could you give more of an example of a specific fine-tune that would be helpful?

  • @DreamingConcepts · 6 months ago +1

    9:54: GPT-3.5 actually failed here; it skipped the "n".

    • @TrelisResearch · 6 months ago

      Oops, you're right. My brain is clearly a bad tool for grading.
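
      Programmatic checking avoids exactly this kind of manual-grading slip. A minimal sketch for the sequence-reversal test (the faulty output is fabricated for illustration):

      ```python
      def grade_reversal(original: str, model_output: str) -> bool:
          """Check that model_output is original reversed, character for character."""
          return model_output == original[::-1]

      original = "quantization"
      correct = original[::-1]                  # "noitazitnauq"
      dropped_n = correct.replace("n", "", 1)   # simulate a model skipping an "n"

      print(grade_reversal(original, correct))    # True
      print(grade_reversal(original, dropped_n))  # False
      ```

      An exact string comparison catches a single dropped character that is easy to miss when eyeballing two long reversed strings.
      
      
      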