Napkin Math For Fine Tuning w/Johno Whitaker
- Published Jul 9, 2024
- We will show you how to build intuition around training performance with a focus on GPU-poor fine tuning.
This is a talk from Mastering LLMs: A survey course on applied topics for Large Language Models.
More resources available here:
parlance-labs.com/education/f...
00:00 Introduction
Johno introduces the topic "Napkin Math for Fine Tuning," aiming to answer common questions related to model training, especially for beginners in fine-tuning large existing models.
01:23 About Johno and AnswerAI
Johno shares his background and his work at AnswerAI, an applied R&D lab focusing on the societal benefits of AI.
03:18 Plan for the Talk
Johno outlines the structure of the talk, including objectives, running experiments, and live napkin math to estimate memory use.
04:40 Training and Fine-Tuning Loop
Description of the training loop: feeding data through a model, measuring accuracy, updating the model, and repeating the process.
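The loop described above can be sketched in a few lines. This is a toy illustration (a one-parameter model trained with plain gradient descent, not from the talk itself); real fine-tuning swaps in an LLM, a tokenized dataset, and an optimizer like AdamW, but the shape of the loop is the same.

```python
# Toy sketch of the train/fine-tune loop: feed data through a model,
# measure error, update the model, repeat. All names are illustrative.
data = [(1.0, 3.0), (2.0, 6.0), (3.0, 9.0)]  # (x, target) pairs; true w = 3

w = 0.0          # the model's single parameter: pred = w * x
lr = 0.05        # learning rate

for epoch in range(100):
    for x, target in data:
        pred = w * x           # 1) forward pass: feed data through the model
        err = pred - target    # 2) measure how wrong the prediction is
        grad = err * x         # 3) gradient of 0.5 * err**2 w.r.t. w
        w -= lr * grad         # 4) update the model
                               # 5) repeat

print(round(w, 3))  # converges toward 3.0
```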
09:05 Hardware Considerations
Discussion on the different hardware components (CPU, GPU, RAM) and how they affect training performance.
12:28 Tricks for Efficient Training
Overview of various techniques to optimize training efficiency, including LoRA, quantization, and CPU offloading.
13:12 Full Fine-Tuning
Describes the parameters and memory required for full fine-tuning.
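The napkin math for full fine-tuning can be sketched as a one-line calculation. The helper name and defaults below are illustrative assumptions: fp16/bf16 weights and gradients (2 bytes each per parameter) plus Adam's two fp32 optimizer states (8 bytes per parameter), ignoring activations entirely.

```python
def full_finetune_gib(n_params, weight_bytes=2, grad_bytes=2, optim_bytes=8):
    """Rough memory floor for full fine-tuning, ignoring activations.

    Defaults assume fp16/bf16 weights + grads and Adam's two fp32
    states per parameter (hypothetical helper, illustrative defaults).
    """
    total_bytes = n_params * (weight_bytes + grad_bytes + optim_bytes)
    return total_bytes / 2**30  # bytes -> GiB

# A 7B-parameter model needs roughly 78 GiB before activations --
# far beyond a single 24 GB consumer GPU.
print(f"{full_finetune_gib(7e9):.1f} GiB")
```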
18:14 LoRA
Detailed explanation of full fine-tuning versus parameter-efficient fine-tuning techniques like LoRA.
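The parameter savings from LoRA follow directly from its low-rank factorization: instead of training a full d_in x d_out weight update, you train two small factors A (d_in x r) and B (r x d_out). A quick illustrative calculation (the 4096 hidden size is a Llama-7B-style assumption, not a figure from the talk):

```python
def lora_params(d_in, d_out, rank):
    # A LoRA adapter replaces a full d_in x d_out weight update with
    # two low-rank factors: A (d_in x rank) and B (rank x d_out).
    return rank * (d_in + d_out)

d = 4096                          # assumed hidden size (Llama-7B-style layer)
full = d * d                      # params in one full square weight matrix
lora = lora_params(d, d, rank=8)  # trainable params for the same layer
print(full, lora, full / lora)    # rank 8 gives 256x fewer trainable params
```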
21:04 Quantization and Memory Savings
Discussion on quantization methods to reduce memory usage and enable training of larger models.
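Quantization's memory savings are simple arithmetic: weight storage scales linearly with bits per parameter. A minimal sketch for the weights of a 7B model (illustrative helper, ignoring quantization metadata like block-wise scales):

```python
def weight_gib(n_params, bits):
    # bytes = params * bits / 8; GiB = bytes / 2**30
    return n_params * bits / 8 / 2**30

n = 7e9  # a 7B-parameter model
for bits in (16, 8, 4):
    # 16-bit: ~13 GiB, 8-bit: ~6.5 GiB, 4-bit: ~3.3 GiB
    print(f"{bits:>2}-bit weights: {weight_gib(n, bits):.1f} GiB")
```

Halving the bits halves the weight footprint, which is what lets a 7B model's weights fit comfortably on a consumer GPU at 4 bits.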
23:10 Combining Techniques
Combining different techniques like quantization and LoRA to maximize training efficiency.
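Combining the two techniques multiplies the savings, QLoRA-style: the base weights are quantized to 4 bits and frozen, so gradients and optimizer states only exist for the small LoRA adapters. A napkin estimate under assumed numbers (7B base model, ~40M total adapter parameters at a low rank; both figures are illustrative, not from the talk):

```python
GIB = 2**30

base_params = 7e9          # frozen base model
adapter_params = 40e6      # assumed total LoRA adapter params

base = base_params * 0.5 / GIB       # 4-bit frozen weights (0.5 bytes/param)
adapters = adapter_params * 2 / GIB  # fp16 adapter weights
grads = adapter_params * 2 / GIB     # gradients only for the adapters
optim = adapter_params * 8 / GIB     # Adam fp32 states only for the adapters

total = base + adapters + grads + optim
print(f"~{total:.1f} GiB before activations")  # vs ~78 GiB for full fine-tuning
```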
22:55 Running Experiments
Importance of running controlled experiments to understand the impact of various training parameters.
25:46 CPU Offloading
How CPU offloading works and the tradeoffs.
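The core tradeoff is bandwidth: anything offloaded to CPU RAM must cross the PCIe bus to be used, and transfer time is bounded by bus throughput. A back-of-envelope sketch with illustrative numbers (real bandwidth varies by PCIe generation and system):

```python
# Napkin math on the offloading tradeoff (illustrative numbers).
weights_gb = 14        # fp16 weights of a 7B model
pcie_gb_per_s = 32     # rough PCIe 4.0 x16 throughput assumption

transfer_s = weights_gb / pcie_gb_per_s
print(f"~{transfer_s:.2f} s per full copy of the weights over the bus")
```

Fractions of a second per transfer sounds small, but repeated every step it can dominate step time, which is why offloading trades speed for the ability to fit larger models at all.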
28:31 Real-World Example
Demo of memory optimization and problem-solving during model training, with code. This also includes pragmatic ways to profile your code.
45:44 Case Study: QLoRA + FSDP
Discussion of QLoRA combined with FSDP, along with the tradeoffs involved.
54:25 Recap / Conclusion
Johno summarizes the key points of his talk.