Why Fine Tuning is Dead w/Emmanuel Ameisen

  • Published 30. 06. 2024
  • Arguments for why fine-tuning has become less useful over time, plus some opinions about where the field is going, with Emmanuel Ameisen.
    This is a talk from Mastering LLMs: A survey course on applied topics for Large Language Models.
    More resources are available here:
    bit.ly/applied-llms
    00:00: Introduction and Background
    01:23: Disclaimers and Opinions
    01:53: Main Themes: Trends, Performance, and Difficulty
    02:53: Trends in Machine Learning
    03:16: Evolution of Machine Learning Practices
    06:03: The Rise of Large Language Models (LLMs)
    08:18: Embedding Models and Fine-Tuning
    11:17: Benchmarking Prompts vs. Fine-Tuning
    12:23: Fine-Tuning vs. RAG: A Comparative Analysis
    25:03: Adding Knowledge to Models
    33:14: Moving Targets: The Challenge of Fine-Tuning
    38:10: Essential ML Practices: Data and Engineering
    44:43: Trends in Model Prices and Context Sizes
    47:22: Future Prospects of Fine-Tuning

Comments • 6

  • @alaad1009 · 3 days ago

    Excellent conversation!!!

  • @darkmatter9583 · 1 day ago

    RAG, quantize data? Favorite LLM? HELP

  • @mrwhitecc · 3 days ago · +1

    I do not think he understands what happens to the model after fine-tuning. To give one example: if you have a unique reasoning pattern that no public pretraining dataset could plausibly contain, then SFT is the only way to make the model simulate the "reasoning" behavior you want. Prompt engineering does not help at all, and neither does RAG.
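
    A minimal sketch of the kind of SFT this comment describes, assuming a
    small Hugging Face causal LM; the model name and the toy "reasoning
    pattern" demonstrations are illustrative placeholders, not anything
    from the talk:

        import torch
        from transformers import AutoModelForCausalLM, AutoTokenizer

        model_name = "gpt2"  # assumption: any small causal LM works for the sketch
        tokenizer = AutoTokenizer.from_pretrained(model_name)
        tokenizer.pad_token = tokenizer.eos_token
        model = AutoModelForCausalLM.from_pretrained(model_name)

        # Hypothetical demonstrations of a reasoning pattern that public
        # pretraining data is assumed not to contain.
        examples = [
            "Q: step-sum 3 5 -> A: start at 3, then 3+5=8",
            "Q: step-sum 2 4 -> A: start at 2, then 2+4=6",
        ]

        optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
        model.train()
        for epoch in range(3):
            for text in examples:
                batch = tokenizer(text, return_tensors="pt")
                # Standard causal-LM loss: learn to reproduce each next token
                # of the demonstration, i.e. the desired behavior.
                loss = model(**batch, labels=batch["input_ids"]).loss
                loss.backward()
                optimizer.step()
                optimizer.zero_grad()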

  • @agenticmark · 3 days ago · +1

    You fine-tune for BEHAVIOR; you use RAG for DATA.
    Fine-tuning shapes how the model interacts with the user; RAG is how the model gets factual information. That is not the same thing as prompt engineering...
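
    A toy sketch of that split, with retrieval reduced to a keyword match
    (a real system would use an embedding index); the documents and names
    are made up for illustration:

        documents = {
            "pricing": "The Pro plan costs $20/month.",
            "support": "Support is available 9am-5pm UTC.",
        }

        def retrieve(query: str) -> str:
            # Toy retriever: return the document whose key appears in the query.
            for key, doc in documents.items():
                if key in query.lower():
                    return doc
            return ""

        def build_prompt(query: str) -> str:
            # Facts (DATA) arrive through retrieved context; tone and format
            # (BEHAVIOR) would come from instructions or fine-tuning.
            context = retrieve(query)
            return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

        print(build_prompt("What is the pricing?"))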

  • @agenticmark · 3 days ago

    Strange, prompt engineering over fine-tuning? If you don't want controllability, sure... Prompt engineering will disappear; fine-tuning will not.
    I train voice and chat models (fine-tuning), and I have trained dozens of agent foundation models that play Nintendo and Atari games, plus a bunch of classifiers. Training from scratch (foundation pretraining) is very, very costly. Fine-tuning is not.

  • @agenticmark · 3 days ago

    A _very_ unscientific claim about the lines on that chart. Try trading stocks with that mentality of guessing the line will just keep going up!