Best Practices For Fine Tuning Mistral

  • Added 25 Jul 2024
  • Sophia Yang discusses best practices for fine-tuning Mistral models. We will cover topics like: (1) the permissive Mistral ToS and how it is well suited to fine-tuning smaller models from bigger ones, (2) how people should collect data, (3) domain-specific evals, (4) use cases & examples, and (5) common mistakes.
    This is a talk from Mastering LLMs: A survey course on applied topics for Large Language Models.
    For more info and resources related to this talk, see: parlance-labs.com/talks/fine_...
    My personal site: hamel.dev/
    My twitter: x.com/HamelHusain
    Parlance Labs: parlance-labs.com/
    00:00 Introduction
    Sophia Yang introduces herself and provides an overview of the talk, which will cover Mistral models, their fine-tuning API, and demos.
    00:35 Mistral's History and Model Offerings
    Sophia discusses Mistral's history, from their founding to the release of various models, including open-source and enterprise-grade models, as well as specialized models like Codestral.
    02:52 Customization and Fine-Tuning
    Mistral recently released a fine-tuning codebase and API, allowing users to customize their models using LoRA fine-tuning. Sophia compares the performance of LoRA fine-tuning to full fine-tuning.
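    The LoRA comparison can be made concrete with a small sketch. This is not Mistral's fine-tuning code; it is a minimal, self-contained illustration of the LoRA idea in plain PyTorch, and the class name `LoRALinear`, the rank `r`, and the scaling `alpha` are illustrative assumptions, not values from the talk.

```python
# Minimal LoRA-style adapter in plain PyTorch -- an illustrative sketch only,
# not Mistral's fine-tuning code. Names (LoRALinear, r, alpha) are assumptions.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wraps a frozen linear layer with a trainable low-rank update W x + (alpha/r) * B A x."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():   # freeze the original weights
            p.requires_grad = False
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)  # low-rank factor A
        self.B = nn.Parameter(torch.zeros(base.out_features, r))        # low-rank factor B, init to zero
        self.scale = alpha / r

    def forward(self, x):
        # Frozen path plus the low-rank correction; only A and B receive gradients.
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

# Usage: wrap attention/MLP projections with LoRALinear and train only A and B.
layer = LoRALinear(nn.Linear(4096, 4096), r=8, alpha=16)
out = layer(torch.randn(2, 4096))
print(out.shape)  # torch.Size([2, 4096])
```

    Only the low-rank factors A and B are trained, which is why LoRA fine-tuning needs far less memory and compute than updating every weight of the base model, at the cost of the small quality gap versus full fine-tuning that Sophia discusses.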
    04:22 Prompting vs. Fine-Tuning
    Sophia discusses the advantages and use cases for prompting and fine-tuning, emphasizing the importance of considering prompting before fine-tuning for specific tasks.
    05:35 Fine-Tuning Demos
    Sophia demonstrates how to use fine-tuned models shared by colleagues, as well as models fine-tuned on specific datasets like research paper abstracts and medical chatbots.
    10:57 Developer Examples and Real-World Use Cases
    Sophia showcases real-world examples of startups and developers using Mistral's fine-tuning API for various applications, such as information retrieval, medical domain, and legal co-pilots.
    12:09 Using Mistral's Fine-Tuning API
    Sophia walks through an end-to-end example of using Mistral's Fine-Tuning API on a custom dataset, including data preparation, uploading, creating fine-tuning jobs, and using the fine-tuned model.
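    The end-to-end flow described above can be sketched roughly as follows. Endpoint paths, field names, hyperparameters, and the base-model id are assumptions based on Mistral's public API docs around the time of the talk and should be checked against the current reference; the training example content is a placeholder.

```python
# Sketch of the end-to-end flow: prepare a chat-format JSONL file, upload it,
# create a fine-tuning job, then query the resulting model.
# Endpoint paths, field names, and the model id are assumptions -- verify
# against Mistral's current API reference before use.
import json
import os
import requests

API = "https://api.mistral.ai/v1"
HEADERS = {"Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}"}

# 1) Data preparation: one JSON object per line, each a list of chat messages.
examples = [
    {"messages": [
        {"role": "user", "content": "Summarize this abstract: ..."},
        {"role": "assistant", "content": "The paper shows ..."},
    ]},
]
with open("train.jsonl", "w") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")

# 2) Upload the training file.
with open("train.jsonl", "rb") as f:
    uploaded = requests.post(
        f"{API}/files",
        headers=HEADERS,
        files={"file": ("train.jsonl", f)},
        data={"purpose": "fine-tune"},
    ).json()

# 3) Create a fine-tuning job referencing the uploaded file.
job = requests.post(
    f"{API}/fine_tuning/jobs",
    headers={**HEADERS, "Content-Type": "application/json"},
    json={
        "model": "open-mistral-7b",                    # assumed base-model id
        "training_files": [uploaded["id"]],
        "hyperparameters": {"training_steps": 100, "learning_rate": 1e-4},
    },
).json()

# 4) Check the job, then chat with the fine-tuned model once it has finished.
status = requests.get(f"{API}/fine_tuning/jobs/{job['id']}", headers=HEADERS).json()
if status.get("status") == "SUCCESS":
    reply = requests.post(
        f"{API}/chat/completions",
        headers={**HEADERS, "Content-Type": "application/json"},
        json={
            "model": status["fine_tuned_model"],
            "messages": [{"role": "user", "content": "Summarize this abstract: ..."}],
        },
    ).json()
    print(reply["choices"][0]["message"]["content"])
```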
    19:10 Open-Source Fine-Tuning with Mistral
    Sophia demonstrates how to fine-tune Mistral models using their open-source codebase, including installing dependencies, preparing data, and running the training process locally.
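    Before running the open-source trainer locally, the data has to be in the chat-style JSONL layout it consumes. The sketch below shows a plausible local preparation step; the source CSV, its column names, the output paths, and the 95/5 split are all hypothetical, and the exact schema and training command (for example, a torchrun invocation with a YAML config) come from the mistral-finetune repository's README.

```python
# Hypothetical local data-prep step before running Mistral's open-source
# fine-tuning code: convert prompt/response pairs into chat-style JSONL and
# split off a small eval set. Column names, paths, and the split are assumptions.
import json
import os
import random

import pandas as pd

df = pd.read_csv("support_tickets.csv")  # assumed columns: "question", "answer"

records = [
    {"messages": [
        {"role": "user", "content": row.question},
        {"role": "assistant", "content": row.answer},
    ]}
    for row in df.itertuples(index=False)
]

random.seed(0)
random.shuffle(records)
split = int(0.95 * len(records))

os.makedirs("data", exist_ok=True)
for path, chunk in [("data/train.jsonl", records[:split]),
                    ("data/eval.jsonl", records[split:])]:
    with open(path, "w") as f:
        for rec in chunk:
            f.write(json.dumps(rec) + "\n")

print(f"wrote {split} training and {len(records) - split} eval examples")
```

    Keeping a small held-out eval file makes it easy to track validation loss during the run, which complements the domain-specific evals mentioned earlier in the talk.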