Coding Llama 3 from scratch in PyTorch - Part 1
- Published May 5, 2024
- In this video series, you will learn how to train and fine-tune the Llama 3 model from scratch.
The goal is to code Llama 3 from scratch in PyTorch and create models with 3B, 6B, 35B, and 45B parameters. In this first video, you'll learn about upcycling, downcycling, and infini-attention (illustrative sketches below).
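As a taste of the downcycling idea (initializing a smaller model from a subset of a larger dense checkpoint's layers, as studied in the "Pre-training Small Base LMs with Fewer Tokens" paper listed below), here is a minimal, hypothetical PyTorch sketch. The checkpoint name, the `downcycle` helper, and the keep-the-first-half layer choice are illustrative assumptions, not the exact code from the video.

```python
# Hypothetical depth-downcycling sketch: build a smaller Llama-style model
# by copying a subset of transformer blocks from a larger dense checkpoint.
import torch
from transformers import LlamaConfig, LlamaForCausalLM

def downcycle(src: LlamaForCausalLM, keep_layer_ids: list[int]) -> LlamaForCausalLM:
    # Same config as the source, but with fewer transformer layers.
    cfg = LlamaConfig(**src.config.to_dict())
    cfg.num_hidden_layers = len(keep_layer_ids)
    dst = LlamaForCausalLM(cfg)

    # Copy embeddings, final norm, and LM head directly.
    dst.model.embed_tokens.load_state_dict(src.model.embed_tokens.state_dict())
    dst.model.norm.load_state_dict(src.model.norm.state_dict())
    dst.lm_head.load_state_dict(src.lm_head.state_dict())

    # Copy only the selected transformer blocks, preserving their order.
    for dst_idx, src_idx in enumerate(keep_layer_ids):
        dst.model.layers[dst_idx].load_state_dict(
            src.model.layers[src_idx].state_dict()
        )
    return dst

# Usage (checkpoint name is an assumption; any Llama-style model works):
# src = LlamaForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
# small = downcycle(src, list(range(src.config.num_hidden_layers // 2)))
```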
📄 Papers:
- Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints: arxiv.org/abs/2212.05055
- Pre-training Small Base LMs with Fewer Tokens: arxiv.org/abs/2404.08634
- Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention: arxiv.org/abs/2404.07143
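Since the Infini-attention paper is the densest of the three, here is a single-head sketch of how I read its core segment loop: a linear-attention readout from a compressive memory, a gated mix with local causal attention, then a simple linear memory update (the paper also describes a delta-rule variant). The shapes and names are illustrative assumptions, not the paper's reference code.

```python
import torch
import torch.nn.functional as F

def sigma(x):
    # Nonlinearity used for the linear-attention memory: ELU + 1.
    return F.elu(x) + 1.0

def infini_attention_segment(q, k, v, M, z, beta):
    """One segment of single-head Infini-attention (illustrative shapes).

    q, k, v: (seg_len, d)  queries/keys/values for this segment
    M: (d, d)              compressive memory matrix
    z: (d,)                memory normalization term
    beta: scalar tensor    learned gate between memory and local attention
    """
    seg_len, d = q.shape

    # 1) Read from the compressive memory (linear-attention retrieval).
    sq = sigma(q)
    a_mem = (sq @ M) / (sq @ z).clamp_min(1e-6).unsqueeze(-1)

    # 2) Standard causal softmax attention within the segment.
    scores = (q @ k.T) / d ** 0.5
    causal = torch.triu(torch.ones(seg_len, seg_len, dtype=torch.bool), 1)
    a_local = scores.masked_fill(causal, float("-inf")).softmax(-1) @ v

    # 3) Gate the memory readout against the local attention output.
    g = torch.sigmoid(beta)
    out = g * a_mem + (1.0 - g) * a_local

    # 4) Write this segment's keys/values into memory for future segments.
    sk = sigma(k)
    M = M + sk.T @ v
    z = z + sk.sum(0)
    return out, M, z

# Usage on two consecutive segments with a shared, growing memory:
d, L = 64, 16
M, z = torch.zeros(d, d), torch.zeros(d)
beta = torch.tensor(0.0)
for _ in range(2):
    q, k, v = torch.randn(L, d), torch.randn(L, d), torch.randn(L, d)
    out, M, z = infini_attention_segment(q, k, v, M, z, beta)
```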
💻 To follow along, you can use this Colab notebook:
- github.com/Blaizzy/Coding-LLM...
🎥 Coding Llama 2 from scratch video series
Part 1: czcams.com/users/liveXHmag4damTg
Part 2: czcams.com/users/liveLSWDpFmbE90
Part 3: Coding Llama 2 from sc...
This is a very thoughtful and great initiative! Researchers with enough gray matter but limited means can still be in the game. Thank you, PC!
Most welcome!
It's my pleasure :)
I lived through this so others donât have to.
This is very impressive and great content. Thank you!
You're very welcome!
Super impressive. Great value!
One question: how do I further train the model on my custom content instead of using LoRA? Can we do further full training of it to add new knowledge?
Most welcome!
You can do that, but it can be very expensive.
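To make the trade-off concrete, here is a minimal, hypothetical sketch of continued full-parameter training (every weight gets gradients, unlike LoRA, which trains small adapters on top of frozen base weights). The checkpoint name, corpus, and hyperparameters are placeholders; at 8B scale this generally needs multiple GPUs or offloading, which is where the cost comes from.

```python
# Hypothetical continued full-parameter training on custom text.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "meta-llama/Meta-Llama-3-8B"  # assumed checkpoint; swap in your own
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name, torch_dtype=torch.bfloat16)
model.train()

# All parameters are trainable; LoRA would instead freeze these and
# learn low-rank adapters on top.
optim = torch.optim.AdamW(model.parameters(), lr=1e-5)

texts = ["your custom document here..."]  # placeholder corpus
for text in texts:
    batch = tok(text, return_tensors="pt", truncation=True, max_length=1024)
    out = model(**batch, labels=batch["input_ids"])  # causal LM loss
    out.loss.backward()
    optim.step()
    optim.zero_grad()
```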
Bro, how did you train Llama 3 without a paper?
Could you elaborate?