HuggingFace Fundamentals with LLM's such as TInyLlama and Mistral 7B

How AI 'Understands' Images (CLIP) - Computerphile

why llama-3-8B is 8 billion parameters instead of 7?

Harley Quinn's revenge plan！！！#Harley Quinn #joker

Can This Bubble Save My Life? 😱

NEJRYCHLEJŠÍ Střela v Historii FOTBALU…

Inside the LLM: Visualizing the Embeddings Layer of Mistral-7B and Gemma-2B

Chris Hay

zhlédnutí 5 900

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 22. 08. 2024
We look deep into the AI and look at how the embeddings layer of a Large Language Model such as Mistral-7B and Gemma-2B actually works.
You will learn how tokens and embeddings work and even extract out and load the embeddings layer from Gemma and Mistral into your own simple model, which we will use to visualize the model
You will see how an AI clusters terms together and how it can cluster similar words, build connections which cover not just similar words but also grouping of concepts such as colors, hotel chains, programming terms.
If you really want to understand how an LLM's works or even build your own LLM then starting with the first layer of a Generative AI model is the best place to start.
Github
-----------
github.com/chr...

Komentáře • 33

@chrishayuk Před 5 měsíci ⁺²
this is the github repo: github.com/chrishayuk/embeddings
@guaranamedia Před 2 měsíci ⁺¹
Excellent explanation. Thanks for making these examples.
@chrishayuk Před 2 měsíci
You're very welcome!
@sumandawnmobile Před 5 měsíci ⁺³
Its an great video to understand the internals via the visualization. Thanks Chris.
@rajneesh31 Před 2 měsíci ⁺¹
Damn, thank you CZcams for recommending this channel. @chrishayuk is a gun. Thanks Chris
@chrishayuk Před 2 měsíci
Very kind, glad you like the channel
@NERDDISCO Před 5 měsíci ⁺⁴
This came to the absolute right time! Thank you very much! I was just trying to understand this. Now I know how it works ❤
@chrishayuk Před 5 měsíci ⁺¹
Glad it was helpful!
@scitechtalktv9742 Před 5 měsíci ⁺³
Fantastic video !
I am wondering: I think it would also be very interesting to also be able have a visualization of not only the static embeddings you already did, but also a visualization of the so-called contextualized embeddings in a later layer of the model! These are the embeddings that are exposed to the attention mechanism. That why they are also called dynamic embeddings.
It adds another layer of abstraction, but are better embeddings because they are able to distinguish between homonyms: words that are the same but have completely other meanings if used in another context. A good example is the word “bank”, that has several different meanings when used in another context (for example financial institution or river bank and several other meanings! ). As a consequence the word “bank” will be represented by several different vectors in embedding space, depending on the context it is used in!
This technique is called Word Sense Disambiguation (WSD).
Would it be possible to visualize that too? I am curious….
@chrishayuk Před 5 měsíci ⁺¹
yep, you got what i'm doing... i'm literally walking the stack
@chrishayuk Před 5 měsíci ⁺²
so those videos will be coming
@scitechtalktv9742 Před 5 měsíci ⁺¹
@@chrishayukFantastic ! Those embeddings are crucially important for the workings of Large Language Models !
@johntdavies Před 5 měsíci ⁺²
Great insight, thanks for posting this. It would be interesting to show how a fine-tuned model differs in similarities and "vocabulary". I'm also curious on the effects of quantisation, i.e. Q4, Q6, Q8, fp16 etc. on the internal "workings" of the LLM. Thanks again.
@chrishayuk Před 5 měsíci ⁺¹
It’s almost like you’re reading my roadmap
@andypai Před 5 měsíci ⁺¹
Thank you! Great video!
@chrishayuk Před 3 měsíci
thank you, glad it was useful
@khalilbenzineb Před 5 měsíci ⁺²
I was playing a bit with finetuning to force an output schema for some 7B Models, but lately I discovered schema grammar, which is a way to dynamically play with the EOS tokens, by limiting them to a specific set of tokens, to generate the output you want, This is very stable and way efficient for many cases that we may think it requires finetuning, For me it felt like a new dimension to get the model intentions inline, I loved the unique and efficient way you create your videos, So I wanted to ask you if possible to create a video for us about this, I feel it's very important
@chrishayuk Před 5 měsíci ⁺²
that's a good shout
@khalilbenzineb Před 5 měsíci
Thx@@chrishayuk
@kenchang3456 Před 5 měsíci ⁺¹
Thanks the visualization really helped me.
@chrishayuk Před 5 měsíci ⁺¹
so glad, seeing it at a lower level really demystifies what's going on
@Memes_uploader Před 5 měsíci ⁺¹
Thank you so much! Thank you youtube algorithm for showing such a great video!
@chrishayuk Před 5 měsíci
Glad you enjoyed it!
@gregherringer7700 Před 5 měsíci ⁺¹
This helps thanks!
@chrishayuk Před 5 měsíci
Glad it helped! :)
@lfzuniga31 Před 5 měsíci ⁺¹
based
@enlightenment5d Před 4 měsíci ⁺¹
Good! Where can I find your programs?
@chrishayuk Před 3 měsíci
in my github repo github.com/chrishayuk

Další v pořadí

Automatické přehrávání

HuggingFace Fundamentals with LLM's such as TInyLlama and Mistral 7B

HuggingFace Fundamentals with LLM's such as TInyLlama and Mistral 7B

How AI 'Understands' Images (CLIP) - Computerphile

How AI 'Understands' Images (CLIP) - Computerphile

why llama-3-8B is 8 billion parameters instead of 7?

why llama-3-8B is 8 billion parameters instead of 7?

Harley Quinn's revenge plan！！！#Harley Quinn #joker

Harley Quinn's revenge plan！！！#Harley Quinn #joker

Can This Bubble Save My Life? 😱

Can This Bubble Save My Life? 😱

NEJRYCHLEJŠÍ Střela v Historii FOTBALU…

NEJRYCHLEJŠÍ Střela v Historii FOTBALU…

Fine-Tune Llama3 using Synthetic Data

Fine-Tune Llama3 using Synthetic Data

Back & Forth - Why choose Forth?

Back & Forth - Why choose Forth?

Ollama 0.1.26 Makes Embedding 100x Better

Ollama 0.1.26 Makes Embedding 100x Better

A Hackers' Guide to Language Models

A Hackers' Guide to Language Models

Model Distillation: Same LLM Power but 3240x Smaller

Model Distillation: Same LLM Power but 3240x Smaller

How the Gemma/Gemini Tokenizer Works - Gemma/Gemini vs GPT-4 vs Mistral

How the Gemma/Gemini Tokenizer Works - Gemma/Gemini vs GPT-4 vs Mistral

I wish every AI Engineer could watch this.

I wish every AI Engineer could watch this.

Getting Started with ReAct AI agents work using langchain

Getting Started with ReAct AI agents work using langchain

Beyond the Hype: A Realistic Look at Large Language Models • Jodie Burchell • GOTO 2024

Beyond the Hype: A Realistic Look at Large Language Models • Jodie Burchell • GOTO 2024

Díl který byl OSOBNÍ🔥 JIŽ online na HEROHERO🔥

Díl který byl OSOBNÍ🔥 JIŽ online na HEROHERO🔥

Vybíráme outfit na koncert😅 Berete 💛 nebo 🩷? #justforfun

Vybíráme outfit na koncert😅 Berete 💛 nebo 🩷? #justforfun

女孩妒忌小丑女？ #小丑#shorts

女孩妒忌小丑女？ #小丑#shorts

【斗罗大陆】坏人居然敢欺负唐舞桐？斗罗家族可不好惹哟！#斗罗大陆#唐舞桐#唐三#小舞

【斗罗大陆】坏人居然敢欺负唐舞桐？斗罗家族可不好惹哟！#斗罗大陆#唐舞桐#唐三#小舞

Gli occhiali da sole non mi hanno coperto! 😎

Gli occhiali da sole non mi hanno coperto! 😎

Symmetrical face⁉️🤔 #beauty

Symmetrical face⁉️🤔 #beauty

Send this to an artist to make them… 🫢✨🎨 #artistomg

Send this to an artist to make them… 🫢✨🎨 #artistomg

Replacing a valve on a full water tank! 🫣💦 - 🎥 @the_ladyplumber

Replacing a valve on a full water tank! 🫣💦 - 🎥 @the_ladyplumber