ChatGPT: Zero to Hero

  • Published 28. 08. 2024

Comments • 15

  • @Rm-no6jr · 10 months ago

    Your channel deserves more. Thanks a lot

  • @amiralioghli8622 · 10 months ago

    Thank you, sir, for sharing valuable information through your YouTube channel. Once again, I have a request: please create a series on how to apply Transformers to time-series tasks such as anomaly detection, forecasting, or classification; covering just one of these tasks would be enough for us. I have followed numerous articles, short notes, and videos on applying Transformers to time-series data, but it is still not clear to me. I am a beginner on this Transformer journey, and there are no genuinely useful videos on YouTube overall.

  • @user-zt2vq8ne1l · 6 months ago

    Thank you for sharing this valuable content. Great channel!

  • @chrisogonas · 11 months ago

    Great picture of how the GPTs work and what they are. Awesome 👍

  • @victle · 10 months ago

    Mind-blowing that all of this good stuff is free. Great video!

  • @user-wc7em8kf9d · 10 months ago

    Thanks, mate. I love the summary around 2:30!

  • @khoshsirat · 4 months ago

    Great video, but I think the "Likert scale" part around 31:00 is not correct. They scale the ratings for each rater separately, and they use those questions to detect sensitive topics and filter them.

  • @barni_7762 · 11 months ago

    Watching this while fine-tuning Llama 2 :D
    I think this may be the first GPT tutorial featuring RLHF.

  • @DaTruAndi · 11 months ago

    Awesome video. A few comments:
    Is the architecture of the reward model OpenAI used actually publicly documented? Can we be sure it is a GPT model? (I believe you mention it.)
    I would love a deeper, contrastive look at BERT vs. GPT following what you mention. You touch on it when you talk about stacking them up, but it could make sense to cover the whats and whys a bit more.
    At 43:41 you say the table is generated anew, and that this is why generations differ even for the same prompt. But wouldn't the table be the same, with the sampling strategy applied to that table producing a different token each time?
    Overall:
    Maybe "making of the tasty ChatGPT sausage" would have been a better video title :)
    The current title may set wrong expectations for folks who casually discover your great content; from the title alone, many people may expect to become heroes at using ChatGPT.
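    [Editor's note: the commenter's point about sampling is right in spirit. A minimal sketch below (hypothetical vocabulary and made-up logits, not the video's actual model) shows why the same probability table for the same prompt can still yield different tokens under stochastic sampling:]

    ```python
    import math
    import random

    # Hypothetical next-token distribution: for a fixed prompt the model's
    # probability table is deterministic -- only the sampling step is random.
    vocab = ["cat", "dog", "bird", "fish"]
    logits = [2.0, 1.5, 0.5, 0.1]

    def softmax(xs):
        m = max(xs)
        exps = [math.exp(x - m) for x in xs]
        s = sum(exps)
        return [e / s for e in exps]

    def sample_token(probs, rng):
        # Inverse-CDF sampling: walk the cumulative distribution until
        # the random draw falls inside a token's probability mass.
        r = rng.random()
        cum = 0.0
        for tok, p in zip(vocab, probs):
            cum += p
            if r < cum:
                return tok
        return vocab[-1]

    probs = softmax(logits)  # the same "table" every time for this prompt
    rng = random.Random(0)
    samples = [sample_token(probs, rng) for _ in range(5)]
    print(samples)  # varied tokens despite an identical distribution
    ```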

  • @r.alexander9075 · 6 months ago

    I have a question:
    How does the GPT architecture produce outputs without an encoder? Doesn't the cross-attention module need K/V pairs from the encoder?
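    [Editor's note: GPT is decoder-only, so the cross-attention sub-layer is dropped entirely; masked self-attention draws its keys and values from the input sequence itself. A minimal sketch (toy dimensions, no learned projection weights) of that causal self-attention:]

    ```python
    import math

    # Toy causal self-attention, the only attention in a decoder-only block.
    # Q, K, and V all come from the same token sequence, so no encoder
    # (and hence no cross-attention) is needed.
    def causal_self_attention(x):
        """x: list of d-dimensional token vectors; returns attended vectors."""
        d = len(x[0])
        out = []
        for i, q in enumerate(x):
            # Each position attends only to itself and earlier positions
            # (the causal mask), using the sequence's own keys.
            scores = [sum(qj * kj for qj, kj in zip(q, x[t])) / math.sqrt(d)
                      for t in range(i + 1)]
            m = max(scores)
            w = [math.exp(s - m) for s in scores]
            z = sum(w)
            w = [wi / z for wi in w]
            # Weighted sum over the sequence's own values.
            out.append([sum(w[t] * x[t][j] for t in range(i + 1))
                        for j in range(d)])
        return out

    tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    attended = causal_self_attention(tokens)
    print(len(attended))  # one output vector per input position
    ```

    The first position can attend only to itself, so its output equals its input, which is exactly the causal-mask behavior that lets the model generate left to right.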

  • @sharangkulkarni1759 · 4 days ago

    But why is SFT, i.e. the first step, needed? Of course it helps, but it's not strictly necessary, right? Let's just do steps 2 and 3.

  • @neetpride5919 · 10 months ago

    Is there ANY open-source repo that includes a tool for manually rating the outputs of your own ChatGPT, as in steps 2 and 3 of this video? I want to *actually* train a TNN from scratch, and I have the time to do it.
    Why do so few people talk about this aspect of ChatGPT?
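    [Editor's note: in steps 2 and 3, human ratings are typically collected as pairwise preferences and used to train a reward model with a ranking loss, as in InstructGPT. A minimal sketch below (made-up scalar scores standing in for reward-model outputs, not a real training loop):]

    ```python
    import math

    # Pairwise ranking loss for reward-model training from preference labels:
    # loss = -log(sigmoid(r_chosen - r_rejected)), which pushes the reward of
    # the human-preferred completion above the rejected one.
    def pairwise_loss(r_chosen, r_rejected):
        return -math.log(1.0 / (1.0 + math.exp(-(r_chosen - r_rejected))))

    good = pairwise_loss(2.0, 0.5)  # preference respected: low loss
    bad = pairwise_loss(0.5, 2.0)   # preference violated: high loss
    print(good < bad)  # True
    ```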

  • @sam_joshua_s · 11 months ago · +1

    Can you make a video about a DeepSpeed coding implementation?

  • @prashlovessamosa · 11 months ago · +1

    Bhai is on steroids after 100k.