Reinforcement Learning from scratch

Graphics in 5 Minutes

44,698 views

Published on 29 June 2024
How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT.
Part 1 of 3.
0:00 - intro
0:13 - pong
0:28 - the policy
0:51 - policy as neural network
1:32 - supervised learning
2:51 - reinforcement learning using policy gradient
4:24 - minimizing error using gradient descent
4:45 - probabilistic policy
5:01 - pong from pixels
6:58 - visualizing learned weights
8:18 - pointer to Karpathy "pong from pixels" blogpost
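
The chapters on the probabilistic policy and the policy gradient can be illustrated with a very small sketch. This is not the code from the video or from the Karpathy blog post. It is a toy, one-step stand-in for Pong in which the state is a single number (the ball's vertical offset from the paddle), the policy is a single logistic unit rather than a network over pixels, and the reward is +1 for moving toward the ball and -1 otherwise. The names `train`, `sigmoid`, the reward rule, and the learning rate `lr=0.1` are all assumptions made for illustration.

```python
import math
import random


def sigmoid(z):
    """Squash a real-valued score into a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def train(episodes=2000, lr=0.1, seed=0):
    """Policy-gradient (REINFORCE-style) training on a toy one-step task.

    Assumed setup, for illustration only:
      state  x  = ball height minus paddle height, drawn from [-1, 1]
      action    = 1 (move up) or 0 (move down), sampled from the policy
      reward    = +1 if the paddle moves toward the ball, else -1
    """
    rng = random.Random(seed)
    w, b = 0.0, 0.0  # the policy's only two parameters

    for _ in range(episodes):
        x = rng.uniform(-1.0, 1.0)

        # Probabilistic policy: probability of choosing "up".
        p_up = sigmoid(w * x + b)
        action = 1 if rng.random() < p_up else 0

        # Toy reward: was the sampled action the helpful one?
        reward = 1.0 if (action == 1) == (x > 0) else -1.0

        # Gradient of log pi(action | x) with respect to the logit.
        grad_logit = action - p_up

        # Nudge parameters to make rewarded actions more likely
        # and penalized actions less likely.
        w += lr * reward * grad_logit * x
        b += lr * reward * grad_logit

    return w, b
```

Calling `train()` returns the learned `w` and `b`. A positive `w` means the policy has learned to prefer "up" when the ball is above the paddle and "down" when it is below. The step size `lr` sets how far each update moves the weights; its value here is an arbitrary choice for the toy problem.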

Comments • 43

@darthvader4899 3 months ago ⁺¹⁶
This video is super underrated. In fact the whole channel is underrated.
@themathguy3149 8 months ago ⁺⁶
Your Channel IS SO GREAT, I share with all my eng friends for you to get more visibility!
@tushargupta1999 3 months ago ⁺²
This video is amazing. You explained everything in such a simple manner. I am feeling really motivated to learn more about reinforcement learning and neural networks after watching this.
@ashketchum1244 10 months ago ⁺⁴
I don't know how I stumbled upon this video but that was very interesting and intuitive to understand. Thank you.
@metaljacket8102 2 months ago ⁺²
This is really awesome! It's the best video that explains DRL in such an easy to understand way!
@a.aspden 9 months ago ⁺²
Your videos are great. Looking forward to more!
@marcinstrzesak346 9 months ago ⁺¹
Great video, very helpful, easy to understand.
@themax2go 3 months ago ⁺⁴
agi: 1. ai develops understanding of win-loss conditions and sets policy params (inputs & actions) accordingly. 2. ai creates (= designs & builds) training env(s). 3. ai iterates, avals & adjusts policy parameters accordingly 4. done (or validation run(s) w/ human(s))
@gmjammin4367 10 months ago ⁺¹
Amazing video as always :)!
@codybarton2090 24 days ago ⁺¹
I agree once you see how it all works it seems like 1s and zeros give me some feed back on r/grand unified theory or cosmo knowledge
@cloudysh 2 months ago ⁺¹
This was so surprisingly great :3
@CptDoge-rn3ou 8 months ago ⁺¹
I really like the way you visualize what you are talking about. Thank you for putting in the effort!
@moldo800 5 months ago ⁺¹
Excellent. Congratulations ❤
@swannschilling474 3 days ago
Thanks a lot for this one! 😊
@luiseduardocraizer7416 1 month ago ⁺¹
Excellent content!
@jameslibby5215 9 months ago ⁺⁶
Very very underrated channel
@benc7910 5 months ago
Underrated, two Rs
@jameslibby5215 5 months ago
@@benc7910 thank ya sir
@mado.madeleine 10 months ago ⁺¹
Super helpful! Thank you 🙏🏽
@nikbivation 10 months ago ⁺¹
thank you for this!
@mohajeramir 2 months ago ⁺²
Excellent
@jdlopes06 18 hours ago
Thank you!
@ireoluwaTH 10 months ago ⁺¹
Thank you!!!
@solveigberling1662 3 months ago ⁺¹
That was dope
@kniv0gaffel 8 months ago ⁺¹
Brilliant
@BlueBirdgg 10 months ago ⁺¹
Can you playlist each one of your topics plz?
I wanted to post on Twitter (X) your video topics but could only post a single video at a time.
Great content by the way. Ty very much.
Your perspective on some topics helped me a lot to get a more intuitive understanding.
@g5min 10 months ago
Good idea! Here's one on generative AI:
czcams.com/play/PLWfDJ5nla8UoR8P7AGqVw7ZPjXajUFLMo.html
Here's one on reinforcement learning:
czcams.com/play/PLWfDJ5nla8UoexEaLqVMw7q3Ft0vRYscL.html
Here's one on LLMs + text-to-image:
czcams.com/play/PLWfDJ5nla8UoG2mvvHs_OS0asAKC5HJeu.html
@BlueBirdgg 10 months ago
@@g5min Ty!
@edvinbeqari7551 5 months ago
What is your reward function for the pong game? I did a similar pong game and I couldn't get it to learn.
@maxim_ml 1 month ago ⁺¹
that was good
@axe863 7 months ago ⁺²
Simple Reinforcement learning is extremely dangerous in certain nonstationary environments 😅
@mineq4967 3 months ago
but by what number do you change the weights like you never told us
@bombur9007 2 months ago
how many layers should such network have
@mind6861 17 days ago ⁺¹
Can we have the code for this
@nischalyou 10 months ago
what's the name of this video game?
@gaydemaupassant6263 16 days ago
Pls I want the code plsss
@herikaniugu 8 months ago
Imagine using reinforcement learning in quantitative finance 😊
@FRANKONATOR123 10 months ago
Can you share the source code for this project
@g5min 10 months ago
You can follow the link to the Karpathy site at the end of the video, repeated here:
karpathy.github.io/2016/05/31/rl/
@macratak 10 months ago
ah yes, reinforcement learning. a fundamental computer graphics technology
@g5min 10 months ago ⁺⁵
I think that character/game-AI is pretty central to graphics
@pw7225 10 months ago ⁺¹
Why so negative?
@revimfadli4666 10 months ago
@@g5min especially AI image generation or processing nowadays
