An introduction to Reinforcement Learning

  • added 8 June 2024
  • This episode gives a general introduction to the field of Reinforcement Learning:
    - High level description of the field
    - Policy gradients
    - Biggest challenges (sparse rewards, reward shaping, ...)
    This video forms the basis for a series on RL where I will dive much deeper into technical details of state-of-the-art methods for RL.
    Links:
    - "Pong from Pixels - Karpathy": karpathy.github.io/2016/05/31/rl/
    - Concept networks for grasp & stack (Paper with heavy reward shaping): arxiv.org/abs/1709.06977
    If you enjoy my videos, all support is super welcome!
    / arxivinsights
    If you have questions you would like to discuss with me personally, you can book a 1-on-1 video call through Pensight: pensight.com/x/xander-steenbr...
    ::Chapters::
    00:00 Intro
    01:03 So what is Reinforcement Learning?
    03:39 Learning without explicit examples
    07:25 Main challenges when doing RL
    15:04 Are the robots taking over now?
  • Science and technology
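
The description above lists policy gradients as one of the topics covered. As a rough illustration only (not code from the video), here is a minimal sketch of the REINFORCE update on a hypothetical two-armed bandit. The function names, reward probabilities, learning rate, and step count are all assumptions chosen for the example.

```python
import math
import random


def softmax(logits):
    """Turn a list of logits into action probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train_bandit(reward_probs, steps=2000, lr=0.1, seed=0):
    """REINFORCE on a hypothetical bandit with one-step episodes.

    reward_probs[i] is the (assumed) chance that arm i pays a reward of 1.
    Returns the learned action probabilities.
    """
    rng = random.Random(seed)
    logits = [0.0] * len(reward_probs)
    for _ in range(steps):
        probs = softmax(logits)
        # Sample an action from the current policy.
        action = rng.choices(range(len(probs)), weights=probs)[0]
        reward = 1.0 if rng.random() < reward_probs[action] else 0.0
        # For a softmax policy, d/d logit_k of log pi(action)
        # equals 1[k == action] - probs[k].
        for k in range(len(logits)):
            grad_log_pi = (1.0 if k == action else 0.0) - probs[k]
            logits[k] += lr * reward * grad_log_pi
    return softmax(logits)


if __name__ == "__main__":
    print(train_bandit([0.2, 0.8]))
```

Actions that happen to be followed by reward become more likely, so the policy drifts toward the better arm without ever being shown a labelled "correct" action. With a reward that is almost always zero the update is almost always zero too, which is the sparse-reward problem the description mentions.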

Comments • 404

  • @rednassie1101
    @rednassie1101 4 years ago +209

    People: ANN ARE TAKING OVER THE WORLD AND STUFF WILL NEVER BE THE SAME
    my horribly trained network on a cat: "dog"

    • @I_Lemaire
      @I_Lemaire 4 years ago +1

      Could they help with the necessary government takeovers associated with COVID-19? Temporary command economies could be more efficient.

    • @revimfadli4666
      @revimfadli4666 4 years ago +4

      YouTube's bots: "Robot fighting is animal cruelty"

  • @yuanyuansun3521
    @yuanyuansun3521 3 years ago +24

    “If u only give it a positive reward when it successfully stacked a block, it’ll never get to see any of those reward” Only if my tutors realise this.

  • @DanielHernandez-rn6rp
    @DanielHernandez-rn6rp 6 years ago +291

    Love this guy. As an RL PhD student, your videos are golden.

    • @nikhillondhe5815
      @nikhillondhe5815 5 years ago +13

      RL PhD sounds so interesting!

    • @andres18m
      @andres18m 5 years ago +2

      Institute name?

    • @Ayanwesha
      @Ayanwesha 5 years ago +1

      hello..sir
      i am a grad stud
      can anyone tell me plzz if back propagation is necessary in supervised and unsupervised learning? or is it only used in reinforcement learning
      thanks

    • @hcgaron
      @hcgaron 5 years ago

      Ayanwesha 12345 yes, back propagation is used as a basis for gradient based methods of optimization

    • @ernie2111
      @ernie2111 5 years ago +3

      "RL PhD" didn't know such things exist lol

  • @denebvegaaltair1146
    @denebvegaaltair1146 2 years ago +14

    Your videos have just the right amount of technical terms such that student engineers can learn something, and also the right amount of summary and rewording such that beginners can get a vague idea of concepts. Thank you so much

  • @SukhwinderSingh-fb9qw
    @SukhwinderSingh-fb9qw 5 years ago +65

    This was one of the best videos on RL that I have seen. Extremely informative. The way you explain things is awesome. Keep up the great work! Cheers man!

  • @floriandebrauwer9140
    @floriandebrauwer9140 4 years ago +2

    Thanks for your work! I like the way you present such a complex field in a clear manner for people without any background. Thanks to you I know where to start in my learning journey!

  • @atcer51
    @atcer51 7 months ago +1

    fiiiinnnaaaallly after tons of googling, I finally found a USEFUL video that actually EXPLAINS how to reward the agent, and not just saying:
    'oh u just reward it'

  • @cemgocer8185
    @cemgocer8185 3 years ago +3

    Quality of the video is off the charts. Topics u have chosen to explain the field, the way u explain them and especially pointing the common misconceptions that make it harder for us to get into what AI really is... I'm sad that there is no superlike button. Rare to see videos of this quality and honesty

  • @snippletrap
    @snippletrap 5 years ago +16

    The perils of reward shaping are well understood in a public policy context, where incentives can lead to "unintended consequences".

  • @MuditBachhawatIn
    @MuditBachhawatIn 4 years ago

    I have been meaning to read about RL for a long time. This video couldn't be more simple and clear introduction to it. Thanks man!

  • @davidfield5295
    @davidfield5295 5 years ago

    The misuse of 'literally' notwithstanding, this was an excellent video. Very clear and concise explanation.

  • @Hyuts
    @Hyuts 4 years ago +16

    Explains in an elegant manner more than I have learned in half a semester of my AI college course.

  • @orfeasliossatos
    @orfeasliossatos 5 years ago +2

    I've been literally looking all over for a video like this, thank you so much

  • @josefpolasek6666
    @josefpolasek6666 4 years ago +1

    Your videos are absolutely amazing! Thank you very much for explaining concept of RL in 16 minutes.

  • @shashankshivakumar4732
    @shashankshivakumar4732 5 years ago +4

    I love this video. I love his critical and grounded thinking. Great work!

  • @gusbakker
    @gusbakker 5 years ago +2

    Great balance between a very well explained content and the interesting facts about current progress in AI at the end. Good work

  • @angelakong653
    @angelakong653 4 years ago +1

    This was really helpful. Thank you to people like you for creating this content. Appreciate you, Xander!

  • @TY-un4no
    @TY-un4no 3 years ago +1

    Complex stuff made simple and easy, this is a very good intro video to RL. Starting to learn RL for work and your video gave me a great starting point, thank you!

  • @nemx4u
    @nemx4u 6 years ago +2

    You explain hard topics beautifully! great job. Would love to see more RL videos!

  • @PriyanshuGupta-hf2hm
    @PriyanshuGupta-hf2hm 3 years ago

    You explained so well that I understood each and everything in your video. I am overjoyed!

  • @jackwhite9332
    @jackwhite9332 6 years ago +7

    Impressive explanation, found this very useful. Thank you!

  • @7810
    @7810 6 years ago

    Good stuff to learn the RL in terms of basic knowledge as well as the challenge it will face. Thanks for your time and sharing!

  • @Krimson5pride
    @Krimson5pride 4 years ago

    It was both professional and entertaining at the same time. Great and precise explanation.

  • @Lilowillow42
    @Lilowillow42 2 years ago +1

    Just wanted you to know that in my university course for introduction to AI our professor recommended your videos for machine learning. Your explanation is highly enjoyable and informative. Thank you!

  • @HARtalks
    @HARtalks 3 years ago

    It was really interesting and helped me to get a clear picture of what reinforcement learning is... Thank you!!

  • @allamasadi7970
    @allamasadi7970 6 years ago +151

    Your channel deserves more views 👍

  • @ArnauViaMartinezSeara
    @ArnauViaMartinezSeara 6 years ago

    Really useful. I am preparing a Reinforcement Learning class applied to finance and it is really helpful. Can't wait to see the next episode. Thanks

  • @lincolnaisagbonhi8953
    @lincolnaisagbonhi8953 4 years ago

    This is a great presentation on RL, short and clear content.

  • @TheBeansChopper
    @TheBeansChopper 3 years ago

    I think the comment section speaks for itself. This is a fantastic grasp of the basic concepts and issues with these technologies in such a short time, without diving unnecessarily into formalism. Thanks :)

  • @Alex-gc2vo
    @Alex-gc2vo 5 years ago

    your videos are some of the best explanations I've found for a lot of these very advanced subjects. I suspect your viewer count is going to jump very quickly. keep it up.

  • @dean8147
    @dean8147 2 years ago

    You’re a legend mate. Honestly, thanks for all of your hard work

  • @saaniausaf9621
    @saaniausaf9621 5 years ago

    I loved the way you explained everything. Thanks!

  • @mohammadhatoum
    @mohammadhatoum 5 years ago

    Great job.. Explained the subject in a simple way. Keep it up and looking forward for new videos

  • @codyheiner3636
    @codyheiner3636 5 years ago +1

    Love the philosophical discussion at the end!

  • @nateshrager512
    @nateshrager512 6 years ago

    Great job introducing the topic. Very nice job dispelling misconceptions surrounding the topic as well. I put on that notification for your next videos, looking forward to em : )

  • @Z4NT0
    @Z4NT0 3 years ago

    I learned so much in just 16 minutes. Awesome Video!

  • @majeedhussain3276
    @majeedhussain3276 6 years ago

    You deserve million subscribers hopefully one day you will. So much clarity in every video. Keep going...

  • @OliverZeigermann
    @OliverZeigermann 5 years ago

    Very lively and understandable. Great work!

  • @soumyakantadash5986
    @soumyakantadash5986 4 years ago

    These videos are gem!!!..... incredible, precise and knowledgeable!!!!

  • @matfuckk4736
    @matfuckk4736 6 years ago

    Great quality and well-appreciated content. Please, continue, became your patron.

  • @aanex2005
    @aanex2005 4 years ago

    I have no idea about RL but your video has given me a good jump start. Thanks man

  • @thanasispappas62
    @thanasispappas62 11 months ago

    By far the best video of RL ive ever seen.

  • @josephedappully1482
    @josephedappully1482 6 years ago

    This is a great video; thanks for making it! Looking forward to your next one.

  • @poojanpatel2437
    @poojanpatel2437 6 years ago +4

    Best Channel on yt for ml/dl/rl/ai... Keep up the good work... Would love to see your new video weekly...

    • @ArxivInsights
      @ArxivInsights  6 years ago +3

      I'd love to make more videos too! But since I'm currently doing this 100% in my spare time and 1 vid takes about 30hrs of work, there's really no way I can do one per week for now :(

    • @poojanpatel2437
      @poojanpatel2437 6 years ago

      Arxiv Insights Still amazing work till now... Love to see your more videos in future.. ❤

  • @amitredkar140
    @amitredkar140 5 years ago +1

    Great video!!!! Explained exceptionally, liked other videos as well from your channel. Would love to see more stuff related to AI/DL or RL. Thanks in advance. Keep up the good work....

  • @alirezaparsay8518
    @alirezaparsay8518 a year ago

    The explanation was so clear. Thank you.

  • @RoxanaNoe
    @RoxanaNoe 5 years ago

    Your channel is a great resource for getting into Deep Learning and AI.

  • @mehdisauvage1234
    @mehdisauvage1234 6 years ago

    Your videos are so useful and interesting ! This is pure gold to me :)

  • @bjbodner3097
    @bjbodner3097 6 years ago

    Great video, great channel!
    Thanks so much for making this!
    Can't wait to watch more:)

  • @tonakkie635
    @tonakkie635 5 years ago +1

    Great overview, well explained👍.Thanks

  • @rishidixit7939
    @rishidixit7939 8 months ago +2

    The sudden surprise of hearing Bruno Mars makes you pause video for other open tabs

  • @mantische
    @mantische 4 years ago

    One of the best explanations I've seen

  • @jonathaskerber5472
    @jonathaskerber5472 5 years ago

    Such a great introduction. Keep up the good work!

  • @sharadrawatindia
    @sharadrawatindia 6 years ago

    Hey Xander! Great videos. Looking forwards for your next video.

  • @ms_1918
    @ms_1918 4 years ago

    well came here for a 1 min intro to reinforcement learning for first class of course,
    stopped after 16 minutes what a superb experience.

  • @luiseduardocorralesmendoza9396

    Great examples and great explanation, thank you i was struggling with this topic

  • @tnmygrwl
    @tnmygrwl 6 years ago

    You do an awesome job of structuring the content. Loved the video.

  • @ingeniouswild
    @ingeniouswild 5 years ago +1

    Very nice episode! One thing that struck me about your suggestion that without Reward Shaping, the auto-learning of the 2600 games would be intractable: even for a human, this would be extremely difficult - we succeed with new, undocumented games because they often have similar sub-components and sub-goals that we already know from other games (or life). But I'm sure you could easily construct a game which would be impossible for a human to learn without any hints, while still having the same overall complexity.

  • @ArturoMoraSoto
    @ArturoMoraSoto 3 years ago

    Nice explanation, thanks for taking the time to create this great video.

  • @alenasazanova8331
    @alenasazanova8331 4 years ago

    That's a very interesting and understandable video. Thank you very much!

  • @jorgegarcia-torresfdez2471

    You did again a really nice work ! Congratulations :D

  • @gudusangtani
    @gudusangtani 4 years ago

    So well explained ....I also liked the comments on Boston robotics considering the hype and buzz about AI and ML.. You are doing a very good job !

  • @empiricistsacademy7181

    Thanks youuu for this video. Looking forward to your future videos!

  • @maisamwasti
    @maisamwasti 6 years ago

    Your videos = super informative! Thanks a lot for the good work

  • @sidharthaparhi7930
    @sidharthaparhi7930 5 years ago

    Also your intro is very high quality, like an intro to a good TV show

  • @funpy772
    @funpy772 3 years ago +1

    Just wanted to tell you people.. this video is still awesome.

  • @alanator25
    @alanator25 a year ago

    Thank you! This was a great introduction!

  • @khajasaen
    @khajasaen 6 years ago

    Best channel in the crowd ... keep it up Xander

  • @senri-
    @senri- 6 years ago

    Cant wait for the next videos keep up the great work!

  • @thaermashkoor6225
    @thaermashkoor6225 2 years ago

    Thanks for this clear introduction.

  • @laeeqahmed1980
    @laeeqahmed1980 5 years ago +1

    Great talk. Humans are not good at multiple sound recognition and you added music to your video.

  • @shirishbajpai9486
    @shirishbajpai9486 9 months ago

    watched in 2023 after all the LLMs stuff going on... still such relevant and pure gold!

  • @colorlace
    @colorlace 4 years ago +16

    The Lebowski Theorem: No superintelligent AI is going to bother with a task that is harder than hacking its reward function.

    • @wizardOfRobots
      @wizardOfRobots 4 years ago +6

      Unless its reward function punishes it for it.
      Now we have the Meta-Lebowski theorem: It's not going to bother with a task harder than hacking its hack-detection algorithm.

    • @halifakx
      @halifakx 2 years ago

      Perhaps a machine becomes smart, and then smarter as it decides that becoming smarter is the shortest path to reward... finally so smart that it realizes its reward is just colored mirrors? And creates a new program inside the program that cancels or outweighs the previous reward and creates new rewards? Programming these new rewards in its own language, not apparent to us... like the Facebook robots talking their own language

    • @halifakx
      @halifakx 2 years ago

      estramboticusssssss dangerosicusss hahaha

  • @DotCSV
    @DotCSV 6 years ago +45

    Hi Xander, just found your YouTube channel and I'm very amazed by your content! I also run a YouTube channel on the same topic but for the Spanish-speaking audience, and I'm happy to see that more new channels are growing to educate in the field of machine learning. I hope in the future we can cross over our content :)

    • @ArxivInsights
      @ArxivInsights  6 years ago +11

      Checked out your channel, great stuff man!! It's indeed nice to see that many people are starting to contribute to the online ML community in such a huge variety of ways :p

    • @TheHirou
      @TheHirou 3 years ago

      heyyy I think I just saw you on TikTok

  • @biiigates7381
    @biiigates7381 4 years ago +1

    I've been learning AI for almost a year now and on all the channels I've spent with this is the best one. Very underrated! (btw its the first time i discovered this channel and I instantly subscribed)

    • @mundeepcool
      @mundeepcool 4 years ago

      Same here, loved this video and I instantly subscribed... and also oh yeah yeah

  • @elvispiss
    @elvispiss 2 years ago +1

    Even after doing my second course of RL, this video is still so informative in its simplicity. Great videos

  • @adammenges6300
    @adammenges6300 5 years ago

    your videos are so good, keep up the great work 💪🏻

  • @mujahid1324
    @mujahid1324 3 years ago

    I would say "Wow". You nailed in 10 minutes what "reinforcement learning" is. Please keep sending more and more AI. Keep it up, Xander :)

  • @williamkyburz
    @williamkyburz 5 years ago +1

    Xander, extremely well done, lucid and cogent. You should be teaching at M.I.T. (or Universiteit Gent). The ability to teach complex subjects in an intuitive and simple way is a gift. Wish you the best in everything. Peace

    • @ArxivInsights
      @ArxivInsights  5 years ago +1

      Thanks William! I am actually doing my PhD in Gent at the moment :)

  • @azmathmoosa4324
    @azmathmoosa4324 6 years ago

    I like how u don't hype up anything. Great mate! I subscribe!

  • @sridhasridharan3600
    @sridhasridharan3600 3 years ago

    Great Videos! I am recommending these to my students.

  • @doctorartin
    @doctorartin 4 years ago

    Doing part of my PhD on potential AI strategies for decision-making in healthcare, and this was very useful, thank you.

  • @011azr
    @011azr 6 years ago

    Your explanations are great, thanks :)

  • @mgilson
    @mgilson 6 years ago

    I can't wait for your next video !! 😍😍😍

  • @karFLY1
    @karFLY1 6 years ago +1

    Great as usual. Thank you :)

  • @bsudharsh
    @bsudharsh 5 years ago

    Succinct; it's a brilliant rendition of reinforcement learning

  • @digvijaybhandari9747
    @digvijaybhandari9747 a year ago

    Really enjoyed the content here!

  • @govindnarasimman1536
    @govindnarasimman1536 4 years ago

    Very clear narration and true-to-ground comments. All the euphoria about AI needs to be grounded.

  • @wzyjoseph7317
    @wzyjoseph7317 2 years ago

    Very clear explanation! Thanks for the work!!!! XD

  • @rajendrarao3057
    @rajendrarao3057 5 years ago

    awesome video sir. please keep up the good work in this field....

  • @stefano3808
    @stefano3808 3 years ago

    really high quality videos, thanks for that

  • @andreasnatsis3027
    @andreasnatsis3027 5 years ago

    Amazing video. Keep up the good work and soon your channel will explode!

  • @owencarey3216
    @owencarey3216 6 years ago

    Amazing video! Thank you so much.

  • @LongTheRevolution
    @LongTheRevolution 2 years ago

    Amazing video. Thanks braddah

  • @qandos-nour
    @qandos-nour a year ago

    Great and clear explanation

  • @skviknesh
    @skviknesh 5 years ago

    Awesome!!!!!! Bro!!! Great explanation! !!!! Keep continuing!!!

  • @shahulhameed-xc1to
    @shahulhameed-xc1to 4 years ago

    Great learning experience. Thank you

  • @papaman1037
    @papaman1037 6 years ago +1

    Your content is far better than that guy that copies someone's code from GitHub, makes an obscure reference to the original author and states that he added a wrapper to make the code easier to use (a lie every time I've checked). He uploads the code as an original commit (no fork from the rightful author's repo). He intentionally misleads people and profits from it -- a legal necessity for calling it fraudulent.
    Your content is excellent, clearly founded in recent research papers, and you very professionally point out that material and more. You add value with your discussion of the topic. Thank you for an excellent channel. I would use Patreon but I am ill and not working. I'm doing my best to spread the word.

  • @ipuhbamrash6708
    @ipuhbamrash6708 4 years ago

    Fabulous!! No other word for you!!

  • @Vladeeer
    @Vladeeer 6 years ago

    Awesome video, keep up the good work!