Deep Mind AI Alpha Zero's Positional Masterpiece With the Black Pieces

Sdílet
Vložit
  • čas přidán 11. 12. 2017
  • Download Mproov and Improve Your Chess Today! app.mproov.me/AgadCZcams1
    Follow MprooV on Twitter / mproovapp #agadmator Check out all my videos on this match
    • Google Deep Mind Alpha...
    Read more about Deep Mind Alpha Zero here arxiv.org/pdf/1712.01815.pdf
    Link to the other games lichess.org/study/wxrovYNH
    A chess game between Deep Mind Alpha Zero and Stockfish
    Google Deep Mind Alpha Zero vs Stockfish
    One of the games
    1. e4 e5 2. Nf3 Nc6 3. Bb5 Nf6 4. d3 Bc5 5. Bxc6 dxc6 6. O-O Nd7 7. c3 O-O 8. d4 Bd6 9. Bg5 Qe8 10. Re1 f6 11. Bh4 Qf7 12. Nbd2 a5 13. Bg3 Re8 14. Qc2 Nf8 15. c4 c5 16. d5 b6 17. Nh4 g6 18. Nhf3 Bd7 19. Rad1 Re7 20. h3 Qg7 21. Qc3 Rae8 22. a3 h6 23. Bh4 Rf7 24. Bg3 Rfe7 25. Bh4 Rf7 26. Bg3 a4 27. Kh1 Rfe7 28. Bh4 Rf7 29. Bg3 Rfe7 30. Bh4 g5 31. Bg3 Ng6 32. Nf1 Rf7 33. Ne3 Ne7 34. Qd3 h5 35. h4 Nc8 36. Re2 g4 37. Nd2 Qh7 38. Kg1 Bf8 39. Nb1 Nd6 40. Nc3 Bh6 41. Rf1 Ra8 42. Kh2 Kf8 43. Kg1 Qg6 44. f4 gxf3 45. Rxf3 Bxe3+ 46. Rfxe3 Ke7 47. Be1 Qh7 48. Rg3 Rg7 49. Rxg7+ Qxg7 50. Re3 Rg8 51. Rg3 Qh8 52. Nb1 Rxg3 53. Bxg3 Qh6 54. Nd2 Bg4 55. Kh2 Kd7 56. b3 axb3 57. Nxb3 Qg6 58. Nd2 Bd1 59. Nf3 Ba4 60. Nd2 Ke7 61. Bf2 Qg4 62. Qf3 Bd1 63. Qxg4 Bxg4 64. a4 Nb7 65. Nb1 Na5 66. Be3 Nxc4 67. Bc1 Bd7 68. Nc3 c6 69. Kg1 cxd5 70. exd5 Bf5 71. Kf2 Nd6 72. Be3 Ne4+ 73. Nxe4 Bxe4 74. a5 bxa5 75. Bxc5+ Kd7 76. d6 Bf5 77. Ba3 Kc6 78. Ke1 Kd5 79. Kd2 Ke4 80. Bb2 Kf4 81. Bc1 Kg3 82. Ke2 a4 83. Kf1 Kxh4 84. Kf2 Kg4 85. Ba3 Bd7 86. Bc1 Kf5 87. Ke3 Ke6
    ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
    If you realllly enjoy my content, you're welcome to support me and my channel with a small donation via PayPal or Crypto.
    Link to PayPal donation www.paypal.me/agadmator
    Maiar Wallet @agadmator or get.maiar.com/referral/pv0mam...
    BTC address bc1qckd3ut0hqyymzv33eus97ld8klj02xhk2kcwld
    BCH address qzmfclyn69hqhjslls40r7r0dsttwe3tcsl946w4fr
    LTC address Laarf1RmvCpLt2BcSwC1PBLG3hRC4HjBrz
    NANO nano_1h1kgfaq88t1btwadqzx73rbha5hwbb88sxmfns851kwj8hnosdj51w388xx
    Monero 4AdvvqmC4xhPyyRSAEDxTTAoXdxAtX2Py6b8Eh4EQzBLGbgo5rY5Khcap1x76JrDJH87yibAE9b6TPwTsvBAiFFCLtM8Be7
    For any other currency address, contact me via agadmator@gmail.com
    Check out ALL my videos here • "Grand Opening" - Ande...
    Facebook: / agadmatoryoutube
    Twitter: / agadmator
    Instagram: / agadmator
    Lichess: lichess.org/@/agadmator
    Chess.com: agadmator
    Skype: agadmator
    League of Legends: agadmator :) "Watch me without ads on your Amazon devices (bit.ly/Agadmator_Amazon) and Roku TV (bit.ly/Agadmator_Roku)
  • Zábava

Komentáře • 485

  • @Johaylon
    @Johaylon Před 6 lety +619

    And stockfish resigned the game... I can never hear enough of this 👏

    • @Nash9r
      @Nash9r Před 5 lety +3

      Alpha Zero has no ego as well.

    • @inlovewithi
      @inlovewithi Před 5 lety +14

      I don't think he meant it for ego reasons, but rather because it's such a rare phrase. A situation that rarely happens.

    • @dwm20ll
      @dwm20ll Před 5 lety +2

      it's a super CPU vs a laptop with outdated software

  • @seandesir7272
    @seandesir7272 Před 5 lety +388

    Here is my take on alpha zero. I watch all its games. From what i have seen here is its tactics: it locks the middle, immobilizes a few of its opponent's minor pieces deep in their ranks, it sacrifices a minor piece or pawns to create files and mobilizes all its minor pieces for positional gains. It can be down one piece or two, but in reality it actually up cause some of its opponents pieces are immobilize or lock. Very clever. It creates multiple traps so that its opponents have an option to die slowly or die fast. Very naughty machine

    • @Iodestarr
      @Iodestarr Před 5 lety +21

      Something I've noted from watching alphazero is it seems alpha has a tendency to use his knights to force its opponent in zugzwang. As bait, sacrificing the knight to (usually) a bishop, in turn taking bishop.

    • @shankernarayan5028
      @shankernarayan5028 Před 5 lety +32

      It is a fan of anatoly Karpov

    • @outtabubblegum7034
      @outtabubblegum7034 Před 4 lety +8

      It's called ACTIVITY

    • @shillhuntingseason9707
      @shillhuntingseason9707 Před 4 lety +7

      It’s like watching killer whales force their prey to the surface of the ocean where there is no where left for the prey to swim to

    • @misteratoz
      @misteratoz Před 4 lety

      It just seems like it just does everything perfectly well....

  • @GLu-tb1pb
    @GLu-tb1pb Před 5 lety +399

    stockfish: e4?
    alphazero: you lose.

    • @dannygjk
      @dannygjk Před 5 lety +4

      lol savage.

    • @Hermes1548
      @Hermes1548 Před 4 lety

      @@dannygjk HA!

    • @deanaraula
      @deanaraula Před 4 lety +18

      Just straight up “Lmaooo mate in 211 after e4”

    • @ojasdighe991
      @ojasdighe991 Před 4 lety +1

      @@deanaraula meanwhile me proving alphazero by getting mated in 7

    • @kirill.borisov
      @kirill.borisov Před 5 měsíci

      That’s brutal.

  • @12345DJay
    @12345DJay Před 6 lety +351

    No light square Bishop was harmed in the making of this video

  • @erkintunca
    @erkintunca Před 6 lety +313

    So many alphazero videos despite you had said no more :D keep up the good work we love them all

    • @vvinny8
      @vvinny8 Před 6 lety +7

      Erkin Tunca very hard to resist the temptation!

    • @TaohRihze
      @TaohRihze Před 6 lety +17

      Guess Alpha Zero forced the move :)

    • @5thnation
      @5thnation Před 3 lety

      Taoh Rihze 😂😂

  • @BeerdyBruceLeeCentral
    @BeerdyBruceLeeCentral Před 6 lety +311

    YEAH, when ever I see a deep mind video I insta-click. The last few days I sat down and learned about how deepmind actually works, and it turns out deepmind is learning even when it's playing games against stockfish. This answers your question about the refusing of the draw. Deepmind probably found a good continuation after bying some time with the repeat moves. Keep making deepmind videos please. Deepmind is now my new favorite chess player :)

    • @MadaxeMunkeee
      @MadaxeMunkeee Před 6 lety +35

      Beerdy - Bruce Lee Central while it's certainly true that AlphaZero could learn while playing Stockfish, there are two reasons why I think they wouldn't have bothered:
      Firstly, it might learn bad moves from stockfish. AlphaZero learns through self play, because it gets the best game data from itself.
      Secondly because it would waste computation that could be spent focusing on the match.
      It is true though that after the match the game data could be fun through AlphaZero to make it better, but 100 games would be such a small contribution to its training set that I wouldn't see the point. In the four hours, it would have played over 700 billion games with itself.

    • @NathanBurnham
      @NathanBurnham Před 6 lety +30

      I teach deep learning, and I agree that they likely didn't train deep mind while playing the matches against stockfish. If they did the games would have little impact in it's play vs the millions of games is has already played.

    • @Sky2042
      @Sky2042 Před 6 lety +23

      What is probably happening with the refusals is that A0 is making the move with the next-highest probability of winning.

    • @omerulger8
      @omerulger8 Před 6 lety

      Hey, i have some questions about deep learning, just basic question to how to learn it. if you have time to answer them, i can give you contact details? shouldnt take more than tops 10 mins
      @Nathan Burnham
      facebook.com/omer.ulger.397

    • @EebstertheGreat
      @EebstertheGreat Před 6 lety +8

      I'm not sure where you got 700 billion from, but I believe AlphaZero has played only a few million matches against itself. The preprint mentions only 700,000 games and claims that its performance exceeded Stockfish's after just 300,000.

  • @curtisbrown547
    @curtisbrown547 Před 6 lety +34

    we like stockfish vs alpha zero because it's like watching the chess equivalent of a dragon ball z fight

  • @brianbernstein3826
    @brianbernstein3826 Před 6 lety +111

    Agadmator you are an amazing channel thank you for all your work

    • @agadmator
      @agadmator  Před 6 lety +11

      Thanks Brian

    • @alexcerullo3143
      @alexcerullo3143 Před 4 lety +1

      agadmator's Chess Channel damn this was 2 years ago is alpha any better now

    • @davidegallo2185
      @davidegallo2185 Před 4 lety

      I really hope it can't improve further

  • @yixunnnn
    @yixunnnn Před 6 lety +83

    4:26 you can smell the fear in Stockfish repeating his moves

  • @UnXPLO1Table
    @UnXPLO1Table Před 6 lety +7

    I guess, Alpha0 was programmed to go for 2-time repetitions whenever possible, in order to induce the horizon effect in typical chess engines that examine the game tree to relatively small depths (usually up to 20 moves ahead in the middlegame). One of the ideas that Matthew Lai (one of the Alpha0 contributors) had expressed in his master thesis is to explore the tree deeper in those variations that are assessed (by a special neural network) as the most likely to be in 'the principal variation' (i.e. to be played if both sides play optimally), which is closer to how humans calculate variations, as opposed to usual chess engines that waste time on a lot of improbable variations and extend the search depth only in very specific 'violent' situations (like captures). Alpha0 uses this 'neural network for move probabilities' approach in its Monte-Carlo tree search (search for 'AlphaZero' on arXiv.org and read that preprint) and sees further than Stockfish in the critical variations that end up appearing on the board.

  • @winterguyVV
    @winterguyVV Před 6 lety +28

    There is a fitness function in ai that says how good is your solution. It can be as simple as winning = 1, losing = -1. If they set draw as a -0,1 or something similar it gonna refuse the draw by repetition. They should release the progression of learning from those 4 hours. Usually its funny stuff. First games random moves ending in perpetual checks. Then it learns not to draw and come up with some crazy attacks and trades, and eventually it would come up with openings, and basic strategies. Ending in this fish eating beast.

  • @argonthesad
    @argonthesad Před 6 lety +16

    It's great to see that bully get a taste of its own medicine:)

  • @xyon9090
    @xyon9090 Před 6 lety +52

    *Agadmator said,*
    "No more AlphaZero vs. Stockfish videos..we may not be able to appreciate human chess.."
    You may be a victim of your own words my friend haha.

  • @strengthman600
    @strengthman600 Před 6 lety +10

    My theory for the threefold repetition thing is that both of the bots are playing their best move, which just so happens to be a repetitive move. The thing is, when they get to the third time that move is no longer the best, because it leads to a draw, so it rethinks its move and does a better one

    • @rohangeorge712
      @rohangeorge712 Před rokem +1

      ah so they do the second best move as the "best move" would lead to a draw otherwise so basically the second best move becomes the best move. i think that kinda makes sense yea

    • @liljackypaper
      @liljackypaper Před 9 měsíci

      This doesn't really make sense to me. If the second move is better than a draw then how could the first move be the best move if it leads to a draw? That is counterintuitive

    • @gJonii
      @gJonii Před 4 měsíci

      ​@@liljackypaperThe draw only happens after 3 repetitions. The best move doesn't lead to draw in the first 2 times, so playing it in the tiny hope opponent plays less than optimal move, is worth it.

    • @liljackypaper
      @liljackypaper Před 4 měsíci

      @@gJonii engines don't play like that though. They don't make sub optimal mate in one threats in hopes that opponents miss it

    • @gJonii
      @gJonii Před 4 měsíci

      @@liljackypaper If they lose nothing from doing it, why not? Alphazero specifically, being MCTS, would treat the tiny chance of opponent playing wrong worth the extra move. Stockfish I think would treat the moves equally good, with or without mate-in-1 trap

  • @mechanicalmind07
    @mechanicalmind07 Před 6 lety +94

    You know what would be interesting if they give alpha and stockfish different opening positions like nimzo or sicilian or some well known gambit positions like kings gambit etc and let them play

    • @isolatedprawn6592
      @isolatedprawn6592 Před 6 lety +15

      Debjyoti Bose i'd love to see alphazero play the kings gambit :)

    • @yuyurtrtrt2160
      @yuyurtrtrt2160 Před 6 lety +6

      IIRC in the paper we have alpha vs itself 100 times for a few popular openings. But they don't show any games only the winrates.

    • @fedra2866
      @fedra2866 Před 6 lety

      on arxiv

    • @SniperMonkeh
      @SniperMonkeh Před 6 lety +1

      The king's gambit is a forced win for black.

    • @isolatedprawn6592
      @isolatedprawn6592 Před 6 lety +1

      Old man eating a cookie since when?

  • @EebstertheGreat
    @EebstertheGreat Před 6 lety +114

    "Is e4 a refuted opening?" Let's not go crazy, here.

    • @sovietai2595
      @sovietai2595 Před 6 lety +11

      EebstertheGreat Well, Alpha Go is probably the best chess player ever, and it never plays 1.e4
      So there could be something to it.

    • @EebstertheGreat
      @EebstertheGreat Před 6 lety +6

      If, back when he was the best player in the world, Paul Morphy had decided to never play 1. e4, that wouldn't have meant he had refuted it. If AlphaZero never plays 1. e4, that may be because it is less successful at that opening, but there are all sorts of reasons why that might be the case beyond it simply being a bad opening.

    • @Pintkonan
      @Pintkonan Před 4 lety +4

      @@EebstertheGreat if A0 never plays 1. e4 and this is because it assesses it as less successful, this is exactly what a refutation is dude. after all, it never plays 1. e4 :o and in this video you can clearly see why.

    • @EebstertheGreat
      @EebstertheGreat Před 4 lety +11

      @@Pintkonan That is not what a refutation means. A refutation does not mean the world champion is bad at that opening (or marginally less good than d4). A line is refuted if a refutation is found--that is, if a defense is found that proves the line is worse than another move. There is no defense to e4 that has been demonstrated to be successful, it's just that over many games, some engines win more with d4. If you want to be strict about it, by your logic, every opening is refuted except the single opening that Lc0 or Stockfish prefers. And if that ever changes in a better engine, suddenly the opening becomes unrefuted.

    • @lelik0911
      @lelik0911 Před 4 lety +14

      It’s an interesting question. To definitively refute any opening, one would have to solve the game

  • @albo_ar
    @albo_ar Před 6 lety +88

    Alpha tries to win the game every time playing as he knows is the best move. The rook is the best move until the posible threefold repetition.

    • @boblavey3474
      @boblavey3474 Před 6 lety

      +Albo Nice

    • @cuervo3097
      @cuervo3097 Před 6 lety +3

      is not just that, in the second set of the three moves repetition, the rook and the bishop end up in the opposite position in comparison with the first set. which i thinks it's what alfa wanted. so you are right, the best move for black is the rook but not until the threefold, it's because of it. Saludos de un hincha cuervo desde huerta grande, córdoba

    • @albo_ar
      @albo_ar Před 6 lety +3

      Hola cuervo, i don't think that AlphaZero ever hopes for a threefold. He just want to avoid it as long as he can find another node that's better than draw.

    • @Dragon7Ball
      @Dragon7Ball Před 6 lety +4

      Albo's native language is Spanish. Since in Spanish Alpha Zero's word 'gender' would be masculine we automatically think of "he". It's a mistake we commonly make.

    • @ThePotaToh
      @ThePotaToh Před 6 lety +1

      Cuervo 10 It's not what AlphaZero wanted, but rather it was forced to play a different move as playing the same move gave Stockfish the chance to draw.

  • @benjamineinhorn2314
    @benjamineinhorn2314 Před 4 lety +2

    Really strikes me how visually beautiful alphazeros development is.

  • @xyon9090
    @xyon9090 Před 5 lety +29

    *I'd treat AlphaZero to a drink*
    for winning against stockfish as payback for beating me a lot.

    • @thearmyofiron
      @thearmyofiron Před 5 lety +7

      Then alpha zero beats you 10x more than stockfish

    • @SpaceEag11
      @SpaceEag11 Před 10 měsíci

      I am late but the enemy of my enemy is my friend so he would still buy Alpha zero that drink 😂

  • @znxftw
    @znxftw Před 6 lety +40

    ALPHAZERO IS THE FUTURE.

    • @Jonathan-ec9pp
      @Jonathan-ec9pp Před 6 lety +4

      Maybe... but if Alphazero is the future, we humans are the past...

  • @bardhanjoy
    @bardhanjoy Před 6 lety +14

    No matter how hard I try to define the game with a suitable word, I am heading for the same word over and over again - "Poetry".

  • @onetouchtwo
    @onetouchtwo Před 6 lety

    I'm a visitor, VERY much enjoying the AlphaZero coverage. Thanks for doing these videos.

  • @samsmith9764
    @samsmith9764 Před 6 lety

    Love these alpha zero videos man! keep up the good work :D

  • @fokkusuh4425
    @fokkusuh4425 Před 6 lety +14

    2017 - DeepMind AI
    2018 - By the time DeepMind became self-aware...

    • @kirill.borisov
      @kirill.borisov Před 3 lety

      It did. Now it's developing a master plan to conquer Earth.

  • @monkeysrightpaw
    @monkeysrightpaw Před 6 lety +8

    Hooray! Alpha zero returns :)

  • @FloydMaxwell
    @FloydMaxwell Před 5 lety

    Such great analysis of Deep Mind's maneuvering of the bishop

  • @realways6173
    @realways6173 Před 6 lety

    I really like the fact that these games are a real grind, and not just ownage in just few moves. For humans this game is great (its well balanced for both sides) and certainly has alot of future!!

  • @skaterfugater
    @skaterfugater Před 6 lety

    what i find interesting about the repition breaks by alpha is not the question whether it found a winning variation in the mean time or just trys to win and having the draw in its sleeve all the time but whether it *cares* about not losing a game and going on with a move it considers less optimal because it *wants* to go on.

  • @markusalanko9134
    @markusalanko9134 Před 6 lety

    It would be so interesting to force these engines to a certain opening and let them continue from there, just to see how it would turn out. Gotta say I did not expect to like these engine games, but they are pure gold and I´m so excited when I see you have uploaded another one! Keep up the good work, cheers from Finland \o

    • @liljackypaper
      @liljackypaper Před 9 měsíci

      Isn't that what originally happened? I thought neither engines has opening books?

  • @YotamPiano
    @YotamPiano Před 6 lety

    Alphazero just had a fish for dinner. loving those videos. his type of thinking is astonishing!!

  • @randomlife7935
    @randomlife7935 Před 6 lety +14

    Alpha Zero overprotected the e5 pawn and maneuvered the knight at d6 for the blockade. Is Niemzovich correct all along? Even A0 used it.

  • @barbosagiordano
    @barbosagiordano Před 6 lety +7

    Your dog is back! Great! =)

  • @benl3988
    @benl3988 Před 4 lety +8

    Imagine playing stockfish with black and you're offered a draw.
    But, you just think: "Nah, a4 is winning."

  • @pegion6275
    @pegion6275 Před 3 lety +1

    i would love to see alpha zero playing black with various openings like nimzo defense, scillian, caro-kann, and against some gambot positions too.

  • @meladezzat
    @meladezzat Před 6 lety

    +agadmator , plz keep making more AlphaZero videos, we need all the games against stockfish

  • @jaimeduncan6167
    @jaimeduncan6167 Před 6 lety

    In your variation, you can simply play Bc3 with white and you do stop both pawns for a while. It seems that the pound in H will fall but the black king will take d6 and from there is not dificult to win.

  • @rangedfighter
    @rangedfighter Před 6 lety

    I personally think that alpha chooses the best position, by repeating the position 2 times, and then doing it again, in the end it will be in the same position that it actually wanted to be (because after 2 repeatitions it will be 1 turn away from it's optimal position and after doing it twice it's exactly where it wanted to be)
    It circumwents the 3 fold repetition rule so to say to force the opponent to accept a position where they normally would want to draw.

  • @GowthamChakkravarthyNS

    Agadmator I love your channel and have watched most of your videos. Am planning to watch the rest of the videos too. I would like you to do videos on Chess openings and discuss the various variations in each main opening. There are few good chess openings videos on CZcams and I am sure the entire community would learn from your videos. Please do chess openings videos.

  • @trebledawson
    @trebledawson Před 6 lety

    With respect to AlphaZero *almost* doing three-fold repetition twice in a row: It is very likely that the rook moves (in response to the bishop moves) are in fact the moves that are most likely to lead to a win, if threefold repetition were not a rule. However, AlphaZero is trained to recognize when a move will result in a win, loss, OR draw; the first two repetitions are simply AlphaZero taking the most winning move at the time, but the most winning move changes when it will directly lead to a draw. What is most impressive is that AlphaZero learned the threefold repetition rule; it was not hardcoded into the neural network as it would be for a classical engine. Considering how few games end in threefold repetition, it's truly amazing how DeepMind was able to generate enough games for AlphaZero to learn threefold repetition from scratch.

  • @raisethecurve
    @raisethecurve Před 6 lety

    I pray the algorithm is developed for commercial distribution because this method of search is beautiful to behold. Could go a long way towards training the next generation of chess players.

  • @eshneto
    @eshneto Před 6 lety +17

    When refusing a draw, probably, Alpha Zero considers itself better in both positions so the draw is worse than going for the "less good" position.

    • @Amethyst_Friend
      @Amethyst_Friend Před 6 lety +6

      Yep, this is so obvious and I find it strange that so many people don't get it.

    • @james_carmichael
      @james_carmichael Před 5 lety +1

      Agreed, alpha repeats moves bc he thinks he has a better position and is trying to 'bait' his opponent or exhaust all chances that his opponent will make a different move and not repeat ... After alpha repeats twice he doesn't want the draw bc , idk, alpha thinks the position is still favorable or playable (in alphas mind!) So he avoids the draw and moves on vs. he always has a draw in the back pocket or he learns a new moves after the same position is repeated.

  • @Themozartthug
    @Themozartthug Před rokem

    @9.40
    Look at the pattern with the pawns and the king, it's completely symmetrical. Alpha moved that king loads of moves earlier, not sure how many......it new where to place the king, if knew what colour bishop was best, it basically new the whole

  • @aconsideredmoment
    @aconsideredmoment Před 6 lety

    Deepmind Alpha Zero's tight knit structure and movement of play reminds me of a sliding tile puzzle, both interlocking and spiral. A snapshot of Stockfish seems a looser version of the same structure and movement relative to Deepmind Alpha Zero (e.g. 5:58).

  • @outtabubblegum7034
    @outtabubblegum7034 Před 4 lety

    6:45 I think that this Bishop x Knight exchange has multiple purposes: strategically that's a bad Bishop in a close position, so it's great to exchange for a centralized knight; also that knight was defending c4, which will now depend on the Queen; as she can't move now, the obvious Alekhine Gun that Stockfish was planning to create at f won't happen.

  • @Superawesomebob9
    @Superawesomebob9 Před 6 lety +5

    What do you think would happen if Google let Alpha Zero train for more than 4 hours? What kind of God would they create if they let it train for weeks????
    #suggestion if you can find a Alpha Zero vs Alpha Zero game that would be very interesting.

  • @LunchThyme
    @LunchThyme Před 5 lety +7

    The Berlin defense is better, provided you're a strong enough player to consistently beat Stockfish.

    • @feliscorax
      @feliscorax Před 2 lety

      Stockfish is the Soviet Red Army. There is no Berlin defence.

  • @MadaxeMunkeee
    @MadaxeMunkeee Před 6 lety

    The reason AlphaZero plays for two repetitions is because it's designed to play the best move for the position on the board.
    In those situations, it really is playing the move it thinks is best. And only when the 'best' move would force a draw by three fold repetition does it consider another move.
    I think the takeaway you should probably have is that in those positions, AlphaZero prefers the move only if it does not cause a draw. The move it plays instead is its second choice, but still has winning chances.

  • @RLinares22
    @RLinares22 Před 6 lety +1

    I wonder if there's a way to deconstruct scenarios and outcomes from various playing scenarios by forcing Alpha 0 to play itself and set it's opening sequences (Queen's Indian / Belgian v e4 or others) then release the analysis to discover why... Could be interesting either way it's incredible play and thank you for sharing

  • @stillnessinmovement
    @stillnessinmovement Před 6 lety

    I first learned about the technology that DAZ uses (parallel distributed processing) in the early 90's and it was revelatory; AI is smarter when it tries to act like a real brain than a computer. I use some of the lessons from this in my personal work (making mistakes is GOOD, as it helps you learn, don't be afraid of making a mess of something, you might learn something!) and now seeing how DAZ makes such interesting, elegant moves, it's very cool to see.

  • @fujiapple9675
    @fujiapple9675 Před 6 lety

    7:05 this position reminds me of Alpha's French Defense game, just reversed with the black pieces.

  • @untwerf
    @untwerf Před 6 lety

    Hey agadmaster, can you offer general recommendations on the best chess books available with reference to particular authors and publishers.. i would also be interested to hear specific titles that you think are particularly good!

  • @spikebtvs
    @spikebtvs Před 6 lety

    Hi, i study machine learning, i think the 3 move repetition has to do with how self learning neural networks trains themself -- it has probably learned that the same position happening 3 times means a draw -- but if all it was given is the rules of chess it would have never explored past 3 repeated positions because it would have considered that position "known" or solved for -- the end implication is that is is now forced to pick its second best move, which it also probably thinks is winning .

  • @FloydMaxwell
    @FloydMaxwell Před 6 lety

    7:55 Never has a doubled pawn looked so powerful -- Alpha's pawn fortress...wow.

  • @raamshankar4121
    @raamshankar4121 Před 6 lety

    It was given with clear instructions during the initial programming. It works "Minimum defense and Maximum Attack". Stockfish has it opposite way.

    • @FelixIsGood
      @FelixIsGood Před 2 lety

      That is not how deep learning works.

  • @brandons4240
    @brandons4240 Před 6 lety

    What would be interesting is how fast AO could undisputably solve chess if allowed to play long enough and self learn to the point where it always picks the same move for any of the estimated 10^43 possible chess positions (there is an estimated 10^120 possible chess games). It constantly refines its strategy based on past learnings...it must already be close if not finished if it can beat Stockfish.

  • @dodgecoates8760
    @dodgecoates8760 Před 6 lety

    Great video!

  • @brandybuck7641
    @brandybuck7641 Před 6 lety +35

    HI AGADMATOR.... i am a huge fan of ur chess commentry...i would like to make a suggestion though...
    i think it would be very nice if u could make some space on the screen dedicated for the move (like b7 ,e4 etc) , for dead peices and for the name of the opening or attack(like kings indian defence or scicilian defence etc.)
    thank u

    • @Sameer_S_Kulkarni
      @Sameer_S_Kulkarni Před 6 lety +5

      It may not be a good idea as it would take the attention away from the action on the board. If you want the moves displayed, then what's the need of commentary?

    • @mozisi
      @mozisi Před 6 lety +1

      You're right, Sameer. In the last video someone had requested that the positions (Eg. Knight to c3 etc)NOT be mentioned. I didn't quite understand. I mean how are you supposed to analyze otherwise. But to answer brandybuck, I would suggest this
      1. If you needed screen space because it's difficult to grasp notations along with the video: I think I understand the nature of your request. But trust me, everyone has these struggles in the beginning where they find it difficult to comprehend the positions on the board and the notation. It will come with time. Suddenly one day, you'll be talking to yourself in notations (atleast for the first few moves). Suddenly when someone says 1.e4 c5 you'll inherently know it's the Sicilian without even looking at the screen or the board. And that's a beautiful feeling. Most people have this difficulty in the beginning and that might even deter you at times but that's the beauty of chess. But that's exactly why chess games are immortal because we can recreate games from 200 years ago with just notations. They're really powerful but it takes a few hours to get used to them. I sincerely hope you get through this phase and appreciate them, if this was your concern.
      2. If your request was just aesthetic in nature, I agree with Sameer in saying that it does take attention away from the board. The description of the video gives the exact moves played in the game. One thing that you can do is to import the pgn if you are analyzing or watch more and more videos so that you do not need any screen space for the moves.

    • @Joshuaposada
      @Joshuaposada Před 6 lety +2

      No I agree 29th brandy. If anything have it notated. To his website. Btw need a website lol

    • @mozisi
      @mozisi Před 6 lety +1

      If you haven't noticed already, the notations are there in the description. You don't find other popular youtube channels including notations with every move on the screen. And that is for the same reason. There is a strict correlation between what we see and what is perceived by our auditory senses at the same time. While watching a game, it's important to pay attention to the move on the board while the brain automates the position in the brain (as we hear the notation). If it is displayed along with every move, then it definitely becomes a distraction because our eyes would shift between what's happening and what's being displayed. Like I said, I understand it from a beginner's perspective for the sake of convenience but a little effort and patience can't do any harm. You don't find a lot of subscribers requesting this for the same purpose because most people already know that, it doesn't take ages to get used to notations. I do, however agree with you on the point of making a website. That would be cool. Might need Medo's picture as the background :D

    • @brettluther7303
      @brettluther7303 Před 6 lety +1

      Mozisi. ur right. Initially it was difficult for me but now i can understand easily and now i can visualize positions just by hearing them. like your explanation :-)

  • @orlenespinal5788
    @orlenespinal5788 Před 6 lety +4

    This is really funny because I have not lost with the Berlin defence jet. 15 matches.

  • @abebuckingham8198
    @abebuckingham8198 Před 6 lety

    To understand why the position repeats but the draw is refused we can look at the algorithm they used to train Alpha Zero. It uses a kind of Monte Carlo method which is a randomization procedure to decide which moves to try next. This means while training if you allow your opponent more opportunities to deviate from the best line you have a higher probability of winning in the position just because you get that extra roll of the dice. I would interpret this behavior as showing that alpha zero felt Stockfish's defense is optimal and that deviation from that line significantly improves Alpha Zero's evaluation of black's position.

  • @muhammadfahad1187
    @muhammadfahad1187 Před 6 lety +1

    Hey can you provide us with a download link to the chess engine you are using on your computer? Thanks

  • @Xenon777channel
    @Xenon777channel Před 6 lety

    If you look in the PDF paper on this, they did put Alpha Zero against Stockfish in the Ruy Lopez in 100 games, which they did in several openings, however, it's not clear which position it started from, either 3. Bb5 - a3 as in the picture, or 7. Bb3 - 0-0 as in the "PV". Nonetheless, Alpha Zero as black won 6 games, drew 44 and lost 0 from which ever position. As white, won 27, drew 22 and lost 1.

  • @donny.3775
    @donny.3775 Před 6 lety

    ur the best chess youtuber :)

  • @pgyore3111
    @pgyore3111 Před 6 lety

    I am glad there some discussion in the comments regarding the apparent handicaps Stockfish was dealt at the beginning of the match. Has anyone suggested a rematch yet?

  • @georgiosvavliaras1066
    @georgiosvavliaras1066 Před 5 lety +2

    At 6:41 how did black capture after white moved to f4? They were both on the 4th row next to each other (?)
    Am I missing something? Please help, I'm fairly new to chess, excuse my lack of knowledge

  • @u3k1m6
    @u3k1m6 Před 6 lety

    Thanks for analyzung these Aloha Zero videos. They're quite entertaining.

  • @jcsmith5984
    @jcsmith5984 Před 6 lety

    honestly, the only people i watch when it comes to chess commentary and analysis is MatoJelic and Agadmator's chess channel!
    They give the most accurate analysis and they are entertaining to listen to and watch!

  • @dickbrazen
    @dickbrazen Před 3 lety

    When you said it's difficult to imagine how Alpha makes progress, in situations like that, I just try to find the most likely candidate and start busting stuff up. More successful for someone of my level than you might think.

  • @MrYonch
    @MrYonch Před 6 lety

    Amazing video, thank you! I have some questions: It seems to me from the AlphaZero games and paper that it's power lays in super advanced stratigique thought (or maybe stratigique calculation? Hard to chose words to describe this "entity"). Whereas, from my limited knowledge, Stockfish's (and chess engines in general) strength lays in brute force of calculation. So, added with opening books and different middle and endgame tables, Stockfish is merely "mimicking" stratigique thinking, but isn't actually considering positinal aspects, space usage, flexebilty, activity and synergy. It IS eventualy "taken into consideration" indirectly via brute force, because the consequences of such elements are evident in lines calculated by Stockfish. Against a human or an inferior engine, the force of calculation is enough to "hide" the inability to think/calculate strategy. But it seems this is how it is outplayed by AlphaZero.. Also, Stockfish is engineered by humans to evaluate a position not only by calculating possible lines of play but also through material numeric value. Maybe we, humans, "misled" stockfish by "teaching" it a wrong or incomplete evaluation of material and position process... Maybe AlphaZero can teach us a new way of thinking about material value. Either we will learn that a knight is actually worth 3.5 and a bishop is worth 2.7, for example, or that it's wrong to even go through that line of thinking.
    What's also interesting in my opinion, is that SF's brute force makes it a "god" of tactics, as tactics are based on calculation rather than "thought". (They could also be based on 'post-calculation'. A GM doesn't have to always calculate a full process to spot a tactical trap, he/she can train to see it by noticing patterns and structures, or known lines of "theory" based and calculation made by them or someone else (including engines) in the past).
    I believe Stockfish is bound to always calculate, and it can't develop these abillities that GM's can. Though, It probably doesnt mind (Pun intended ;) ). It is a preety f***ing good calculator.
    But is it possible that AlphaZero DOES develop (like a human would) to recognizes tactis without calculating all the time?
    Is it possible AlphaZero is "thinking" strategy in a broad and complex way?
    Is it possible Stockfish is yet superior in tactics? Would be interesting to present them both with very complicated chess puzzles to see who is better. (Though probably even AlphaZero's inferior calculation power of 'only' 80,000 positions per second can stand any chess puzzle we humans created, and the gap between SF's and AlphaZero's tactiacal quality - if indeed exsists such a gap - would be insignificant or impossible to notice unless both of them are given only fractions of a second to solve the puzzle.)
    I want to add that all the asumpstions I based my thoughts upon could be flase. I am new to chess and know almost nothing about computering and AI tech.
    Also, as some people find it somewhat depressing that AlphaZero belittled centuries of game development in 4 hours, I want to add an incourging thought:
    Even though AlphaZero outclassed us and our programs so effortlessly, it still isn't capable of INVENTING AND DEVOLPING the game of chess. Or even if it is, if instruced to come up with a game, it can't do so just because it WANTS to and INTRIGUED by it. We still have the ability of doing something for the sake of pure enjoyment going for us. For now. :)

    • @brianniemi7051
      @brianniemi7051 Před 6 lety

      You have won the TL; DR award, my friend יונתן ריבק

  • @jasonq7504
    @jasonq7504 Před 3 lety

    4:37 Maybe alpha zero is giving up a move to reposition the White bishop, since it placed the knight in a square blocking the bishop.

  • @cukbeu4662
    @cukbeu4662 Před 6 lety

    i was looking forward to see where it is going to blow today

  • @Vampiracho
    @Vampiracho Před 6 lety

    Helrlo everyone! Love your videos and accent.

  • @existenence3305
    @existenence3305 Před 6 lety

    Hey Agadmator, did you find any ratings for AlphaZero??

  • @alexandre588
    @alexandre588 Před 6 lety

    "And in this position, stock-fish resigned" Kreygasm

  • @shrimp569
    @shrimp569 Před 6 lety

    The key here is the Alphazero calculates move probabilities, and not just what is the best move for its opponent. So there is always a small but non-zero probability that white will play something else, and thus giving black an advantage. Since Alphazero is not penalized for playing cycles (until it leads to a draw), it is always better to play the cycle and see whether the opponent will make a suboptimal move or not.

  • @arielperez3434
    @arielperez3434 Před 6 lety

    I thought you'd decided to let us keep enjoying human chess.
    Won't complain, these videos are awesome.

  • @lapulgaatomica9280
    @lapulgaatomica9280 Před 6 lety

    For me it just looks like Alpha Zero doesn't lose anything playing Rf7, because if the opponent responds in the drawish way it can just go back and nothing changed in the position. It is just scouting SF to see if it will answer in the best way possible, cause if it don't maybe there might be some crushing lines behind it

  • @malpigwalt
    @malpigwalt Před 6 lety

    Yes, we want them all.

  • @Trynottoblink
    @Trynottoblink Před 6 lety

    This is now the DeepMind Alpha Zero chess channel.

  • @i1pro
    @i1pro Před 6 lety

    I was reading about Google's AI was actually acquired from a startup company. The curious thing is that when AZ learned Go by playing against itself for a number of hours . That version actually defeated the AI version that learned from a feed of thousands of previous games. I wonder if AZ learned chess from zero also or it was fed games... Hope Antonio can clarify...

  • @ErnestoAE
    @ErnestoAE Před 6 lety

    I was expecting a bit more at 6:44 regarding the lines with Qxe3 capture and Re2xe3

  • @columbus8myhw
    @columbus8myhw Před 6 lety +7

    I wonder if all the drawn games were drawn because of threefold repetition.

  • @pashapasovski5860
    @pashapasovski5860 Před 4 lety

    It's fkn unbelievable! AI is going to rule the World and this game shows how!

  • @nipunpratap6602
    @nipunpratap6602 Před 6 lety

    more alpha and stockfish matches pls

  • @stateofdecay2210
    @stateofdecay2210 Před 4 lety

    I played against stockfish with an extra queen that I added to my army lol and winning the game was a real pain in the ass because stockfish defending is so strong

  • @hardkur
    @hardkur Před 6 lety

    AI brings the views ;-) i would never hear about your channel if not Deep mind games

  • @RonWolfHowl
    @RonWolfHowl Před 6 lety

    At 3:59 & 4:25, does Stockfish really think it can do no better than a draw? Or does it simply anticipate that the opponent will refuse the draw?

  • @juptor
    @juptor Před 6 lety

    It's pretty clear alpha learned that threefold repetition leads to a draw, and that if it takes a draw everytime one is offered, the overall amount of wins will decrease. However, even though it will not take a draw in this position, it does not mean alpha won't try the VERY cheaky move of just repeting two times in the "hope" that the opponent will instead blunder.

  • @keepthingssimple
    @keepthingssimple Před 6 lety

    Idea of repeating the move ... is to make sure wther yr opponent find the correct sequence .. I have seen many time stockfish doing this to me when i am analysing my games .. even with +4 advantage ..there is a chance that yr opponent might do something wrong that will increase yr advatage and finishing in quick moves ^_^

  • @suezix8689
    @suezix8689 Před 4 lety

    #agadmator I'm trying to find the Leela game (or Alpha) were the white queen spent much of her time on H1 but am failing. Can you or someone else point me in the right direction?

    • @dannygjk
      @dannygjk Před 4 lety

      I think you mean AZ vs SF SF played QID. One of the QID games. Several CZcams people covered it. Maybe you mean this game? :
      czcams.com/video/NaMs2dBouoQ/video.html

  • @DarkestValar
    @DarkestValar Před 6 lety

    I love these no more videos series :p jk agadmator's love all ur content as usual

  • @jakubdaniluk3578
    @jakubdaniluk3578 Před 6 lety

    Is Bg4 a valid attack in your opinion (6:55)?

  • @Koew
    @Koew Před 6 lety +3

    Hi agadmator. I just want to share, wouldn't it be interesting if Alpha Zero plays against itself? I mean, what if there are the same moves every time? What if white always wins? What would it mean?

  • @Stl71
    @Stl71 Před 6 lety

    It is time for the best GMs to unite in one team. We demand a match of this team against A0 robot now!

  • @Jacob32905
    @Jacob32905 Před 6 lety

    Alpha vs Magnus is gonna be an awesome match!

    • @dannygjk
      @dannygjk Před 5 lety

      Um yeah... no

    • @Tutdelasmore
      @Tutdelasmore Před 2 lety

      not really, no human stands a snowballs chance in hell against a top tier engine

  • @dakbabu
    @dakbabu Před 4 lety

    what is the difference between engine approach and alpha0 approach.

  • @Prinrin
    @Prinrin Před 6 lety

    Re: AlphaZero's strategy WRT draws: There's no reason to not repeat the position a second time. If it thinks the opponent made the best move, then offering them the opportunity to redo that move twice can only improve AZ's position. It might not work, sure, but there's no penalty for checking if they mess up.

  • @shankernarayan5028
    @shankernarayan5028 Před 5 lety +2

    Alpha zero is a Karpov fan

  • @chedwick
    @chedwick Před 6 lety

    It might be a good idea changing Stockfish icon to a fish frying on top of a grill when it is facing Deepmind.

  • @krishrao2778
    @krishrao2778 Před 5 lety +1

    alpha zero plays like a super Petrosian.

  • @brtw51
    @brtw51 Před 5 lety

    Would be interesting to see how well Alpha Zero played right after learning the moves... was it instantly a GM with it's calculating ability? I don't think I've seen it make a single bad move yet.

    • @dannygjk
      @dannygjk Před 5 lety

      No it was nowhere GM strength until it had played millions of training games.