AI Plays Trackmania - Bloopers

Sdílet
Vložit
  • čas přidán 8. 06. 2023
  • In this video, and AI is trained with reinforcement learning to accumulate speed and finish a map as fast as possible.
    The AI learned a behavior where it turns around right before the finish line. This is not a one-off mistake, the AI repeatedly did similar things in back-to-back runs. Can you guess why ?
  • Hry

Komentáře • 13

  • @krembananowy
    @krembananowy Před rokem +33

    Because it was rewarded for speed and not for finishing the track?

    • @linesight-rl
      @linesight-rl  Před rokem +25

      It was rewarded for both, but not enough for finishing. You are right 🙂

  • @swapnarajmohanty6698
    @swapnarajmohanty6698 Před rokem +6

    thats so similar when i play the game myself

  • @pajcsilol
    @pajcsilol Před rokem +3

    Great work tho! I saw the best run and the Wirtual video too. I would like to ask some questions. The AI's job is probably finishing the track as fast as it can, but can it actually "see" turns and the track, or you have to train it again and again if you put it on a new map? I am curious if you can train an AI which can finish any non complicated map at a decent time first time or no.

    • @cyb3r._.
      @cyb3r._. Před 10 měsíci +1

      it can actually "see" turns and the track, as it "sees"/uses 20 fps black and white "screenshots" (idk if they actually go and take screenshots or just have it play on 20 fps colorless or something)

    • @Dizastermaster.
      @Dizastermaster. Před 10 měsíci

      It depends. Usually it works best with a top down camera cause 3d shit makes it a lot harder for AI to know what they're looking at. But some do and some don't. Depends on the model

  • @ArKantiK
    @ArKantiK Před rokem +2

    Do you think this AI could play ZRT Trackmania cup circuits ? i'm wondering if it would be possible, but don't really know how to use it and have the time to try on my side.

    • @linesight-rl
      @linesight-rl  Před rokem +1

      Are those circuits available on Trackmania Nations Forever ?

    • @ArKantiK
      @ArKantiK Před rokem +1

      @@linesight-rl I don't think so... but does the AI could work on TM² Stadium ?

  • @masonlyons2959
    @masonlyons2959 Před rokem

    What's the best way to contribute changes to the code? I've got a fork with some changes so far.

    • @linesight-rl
      @linesight-rl  Před rokem

      Thank you for your interest. This is a rapidly evolving 2-person project. At least in the short term, we prefer to keep it small.
      I'd be happy to have a look at changes you make, but will likely not merge your branch in our private repo.
      We will probably have a more open approach in the future and welcome contributions.
      How can we inform you when we decide to open our private repo and welcome contributions?

    • @masonlyons2959
      @masonlyons2959 Před rokem +3

      @@linesight-rl I've got notifications turned on for the channel so if a video is uploaded I'll see it. I'd suggest putting any open source notification there. As far as changes go I've done the following:
      Moved to logging instead of print
      Optimised variable access
      Made resuming last training easier
      Moved to automatic timestamp based filenames for saved data
      My to do is currently:
      Look into using mss instead of dxcam to see if it's quicker, from profiling it seems to screenshot creation is the longest part of the script.
      Introduce the surface and wall contact into the reward function to stop it trending towards ramming into the walls.
      Upon going onto the grass outside of the track ending the run and heavily punishing reware

  • @blu3tacos37
    @blu3tacos37 Před rokem

    Remember me when your viral🙏