How DINO learns to see the world - Paper Explained

Sdílet
Vložit
  • čas přidán 29. 08. 2024

Komentáře • 11

  • @akshaymundra1052
    @akshaymundra1052 Před 4 měsíci +2

    Loved your series on self-supervised learning. Are you also planning to cover DINOv2? I am particularly curios about the emergence property of the model -- how it is able to regress semantically consistent features for different parts of the objects (and not simple FG-BG separation as in DINOv1)!

  • @benmainbird
    @benmainbird Před rokem +3

    Great video! Keep it up👍

    • @borismeinardus
      @borismeinardus  Před rokem

      Genuinely happy to hear you liked it, thanks! ☺️

  • @user-ji8lk5ls2k
    @user-ji8lk5ls2k Před měsícem

    Hi, I'm a bit confused about the centering method you described in this video(3:25). In your video, you're adding the center to the online network's output, which is different from what I've seen in other implementations of DINO (czcams.com/video/h3ij3F3cPIk/video.htmlsi=BUj7iQMXKaEs0Nr1&t=1296). Most implementations subtract the center from the output. Could you please clarify if there's an error in the video or if this is a different approach to centering?

  • @nasosgerontopoulos5267
    @nasosgerontopoulos5267 Před 9 měsíci +1

    Very good content. Congrats 👍. Reading papers can be tough for many people, and such videos make it a lot easier to keep up with these state of the art advancements. As a fellow researcher, do you think investing time in self-supervised learning research is worth it right now? Considering that me and my team do not have access to such computational power as META and Google, I am not sure if we can keep up.

    • @borismeinardus
      @borismeinardus  Před 9 měsíci

      Hey, thanks! 😊
      I think it is worth it! SSL is a broad field and SSL in the case of Multi-Modal Learning is very relevant. Yes, you will likely not be able to build the largest foundation models and go for scale, but you can definitely work on more nuanced research. E.g. Imagebind is a great example of a simple idea that does not require all the data and compute in the world. Btw. I also have a video on that paper :)
      czcams.com/video/QQJ3IR0ahMk/video.htmlsi=VYxxIQPiyAXnlsw9

  • @yossefdiab7452
    @yossefdiab7452 Před 6 měsíci +1

    great explaination

  • @carsongutierrez7072
    @carsongutierrez7072 Před rokem +2

    Transformers~ ML bro~

  • @menkiguo7805
    @menkiguo7805 Před 3 měsíci

    it dose has the projection head though