ConvNeXt: A ConvNet for the 2020s | Paper Explained

Sdílet
Vložit
  • čas přidán 29. 08. 2024

Komentáře • 38

  • @chankhavu
    @chankhavu Před 2 lety +16

    I like how in your videos, you not only explain the details within the paper but also the more "meta" stuffs that is harder for people to grasp without reading through a lot of papers. Reading and understanding one paper is easy. Develop an intuitive understanding of a whole research subfield and its general directions is the hard part.

    • @TheAIEpiphany
      @TheAIEpiphany  Před 2 lety

      Thanks! Yes this one was rich in contextual information: DanNet, diagram correction from Twitter, and Swin transformer mainly I guess?
      Well, it's oftentimes hard to understand a specific paper without having all the necessary context - and it takes time to accumulate it.

  • @TheAIEpiphany
    @TheAIEpiphany  Před 2 lety +13

    We need to start working on reasoning - perception is converging we're out of ideas lol
    Bad jokes aside - at this point, it seems that CNN priors are quite adequate (in the case of natural images) - a hybrid approach (initial stages CNN-like and later stages transformer-like) seems to be the way to go, but the game is still on.

  • @gauravlochab9614
    @gauravlochab9614 Před 2 lety +3

    Thanks for the amazing explanation. Yes mixing up the code and paper boosts the implementation speed many folds. I love your work, you are awesome!

  • @ansariyusuf4774
    @ansariyusuf4774 Před 2 lety +3

    mix of paper and code is great!

  • @yevhendiachenko3703
    @yevhendiachenko3703 Před 2 lety +1

    Thank you! The video is excellent. I like that you mix code + paper in explanation and the fact that you provide a context and highlight the most essential parts.

  • @armingh9283
    @armingh9283 Před 10 měsíci

    Thank you. Very informative

  • @lalitmrinki
    @lalitmrinki Před 2 lety

    Thank you for such an in-depth explanation. Your plan of explaining the history and convergence and then going through the paper and code is great way for learners to understand the concepts deeply. Its very important to select the important portions from the paper for further exposition and to leave-out unnecessary boilerplate stuff. I liked that you didn't say "go and read the paper yourself"!

  • @adityakane5669
    @adityakane5669 Před 2 lety +1

    Excited for this one!

  • @manub.n6223
    @manub.n6223 Před 2 lety +1

    Thank you so much for the brilliant explanation

  • @enip239
    @enip239 Před 2 lety +1

    very nice content!
    I even didn't notice they use the old ResNet top-1 acc instead of wightman's.
    And that's make this model less comparative to the SOTAs.

  • @sushantgautam773
    @sushantgautam773 Před 2 lety +1

    Nice Explanation. By the way, could I know which software you are using just showing multiple things in one.

  • @PritishMishra
    @PritishMishra Před 2 lety

    Very thanks for the awesome content!

  • @luna0609
    @luna0609 Před 2 lety +1

    This was a great video. The best I've seen about explaining a research paper. 👏

    • @TheAIEpiphany
      @TheAIEpiphany  Před 2 lety +1

      Hah I don't know about that but thanks! 😂

  • @MrSebastian12358
    @MrSebastian12358 Před 2 lety

    Thanks a lot for your amazing effort.

  • @user-co6pu8zv3v
    @user-co6pu8zv3v Před 2 lety

    Thank you!

  • @oncedidactic
    @oncedidactic Před 2 lety +3

    always semirants!

    • @TheAIEpiphany
      @TheAIEpiphany  Před 2 lety +1

      My made up word just got its 1st validation - it's an official word from now on!

    • @oncedidactic
      @oncedidactic Před 2 lety +1

      ayyyyyyyyyyyyyyyyyyyy :D

  • @marvlousdasta2566
    @marvlousdasta2566 Před 2 lety +1

    Great video as always. What software are using to present and annotate the paper?

  • @Kenny4PresidentFTW
    @Kenny4PresidentFTW Před 2 lety

    this channles videos are amazing

  • @eng_ajy5091
    @eng_ajy5091 Před 2 lety

    Hi , First of all, I would like to thank you for your excellent and wonderful videos on artificial intelligence.
    I am a PhD student working on fast video captioning and I hope to reach real time captioning
    But I am confused by too many articles and too many techniques and algorithms in this field
    I need your help in guiding me to choose the right path among the existing methods:
    (traditional CNN, Transformer, YOLO, self attention only or make combination or others )
    While maintaining a trade-off between speed and accuracy

  • @mritunjaymusale
    @mritunjaymusale Před 2 lety +1

    the pre-training they did on Imagenet-22k was supervised or unsupervised like the way transformer papers do ?

  • @JapiSandhu
    @JapiSandhu Před 2 lety

    Can this be used for video classification?

  • @mahmoodkashmiri
    @mahmoodkashmiri Před 2 lety

    what tool do you use to read research papers on Ubuntu? Thank You!

  • @jonathansum9084
    @jonathansum9084 Před 2 lety

    3, 3, 9, s3. What does the s3 mean?

  • @momcilomrkaic2214
    @momcilomrkaic2214 Před 2 lety +1

    Boss