Feature Pyramid Network | Neck | Essentials of Object Detection

Sdílet
Vložit
  • čas přidán 24. 02. 2023
  • This tutorial explains the purpose of the neck component in the object detection neural networks. In this video, I explain the architecture that was specified in Feature Pyramid Network paper.
    Link to the paper [Feature Pyramid Network for object detection]
    arxiv.org/abs/1612.03144
    The code snippets and full module implementation can be found in this colab notebook:
    colab.research.google.com/dri...
    The torchvision has a more flexible implementation which would take more than 3 feature layers from backbone
    pytorch.org/vision/main/gener...
  • Věda a technologie

Komentáře • 50

  • @paedrufernando2351
    @paedrufernando2351 Před rokem +8

    Keep the pearls of wisdom dropping sir..Privilage to learn from you miles across...

  • @lostpenguin3682
    @lostpenguin3682 Před 6 měsíci +2

    very helpful! I really like that you're explaining it with an example with concrete numbers!

  • @brunodias3524
    @brunodias3524 Před 11 měsíci +1

    I am so happy I found this video. Really good content!

  • @NehadHirmiz
    @NehadHirmiz Před rokem +2

    Excellent tutorial. Thank you very much.

  • @TeamDman
    @TeamDman Před 9 měsíci +1

    Thank you for sharing your knowledge!

  • @vipingautam9501
    @vipingautam9501 Před rokem +2

    This is excellent! I just love it.

  • @user-do5pn6hb2i
    @user-do5pn6hb2i Před 11 měsíci +1

    Sir, I have a lot of to say after finding your video on CZcams but just ❤ , respect and thank you. 🙏🙏

  • @applestarpie
    @applestarpie Před rokem +1

    I like your videos, which are easy and fun to learn. Thanks a lot!

  • @manueljohnson1354
    @manueljohnson1354 Před měsícem

    Excellent

  • @AdnanMunirkhokhar
    @AdnanMunirkhokhar Před 10 měsíci +1

    amazing explanation Dr.

  • @science.20246
    @science.20246 Před 4 měsíci +1

    is useful to add channel and spatial attention in conv layers to improve

  • @rampavanmedipelli6152
    @rampavanmedipelli6152 Před rokem +1

    Thank you... excellent clarity... please try to make a tutorial on anchor free detectors like FCOS..

  • @ranjithtevnan2909
    @ranjithtevnan2909 Před 7 dny

    I have 2 questions. How are the 1X1 and 3X3 CNN used trained to obtain the weight parameters? Also shouldn't 3X3 with stride 1 change the dimension, though it keeps the number of channels the same the size of the output feature would have changed and reduced by 2

  • @harshith_takkala
    @harshith_takkala Před rokem +1

    thankyou sir !

  • @krishnachaitanya7374
    @krishnachaitanya7374 Před 11 měsíci +1

    This is quite informative and helpful. Can you please create a video on prediction heads in fpn as in how to assign a predicted bbox to a particular feature map. That would be quite helpful.

    • @KapilSachdeva
      @KapilSachdeva  Před 11 měsíci

      Yes, thinking to make some videos about different label assignment techniques.
      Now about your question - the right terminology or phrasing of your request would be how to assign an anchor box to a particular feature map.

  • @yogeshwarshendye4857
    @yogeshwarshendye4857 Před 3 měsíci

    If done with UNet, it won't require upsampling as we concatenate the layers right?

  • @vincentpelletier1246
    @vincentpelletier1246 Před 3 měsíci

    I don't know if I got this wrong but if I take a 1x64x26x26 feature through a convolution that has a K=3 and S=1, I will definitely not end up with a 1x64x26x26, but with a 1x64x24x24. To achieve the desired shape would require a P=1.
    If I'm not correct, would someone please explain how the dimensions would work in this case?

  • @DIAHAYUNINGTYASWATI
    @DIAHAYUNINGTYASWATI Před 7 měsíci

    Do you know how to combine AFPN with the YOLO v8 algorithm? If you know, please tell me. Thanks

  • @kylehuang9035
    @kylehuang9035 Před rokem +2

    Could you give a tutorial of diffusing model to your VAE series? Its related and would like to see your explanation!

    • @KapilSachdeva
      @KapilSachdeva  Před rokem

      Though I understand the theory it’s just that I have never implemented/used them myself. I prefer to share those concepts that I have implemented myself and applied on some real world problem.
      But not saying no :) maybe one day. Thanks for the ask though.

  • @cheeziobodini
    @cheeziobodini Před rokem

    Instead of doing the upsampling via pytorch module and being angry about it, would it be any more useful to train an additional layer to do the upsampling instead? I'm thinking of a layer analogous to the decoder layer in an autoencoder.

    • @KapilSachdeva
      @KapilSachdeva  Před rokem

      No need to be angry at it :) … yes you could do that. As a matter of fact the additional layers after upsampling is to reduce it effects. The cost would be number of parameters. So it is always a trade off.

    • @cheeziobodini
      @cheeziobodini Před rokem +1

      @@KapilSachdeva Thank you! informative video btw

    • @KapilSachdeva
      @KapilSachdeva  Před rokem

      🙏

  • @rampavan4094
    @rampavan4094 Před rokem +1

    Could you give a tutorial on the vision transformer model for object detection?

    • @KapilSachdeva
      @KapilSachdeva  Před rokem

      in some time. have been preoccupied with some stuff but would try my best

  • @LongLeNgoc-qq5qn
    @LongLeNgoc-qq5qn Před 9 měsíci

    what about height and width are odd number (415), sir? In that case, the size after conv and after upsample is miss match. How to fix that, please!

    • @KapilSachdeva
      @KapilSachdeva  Před 9 měsíci

      Resize the image to 416 or any other size (e.g. 640) before feeding it to the network.

  • @user-uf3md5ub5j
    @user-uf3md5ub5j Před rokem +1

    Thanks a lot! would be the following videos soon?

  • @lordfarquad-by1dq
    @lordfarquad-by1dq Před rokem +1

    thank you for the content , next video soon?

    • @KapilSachdeva
      @KapilSachdeva  Před rokem +1

      🙏 … yes. Most likely tomorrow. Thanks for keeping me accountable.

    • @lordfarquad-by1dq
      @lordfarquad-by1dq Před rokem +1

      @@KapilSachdeva thank you again for the content, looking forward for more of these videos

    • @KapilSachdeva
      @KapilSachdeva  Před rokem +1

      Still working on the next video; not yet happy with it hence not published yet.

  • @user-pf8px7iz3z
    @user-pf8px7iz3z Před rokem

    new video when ?

  • @nayab.quteer
    @nayab.quteer Před rokem

    Can you make the video in Urdu language

    • @KapilSachdeva
      @KapilSachdeva  Před rokem

      There are urdu subtitles and may be that will be of some help!