GIoU vs DIoU vs CIoU | Losses | Essentials of Object Detection

What is RAG? (Retrieval Augmented Generation)

73 - Image Segmentation using U-Net - Part1 (What is U-net?)

AI: Giganti, horečka a konec světa | KOVY

Looks realistic #tiktok

Na koncertě v Praze jsme se s vámi rozdělily o Kubíky Waterrr Cool😍 V tom vedru dobrý nápad, ne?😁

Feature Pyramid Network | Neck | Essentials of Object Detection

Kapil Sachdeva

zhlédnutí 11 381

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 24. 02. 2023
This tutorial explains the purpose of the neck component in the object detection neural networks. In this video, I explain the architecture that was specified in Feature Pyramid Network paper.
Link to the paper [Feature Pyramid Network for object detection]
arxiv.org/abs/1612.03144
The code snippets and full module implementation can be found in this colab notebook:
colab.research.google.com/dri...
The torchvision has a more flexible implementation which would take more than 3 feature layers from backbone
pytorch.org/vision/main/gener...
Věda a technologie

Komentáře • 50

@paedrufernando2351 Před rokem ⁺⁸
Keep the pearls of wisdom dropping sir..Privilage to learn from you miles across...
@KapilSachdeva Před rokem ⁺¹
🙏 thanks for the kind words.
@lostpenguin3682 Před 6 měsíci ⁺²
very helpful! I really like that you're explaining it with an example with concrete numbers!
@KapilSachdeva Před 6 měsíci
🙏
@brunodias3524 Před 11 měsíci ⁺¹
I am so happy I found this video. Really good content!
@KapilSachdeva Před 11 měsíci
🙏
@NehadHirmiz Před rokem ⁺²
Excellent tutorial. Thank you very much.
@KapilSachdeva Před rokem
🙏
@TeamDman Před 9 měsíci ⁺¹
Thank you for sharing your knowledge!
@KapilSachdeva Před 9 měsíci
🙏
@vipingautam9501 Před rokem ⁺²
This is excellent! I just love it.
@KapilSachdeva Před rokem
🙏
@user-do5pn6hb2i Před 11 měsíci ⁺¹
Sir, I have a lot of to say after finding your video on CZcams but just ❤ , respect and thank you. 🙏🙏
@KapilSachdeva Před 11 měsíci
🙏
@applestarpie Před rokem ⁺¹
I like your videos, which are easy and fun to learn. Thanks a lot!
@KapilSachdeva Před rokem
🙏
@manueljohnson1354 Před měsícem
Excellent
@AdnanMunirkhokhar Před 10 měsíci ⁺¹
amazing explanation Dr.
@KapilSachdeva Před 10 měsíci ⁺¹
🙏
@science.20246 Před 4 měsíci ⁺¹
is useful to add channel and spatial attention in conv layers to improve
@rampavanmedipelli6152 Před rokem ⁺¹
Thank you... excellent clarity... please try to make a tutorial on anchor free detectors like FCOS..
@KapilSachdeva Před rokem
🙏 yup. First need to implement it :)
@ranjithtevnan2909 Před 7 dny
I have 2 questions. How are the 1X1 and 3X3 CNN used trained to obtain the weight parameters? Also shouldn't 3X3 with stride 1 change the dimension, though it keeps the number of channels the same the size of the output feature would have changed and reduced by 2
@harshith_takkala Před rokem ⁺¹
thankyou sir !
@KapilSachdeva Před rokem
🙏
@krishnachaitanya7374 Před 11 měsíci ⁺¹
This is quite informative and helpful. Can you please create a video on prediction heads in fpn as in how to assign a predicted bbox to a particular feature map. That would be quite helpful.
@KapilSachdeva Před 11 měsíci
Yes, thinking to make some videos about different label assignment techniques.
Now about your question - the right terminology or phrasing of your request would be how to assign an anchor box to a particular feature map.
@yogeshwarshendye4857 Před 3 měsíci
If done with UNet, it won't require upsampling as we concatenate the layers right?
@vincentpelletier1246 Před 3 měsíci
I don't know if I got this wrong but if I take a 1x64x26x26 feature through a convolution that has a K=3 and S=1, I will definitely not end up with a 1x64x26x26, but with a 1x64x24x24. To achieve the desired shape would require a P=1.
If I'm not correct, would someone please explain how the dimensions would work in this case?
@DIAHAYUNINGTYASWATI Před 7 měsíci
Do you know how to combine AFPN with the YOLO v8 algorithm? If you know, please tell me. Thanks
@kylehuang9035 Před rokem ⁺²
Could you give a tutorial of diffusing model to your VAE series? Its related and would like to see your explanation!
@KapilSachdeva Před rokem
Though I understand the theory it’s just that I have never implemented/used them myself. I prefer to share those concepts that I have implemented myself and applied on some real world problem.
But not saying no :) maybe one day. Thanks for the ask though.
@cheeziobodini Před rokem
Instead of doing the upsampling via pytorch module and being angry about it, would it be any more useful to train an additional layer to do the upsampling instead? I'm thinking of a layer analogous to the decoder layer in an autoencoder.
@KapilSachdeva Před rokem
No need to be angry at it :) … yes you could do that. As a matter of fact the additional layers after upsampling is to reduce it effects. The cost would be number of parameters. So it is always a trade off.
@cheeziobodini Před rokem ⁺¹
@@KapilSachdeva Thank you! informative video btw
@KapilSachdeva Před rokem
🙏
@rampavan4094 Před rokem ⁺¹
Could you give a tutorial on the vision transformer model for object detection?
@KapilSachdeva Před rokem
in some time. have been preoccupied with some stuff but would try my best
@LongLeNgoc-qq5qn Před 9 měsíci
what about height and width are odd number (415), sir? In that case, the size after conv and after upsample is miss match. How to fix that, please!
@KapilSachdeva Před 9 měsíci
Resize the image to 416 or any other size (e.g. 640) before feeding it to the network.
@user-uf3md5ub5j Před rokem ⁺¹
Thanks a lot! would be the following videos soon?
@KapilSachdeva Před rokem ⁺¹
🙏 yes.
@lordfarquad-by1dq Před rokem ⁺¹
thank you for the content , next video soon?
@KapilSachdeva Před rokem ⁺¹
🙏 … yes. Most likely tomorrow. Thanks for keeping me accountable.
@lordfarquad-by1dq Před rokem ⁺¹
@@KapilSachdeva thank you again for the content, looking forward for more of these videos
@KapilSachdeva Před rokem ⁺¹
Still working on the next video; not yet happy with it hence not published yet.
@user-pf8px7iz3z Před rokem
new video when ?
@KapilSachdeva Před 11 měsíci
today ... very late sorry :(
@nayab.quteer Před rokem
Can you make the video in Urdu language
@KapilSachdeva Před rokem
There are urdu subtitles and may be that will be of some help!

Další v pořadí

Automatické přehrávání

GIoU vs DIoU vs CIoU | Losses | Essentials of Object Detection

GIoU vs DIoU vs CIoU | Losses | Essentials of Object Detection

What is RAG? (Retrieval Augmented Generation)

What is RAG? (Retrieval Augmented Generation)

73 - Image Segmentation using U-Net - Part1 (What is U-net?)

73 - Image Segmentation using U-Net - Part1 (What is U-net?)

AI: Giganti, horečka a konec světa | KOVY

AI: Giganti, horečka a konec světa | KOVY

Looks realistic #tiktok

Looks realistic #tiktok

Na koncertě v Praze jsme se s vámi rozdělily o Kubíky Waterrr Cool😍 V tom vedru dobrý nápad, ne?😁

Na koncertě v Praze jsme se s vámi rozdělily o Kubíky Waterrr Cool😍 V tom vedru dobrý nápad, ne?😁

Repeat 🥴🤣 LeoNata family #shorts

Repeat 🥴🤣 LeoNata family #shorts

Object Detection introduction and an overview | Essentials of Object Detection

Object Detection introduction and an overview | Essentials of Object Detection

Feature Pyramid Network for object detection

Feature Pyramid Network for object detection

Real time Kalman filter on an ESP32 and sensor fusion.

Real time Kalman filter on an ESP32 and sensor fusion.

Focal Loss for Dense Object Detection

Focal Loss for Dense Object Detection

2017 Feature Pyramid Network for Object Detection (FPN) paper summary

2017 Feature Pyramid Network for Object Detection (FPN) paper summary

Detection Head | Essentials of Object Detection

Detection Head | Essentials of Object Detection

Why Does Diffusion Work Better than Auto-Regression?

Why Does Diffusion Work Better than Auto-Regression?

Softmax (with Temperature) | Essentials of ML

Softmax (with Temperature) | Essentials of ML

Урна с айфонами!

Урна с айфонами!

Deep Cleaning and Fixing The DIRTIEST IPad 🤢🤮 #shorts #apple #ipad

Deep Cleaning and Fixing The DIRTIEST IPad 🤢🤮 #shorts #apple #ipad

100+ Linux Things you Need to Know

100+ Linux Things you Need to Know

Using Your phone in the Rain 💀.

Using Your phone in the Rain 💀.

#best PLAYSTATION CONSOLE #collection #shortvideos #gaming #foryou

#best PLAYSTATION CONSOLE #collection #shortvideos #gaming #foryou

This is the craziest notebook I’ve ever seen🤯

This is the craziest notebook I’ve ever seen🤯

Simple maintenance. #leddisplay #ledscreen #ledwall #ledmodule #ledinstallation

Simple maintenance. #leddisplay #ledscreen #ledwall #ledmodule #ledinstallation

Product Link in Bio ( # 1636 ) @MaviGadgets ✅ Smart Universal Magnetic Car Phone Holder

Product Link in Bio ( # 1636 ) @MaviGadgets ✅ Smart Universal Magnetic Car Phone Holder