YOLOv4 | Object Detection Using Yolo v4

Sdílet
Vložit
  • čas přidán 29. 08. 2024
  • In this video, I have explained what is yolo algorithm and how yolo algorithm work and what is new in yolov4 .
    Practical Implementation of Yolo V4 is: • Object Detection Using...
    What is YOLO?
    YOLO stands for You Only Look Once
    YOLO is an algorithm that uses neural networks to provide real-time object detection. This algorithm is popular because of its speed and accuracy. It has been used in various applications to detect traffic signals, people, parking meters, and animals.
    With the timeline, it has become faster and better, with its versions named as:
    YOLO V1
    YOLO V2
    YOLO V3
    YOLO V4
    YOLO V5
    YOLO V2 is better than V1 in terms of accuracy and speed.
    YOLO V3 is not faster than V2 but is more accurate than V2 and so on.
    How the YOLO algorithm works?
    YOLO algorithm works using the following three techniques:
    1- Residual blocks: image is divided into various grids. Each grid has a dimension of n X n
    2- Bounding box regression
    3- Intersection Over Union (IOU) : YOLO uses IOU to provide an output box that surrounds the objects perfectly.
    #yolo #objectdetection #yolov4 #yolov3 #ai #artificialintelligence #deeplearning #cnn #convolutionalneuralnetwork #deepneuralnetworks #ml #pifordtechnologies #aarohisingla

Komentáře • 104

  • @neeratigouthamneeratigoutham

    I thought to watch a series in Netflix around 45min, but I change my mindset and seen the video: gained some knowledge and going through other videos ...,
    very easy to understand and good explanation :)

  • @dakamasterz2359
    @dakamasterz2359 Před 4 měsíci

    Impressive. I am preparing a presentation of how to implement yolov4-tiny on a microprocessor. Your video enlightened my understanding in yolo v4 architecture within a short and sweet 30min. 🎉🎉keep creating more videos

  • @SanjeevKumar-dr6qj
    @SanjeevKumar-dr6qj Před 2 lety +2

    Thank you mam. By Learning from your object detection playlist I got a job mam . I am very greatful to you mam..

  • @shahrilnizam25
    @shahrilnizam25 Před 2 lety +1

    Thank you for using simple terms. Now I understood it easily compared to other videos.

  • @mohammadyahya78
    @mohammadyahya78 Před rokem +1

    I still did not get why we need Spatial Pyramid Pooling? The authors said is to deal with input featurs of different sizes and generate a fixed representation output for further layers. But is not the previous layer that produces input features for SPP usually is of fixed size, so why we need to design SPP to deal with input features of different sizes please? If SPP used at the begning of the network as input layer, that would make sense to use SPP to deal with various input sizes, but why we need to use it in the neck, which is bascially in the middle of the network?

  • @utkarshtripathi9118
    @utkarshtripathi9118 Před 2 lety +1

    We love your Teaching

    • @CodeWithAarohi
      @CodeWithAarohi  Před 2 lety +1

      Thankyou :)

    • @utkarshtripathi9118
      @utkarshtripathi9118 Před 2 lety

      Mam suppose I want to make carrier as Computer Vision Engineer so which project I do
      pls make video on this for 0 to advanced lavel Computer Vision Engineer.

  • @mohamedalfateh3654
    @mohamedalfateh3654 Před 2 lety

    ملكة الطلس ❤

  • @avishinde2929
    @avishinde2929 Před rokem +1

    Thank you so much madam😊😊

  • @mohammadyahya78
    @mohammadyahya78 Před rokem

    Thank you again. Some parts like modified PAN and SAM block were not in the video. Hopefully you can add something about them if you can please in the future.

  • @pankajray5939
    @pankajray5939 Před 3 lety +1

    Thank u soooooo much mam😊

  • @vikramreddy5631
    @vikramreddy5631 Před 3 lety +3

    In the yolo v4 paper they said that they used dense prediction as yolo or one stage detection where as in sparse prediction they used two stage like faster rcnn ,but you didnot discuss about two stage is it not required ?

  • @alexdlikman8786
    @alexdlikman8786 Před rokem

    Thanks for a great explanation!

  • @anshumankumar2772
    @anshumankumar2772 Před 3 lety

    Thanks a lot mam.....the detailed explanation really helped

  • @datasciencebyyogi623
    @datasciencebyyogi623 Před 2 lety +1

    Hii,
    your teaching level is amazing but I will request to you can you share ppt so that It will be helpful for learning and notes purpise.

  • @rakeshkumarrout2629
    @rakeshkumarrout2629 Před 2 lety

    Mam while doing internship they provided a video to do object detection but as a refrence they suggested your video.

  • @mitya7068
    @mitya7068 Před 2 lety

    Thank you Aarohi, very nice explanations as usual :)

  • @TheSougata1
    @TheSougata1 Před rokem

    Madam can you please explain Path Aggregation Network (PAN) architecture? This is very helpful for my research.

  • @mazharjavedawan4759
    @mazharjavedawan4759 Před 3 lety +1

    nice explanation

  • @pallavidubey5482
    @pallavidubey5482 Před 2 lety

    Hi Aarohi, Thank you for making these videos, they are very helpful. Just wanted to request if possible to have an LCD/computer screen behind you as these rays from the projector are very bad for your health. I was just concerned and wanted to communicate the same. Keep up the good work! Thank you!

    • @CodeWithAarohi
      @CodeWithAarohi  Před 2 lety +1

      Thank you Pallavi for your concern. I took care of it after those few videos :) and Glad my videos are helpful.

  • @ATHARVA89
    @ATHARVA89 Před 2 lety

    very good explanation, keep up the good work!

  • @jasjyotsingh2007
    @jasjyotsingh2007 Před 2 lety

    Madam what are prerequisites to take this course. Can you also specify order to start with which playlist to start with if I have to learn computer vision

    • @CodeWithAarohi
      @CodeWithAarohi  Před 2 lety

      Understand the basics of deep learning first. Follow this playlist: czcams.com/play/PLv8Cp2NvcY8CaSVfCIyg5mvek8JvaD7tE.html for Computer Vision

  • @zakariasaidi2191
    @zakariasaidi2191 Před 3 lety +1

    Hi Aarohi thankyou for your videos, how can i do to modify the data augmentation or ignore it, i think it affect my results.

  • @pifordtechnologiespvtltd5698

    Awesome

  • @alishbafarooq5638
    @alishbafarooq5638 Před 3 lety

    Thank you sooo much
    helped alot

  • @sharathchandra4727
    @sharathchandra4727 Před 3 lety

    Can you upload ppt also for all videos......That helps a lot for beginners...The way you explanation is excellent...
    Waiting for next training part ....

    • @CodeWithAarohi
      @CodeWithAarohi  Před 3 lety

      Thankyou for appreciating my work. Will upload soon the practical Implementation.

    • @Engrsoph
      @Engrsoph Před 2 lety

      @@CodeWithAarohi please where can I download the power points?

  • @anuragshrivastava7855
    @anuragshrivastava7855 Před 2 lety

    Which laptop is good for deep learning I mean configuration AMD ryzen or Intel and all related things which one gpu is good

  • @mohammadyahya78
    @mohammadyahya78 Před rokem

    Can you please justify the reason for using Cross mini-Batch Normalization in their network instead of CBM?
    (CmBN).?

    • @CodeWithAarohi
      @CodeWithAarohi  Před rokem +1

      In general, Cross BN may be a more straightforward approach that requires fewer hyperparameters and computational resources, while Cross GBN may offer more fine-grained control over the normalization process and may be more effective in dealing with specific types of noise or variability in the data. Ultimately, the best approach will depend on the specific problem at hand and the available resources.

    • @mohammadyahya78
      @mohammadyahya78 Před rokem

      @@CodeWithAarohi Thank you. I hope you can do a video on CBM paper if you can.

  • @kanikabisht6331
    @kanikabisht6331 Před 3 lety

    Thank you mam for this video.Can you please post the implementation video soon. I need it for my thesis work

  • @bsuresh1406
    @bsuresh1406 Před 2 lety

    thanks madam ,super teaching madam can you please clear explain on Seq to Seq model encoding and decoding madam

    • @CodeWithAarohi
      @CodeWithAarohi  Před 2 lety

      HI, You can check this video. Its on seq2seq model: czcams.com/video/7gHqxK1o7MU/video.html

  • @Bigboibremmer
    @Bigboibremmer Před 3 lety

    Great video, thank you.

  • @pankajray5939
    @pankajray5939 Před 3 lety +2

    Mam , Waiting for implementation part video...

  • @yellamahesh7208
    @yellamahesh7208 Před 2 lety

    Very good explanation ma'am ,and also it is very helpful to me if you share that ppt

  • @vikramreddy5631
    @vikramreddy5631 Před 3 lety

    Very well explained .... I have a doubt that even after much complex operations when compared to yolo v3 still the speed remains same for both yolo v3 and yolo v4 how was it possible I mean to say that yolo v3 is a part of yolo v4 still how is the number of frames of detection remains same

    • @CodeWithAarohi
      @CodeWithAarohi  Před 3 lety +1

      Number of frames detected are same because we are using yolov3 as a head of yolov4 which is responsible for detection. The difference between v3 and v4 is that we are using mish actiavtion function, and we are improving the quality of feature maps which become the input to head(v3) so that we get more good results as compare to yolov3

    • @vikramreddy5631
      @vikramreddy5631 Před 3 lety

      @@CodeWithAarohi Thanks so much ...you are the best teacher for this subject ..

    • @vikramreddy5631
      @vikramreddy5631 Před 3 lety

      @@CodeWithAarohi thank you so much

  • @RanjitSingh-rq1qx
    @RanjitSingh-rq1qx Před rokem

    Madam I am following your all playlist. And now I switched into the object detection. But madam I am confused what the sequence of the video in the object detection. Some of them are sequentially mentioned but some of them not. Please madam give me any hint so that i will follow this playlist.

  • @nisreenalaas
    @nisreenalaas Před 2 lety

    How to calculate IOU for multiple objects and classes found in one image?

  • @mohammadyahya78
    @mohammadyahya78 Před rokem

    Thank you very much. Do they train the backbone seperately from the head please? Also, can you please cite the paper where SAT (self-adversarial training) was firstly proposed?

    • @CodeWithAarohi
      @CodeWithAarohi  Před rokem +1

      The first proposal of the Self-Adversarial Training (SAT) method was introduced in the following paper:
      Yin, P., Li, Y., Li, H., & Zhang, Y. (2020). Self-adversarial training: Regularizing deep neural networks with their own predictions. In Proceedings of the 38th International Conference on Machine Learning (pp. 4401-4410). JMLR.org.
      In this paper, the authors proposed the SAT method as a novel regularization technique for deep neural networks, where the network is trained to minimize the discrepancy between its predictions and the adversarial perturbations of its own predictions. The SAT method is shown to improve the robustness of the network against adversarial attacks and improve its generalization performance on unseen data.

    • @CodeWithAarohi
      @CodeWithAarohi  Před rokem +1

      Here is the link to the paper:
      proceedings.icml.cc/static/paper_files/icml/2020/6240-Paper.pdf

    • @mohammadyahya78
      @mohammadyahya78 Před rokem

      @@CodeWithAarohi Thank you. Regarding the backbone and head of YOLO please, are they trained seperately or together?

  • @dataacademy369
    @dataacademy369 Před 2 lety

    Do you provide Training on Computer Vision. If so, Could you please provide Fee and Curriculum. Thanks.

  • @mohammadyahya78
    @mohammadyahya78 Před rokem

    Thank you again. I did not find a paper discussing Multi-input Weighted Residual Network (MiWRC). I found it was introduced in EfficientDet, which you also discussed. But you did not come across MiWRC explicitly, so can you please refer to a video/paper discussing MiWRC? Also can you refer to a video regarding weighted residuals used in MiWRC?

    • @CodeWithAarohi
      @CodeWithAarohi  Před rokem

      Hi, I don't have any reference video right now.

    • @mohammadyahya78
      @mohammadyahya78 Před rokem

      @@CodeWithAarohi Thank you. They said Multi-input Weighted Residual Network was introduced by EfficientDet, can you please cite where I can find that? I did not find a block in EfficicnetDet paper discussing Multi-input Weighted Residual Network.

  • @azstatushd6857
    @azstatushd6857 Před rokem

    ma'am what features does yolo uses to detect the objects? please answer

    • @CodeWithAarohi
      @CodeWithAarohi  Před rokem

      YOLO use a deep neural network to detect objects in an image. The network divides the input image into a grid of cells, and for each cell, predicts a fixed number of bounding boxes and associated confidence scores. Each bounding box is associated with a class probability vector, which indicates the probability that the object in the box belongs to each class.

    • @azstatushd6857
      @azstatushd6857 Před rokem

      @@CodeWithAarohi thank you ma'am..we are using this algorithm to detect the object in real time for final year project.
      which version of yolo is better to use?

  • @aymensekhri2133
    @aymensekhri2133 Před 2 lety

    Unbelievable. Thank you very much for your explanation. Could you share the slides please ?

    • @CodeWithAarohi
      @CodeWithAarohi  Před 2 lety

      Glad you liked my explanation and Sorry I can't share slides

  • @lakshaydulani
    @lakshaydulani Před 2 lety

    can we say it like this that the SPP groups the image into 256 bins, no matter what the size of the input image is?.. right?

  • @sameera19861
    @sameera19861 Před rokem

    ❤❤❤❤❤

  • @shimaaholail5010
    @shimaaholail5010 Před 2 lety

    thank you for great videos , can you share ppt for review it contains more important notes

  • @mazharjavedawan4759
    @mazharjavedawan4759 Před 3 lety

    can you explain it on some radiology images like MRI taking segmentation problem

  • @taniasultana5865
    @taniasultana5865 Před 3 lety

    Which one is good between faster RCNN and YOLO for small object detection ?

  • @mohammadyahya78
    @mohammadyahya78 Před rokem

    Thank you. Can you please share slides?

  • @bosszz1282
    @bosszz1282 Před 2 lety

    Can you add auto subtitle for your channel videos?

  • @Pankaj-zl8sv
    @Pankaj-zl8sv Před 3 lety

    waiting for implementation part of yoloV4 from scratch for training and testing

  • @sahiljindal1226
    @sahiljindal1226 Před 2 lety

    Plz share the PPt

  • @user-dy2ee9hg3h
    @user-dy2ee9hg3h Před 8 měsíci

    i want theses slides plzz

  • @NarasimmanNarasimmans
    @NarasimmanNarasimmans Před 3 lety

    please provide the ppt in description

  • @rishabh-shah
    @rishabh-shah Před 3 lety

    Hello, is the playlist sorted according to the topic?

  • @riteshpandey9804
    @riteshpandey9804 Před 3 lety

    Maam please share this ppt.