YOLOv2 vs YOLOv3 vs Mask RCNN vs Deeplab Xception

Sdílet
Vložit
  • čas přidán 3. 04. 2018
  • YOLOv2: • 4K YOLO COCO Object De...
    YOLOv3: • YOLOv3 - Object Detection
    Mask RCNN: • Mask RCNN - COCO - in...
    Deeplab Xception: • Tensorflow DeepLab v3 ...
    Buy me a coffee: bit.ly/Coffee4KarolMajek
    Input 4K video: bit.ly/4KDDPL_WAW2
  • Věda a technologie

Komentáře • 57

  • @DrTune
    @DrTune Před 6 lety

    thanks, appreciate you putting that together. amazing stuff

  • @bygreencn
    @bygreencn Před 6 lety +3

    perfect, thank you very much!

  • @fvzaur
    @fvzaur Před 6 lety

    Thank you!

  • @calmandcode
    @calmandcode Před 5 lety

    Good results. Thank you for posting. Is this real time detection?

    • @KarolMajek
      @KarolMajek  Před 5 lety

      Look at fps (frames per second) 30fps and above is real-time

  • @user-ou2ch7nr4r
    @user-ou2ch7nr4r Před 5 lety +2

    So, what is the best model for object detecting now?

    • @KarolMajek
      @KarolMajek  Před 5 lety +4

      Check this list: github.com/hoya012/deep_learning_object_detection
      I will suggest YOLOv3 for general case and SSDlite Mobilenet v2 for lightweight detector (low power/smartphone)

  • @studentepsilon742
    @studentepsilon742 Před 5 lety +2

    Thank you for this very interesting video! Could you add FPS for Mask RCNN and YOLOv2, too? I think that there should be a big difference between Mask RCNN and YOLO in FPS.

    • @KarolMajek
      @KarolMajek  Před 5 lety

      There is. Check: YOLACT - superfast instance segmentation

  • @zeegrey3447
    @zeegrey3447 Před 4 lety

    what object detector will you suggest for a real time ?

    • @KarolMajek
      @KarolMajek  Před 4 lety

      YOLOv3 (bounding boxes) /YOLACT (masks)

  • @Inspiration_video23
    @Inspiration_video23 Před rokem

    I am trying to run yolov5 model for vehicle Classification on jetson nano 2gb ram but it is not done . It requires the NUMBA JIT python library to install but I tried many times but it was not installed. Help me

  • @spectralcodec
    @spectralcodec Před 5 lety

    Do you know if this type of object detection is possible in real time on a GTX1080 @640x480/30fps? Mainly would like to detect vehicles and people for a robotics application. Thanks.

    • @KarolMajek
      @KarolMajek  Před 5 lety

      Try YOLOv3!

    • @spectralcodec
      @spectralcodec Před 5 lety

      @@KarolMajek I will, thank you.

    • @KarolMajek
      @KarolMajek  Před 5 lety +1

      It will give you very good results and is fast :-)
      With 1080 you will get predictions online.
      You can also try YOLOv3 trained on open Images v4, but trains on COCO can be enough for you

    • @spectralcodec
      @spectralcodec Před 5 lety

      ​@@KarolMajek Thanks, what do you mean by "predictions online"?

    • @KarolMajek
      @KarolMajek  Před 5 lety

      30hz if your camera is 30fps

  • @ThanhPham-xz2yo
    @ThanhPham-xz2yo Před 3 lety

    Thanks for sharing!

  • @ingenganu
    @ingenganu Před 6 lety +4

    In your opinion, which is better in these aspects:
    - Speed to run on no powerful hardware at an acceptable speed for live processig (cheaper hardware is preferred)
    - recognition of objects that occupy few pixels in the image
    - Robust against false positives in open and complex environments, without much light.
    Did you try any implementation of YOLO with tensorflow? Is it worth changing from Darknet to tensorFlow?
    Greetings.

    • @ass8ash
      @ass8ash Před 6 lety +2

      Gastón Nuñez Real time detection is not possible in my opinion. Look at the fps rates. The closest one that I found good for smaller hardware (preferably mobile-like, R-pi chips) is MobileNet_SSD. My solution to this is to perform detections on intermediate frames and apply tracking to the detected objects since tracking is faster than detection. Check out github.com/apoorvavinod/Real_time_Object_detection_and_tracking
      And Yeah, I’ve tried a TensorFlow version of Yolov2. It’s also included in GitHub. I found the DarkNet version to be faster than TensorFlow.

    • @ass8ash
      @ass8ash Před 6 lety

      There are plenty of implementations of YOLO v3 in Tensorflow.. I think you meant OpenCV. It doesn't support some of the new layers introduced in YOLO v3

    • @planktonfun1
      @planktonfun1 Před 6 lety

      you can always pipe it to the cloud that way the load is in the cloud that way it can run on cheap hardware, you only need a stable internet connection

    • @igor-fk3vv
      @igor-fk3vv Před 5 lety +1

      1)Yolo (You only look once) is build for live processing.(use darknet implementation)
      2) it's called bounding boxe, you predict them with YOLO( an example are on kaggle the name of this kernel is "yolov3-rsna-starting-notebook "
      3) To improve the robustesse, use image augmentation

    • @videxjr
      @videxjr Před 5 lety

      Use Alexey's implementation of darknet (GitHub). For small and very fast network I would suggest tiny YoloV3, I am achieving 60fps at 1060 (6GB, but it takes 1-2GB) when images are resized to 608x608. For small details increasing input resolution might help (and different augmentations) and changing the anchors accordingly.
      The speed of tiny Yolo comes at the cost of some more false positives, and in general lower mAP, but that can be improved by proper training.

  • @abirreza7183
    @abirreza7183 Před 3 lety

    In 2020 google introduced Panoptic-DeepLab , which gives better performance for autonomous driving.

  • @tanle5863
    @tanle5863 Před 6 lety

    Wonderful. What’s the tracking method you use with these demo?

    • @KarolMajek
      @KarolMajek  Před 6 lety +3

      There's no tracking at all

    • @weirjwerijrweurhuewhr588
      @weirjwerijrweurhuewhr588 Před 4 lety

      @@KarolMajek How do you add tracking to the results of a YOLO3 model? I found out about DeepSort, but can't find many good tutorials on that.

    • @KarolMajek
      @KarolMajek  Před 4 lety

      @@weirjwerijrweurhuewhr588 one more time - there's no tracking in this video.
      If you want tracking check IoU tracker or Nvidia DeepStream which is ultra fast

  • @bear79426
    @bear79426 Před 5 lety

    I have a question why fps is very low but the frame is very smooth, how is this done?

    • @KarolMajek
      @KarolMajek  Před 5 lety

      Video is processed frame by frame and then output is exported as video 30fps

    • @josaxytube
      @josaxytube Před 5 lety

      I guess this is not a real time result.

    • @KarolMajek
      @KarolMajek  Před 5 lety

      Real time = 30fps, anything below is too slow

    • @KarolMajek
      @KarolMajek  Před 5 lety

      Small model, TensorRT, Titan RTX

    • @KarolMajek
      @KarolMajek  Před 5 lety

      I think it's not that easy, but in general many problems can be solved by putting money

  • @u00xclub
    @u00xclub Před 5 lety +1

    Mask RCNN is probably the best looking, it might not be the best, but good looking to my eyes..

  • @wasimnadaf11
    @wasimnadaf11 Před 5 lety

    I think mask R-CNN is more promising and robust here, it effectively mask out every object in the scene whereas deeplab xception was lacking in some frames

    • @whatohyou01
      @whatohyou01 Před 4 lety

      I think mask R-CNN is precompiled? I ran mask R-CNN on 1 picture of dog and it took 1min 25sec to detect it along with 3 other objects.

    • @KarolMajek
      @KarolMajek  Před 4 lety

      It's no online/reLtime method. Here it was computed frame by frame offline and then merged into a video.
      If you are looking for online method check YOLACT

  • @june9713
    @june9713 Před 5 lety

    Jesus.. dont question driectly!! first, look at the video!! there is fps, gpu and many of informations!! crazy!!

    • @mustafagunay5739
      @mustafagunay5739 Před 5 lety

      Well, only for the right side ones. But nobody actually triggered about it, they asked it nicely.

    • @KarolMajek
      @KarolMajek  Před 5 lety

      No problem, at all

    • @mustafagunay5739
      @mustafagunay5739 Před 5 lety +1

      Thanks man, i appreciate your work.

    • @jepark6586
      @jepark6586 Před 4 lety

      왜 혼자 급발진하누 ㅋㅋㅋ

  • @targetroot211
    @targetroot211 Před 3 lety

    Polska to jest jednak potęga widze w te sieci neuronowe

    • @KarolMajek
      @KarolMajek  Před 3 lety

      Bynajmniej! Wiele jeszcze przed nami

    • @targetroot211
      @targetroot211 Před 3 lety +1

      @@KarolMajek Nieno żarty żartami ale ostatnio przeczytałem że Polacy mają jednych z najlepszych specjalistów w tej dziedzinie. Problem chyba bardziej z ich wykorzystaniem pewnie.

    • @KarolMajek
      @KarolMajek  Před 3 lety

      @@targetroot211 w Polsce przede wszystkim wypracowujemy technologię za granicę. Możliwości lokalne, także w instytutach badawczych są mocno ograniczone niestety.

  • @stevenhong7099
    @stevenhong7099 Před 3 lety

    Thank you!