YOLOv2 vs YOLOv3 vs Mask RCNN vs Deeplab Xception

Karol Majek

zhlédnutí 71 327

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 3. 04. 2018
YOLOv2: • 4K YOLO COCO Object De...
YOLOv3: • YOLOv3 - Object Detection
Mask RCNN: • Mask RCNN - COCO - in...
Deeplab Xception: • Tensorflow DeepLab v3 ...
Buy me a coffee: bit.ly/Coffee4KarolMajek
Input 4K video: bit.ly/4KDDPL_WAW2
Věda a technologie

Komentáře • 57

@DrTune Před 6 lety
thanks, appreciate you putting that together. amazing stuff
@bygreencn Před 6 lety ⁺³
perfect， thank you very much!
@fvzaur Před 6 lety
Thank you!
@calmandcode Před 5 lety
Good results. Thank you for posting. Is this real time detection?
@KarolMajek Před 5 lety
Look at fps (frames per second) 30fps and above is real-time
@user-ou2ch7nr4r Před 5 lety ⁺²
So, what is the best model for object detecting now?
@KarolMajek Před 5 lety ⁺⁴
Check this list: github.com/hoya012/deep_learning_object_detection
I will suggest YOLOv3 for general case and SSDlite Mobilenet v2 for lightweight detector (low power/smartphone)
@studentepsilon742 Před 5 lety ⁺²
Thank you for this very interesting video! Could you add FPS for Mask RCNN and YOLOv2, too? I think that there should be a big difference between Mask RCNN and YOLO in FPS.
@KarolMajek Před 5 lety
There is. Check: YOLACT - superfast instance segmentation
@zeegrey3447 Před 4 lety
what object detector will you suggest for a real time ?
@KarolMajek Před 4 lety
YOLOv3 (bounding boxes) /YOLACT (masks)
@Inspiration_video23 Před rokem
I am trying to run yolov5 model for vehicle Classification on jetson nano 2gb ram but it is not done . It requires the NUMBA JIT python library to install but I tried many times but it was not installed. Help me
@spectralcodec Před 5 lety
Do you know if this type of object detection is possible in real time on a GTX1080 @640x480/30fps? Mainly would like to detect vehicles and people for a robotics application. Thanks.
@KarolMajek Před 5 lety
Try YOLOv3!
@spectralcodec Před 5 lety
@@KarolMajek I will, thank you.
@KarolMajek Před 5 lety ⁺¹
It will give you very good results and is fast :-)
With 1080 you will get predictions online.
You can also try YOLOv3 trained on open Images v4, but trains on COCO can be enough for you
@spectralcodec Před 5 lety
@@KarolMajek Thanks, what do you mean by "predictions online"?
@KarolMajek Před 5 lety
30hz if your camera is 30fps
@ThanhPham-xz2yo Před 3 lety
Thanks for sharing!
@KarolMajek Před 3 lety
No problem
@ingenganu Před 6 lety ⁺⁴
In your opinion, which is better in these aspects:
- Speed to run on no powerful hardware at an acceptable speed for live processig (cheaper hardware is preferred)
- recognition of objects that occupy few pixels in the image
- Robust against false positives in open and complex environments, without much light.
Did you try any implementation of YOLO with tensorflow? Is it worth changing from Darknet to tensorFlow?
Greetings.
@ass8ash Před 6 lety ⁺²
Gastón Nuñez Real time detection is not possible in my opinion. Look at the fps rates. The closest one that I found good for smaller hardware (preferably mobile-like, R-pi chips) is MobileNet_SSD. My solution to this is to perform detections on intermediate frames and apply tracking to the detected objects since tracking is faster than detection. Check out github.com/apoorvavinod/Real_time_Object_detection_and_tracking
And Yeah, I’ve tried a TensorFlow version of Yolov2. It’s also included in GitHub. I found the DarkNet version to be faster than TensorFlow.
@ass8ash Před 6 lety
There are plenty of implementations of YOLO v3 in Tensorflow.. I think you meant OpenCV. It doesn't support some of the new layers introduced in YOLO v3
@planktonfun1 Před 6 lety
you can always pipe it to the cloud that way the load is in the cloud that way it can run on cheap hardware, you only need a stable internet connection
@igor-fk3vv Před 5 lety ⁺¹
1)Yolo (You only look once) is build for live processing.(use darknet implementation)
2) it's called bounding boxe, you predict them with YOLO( an example are on kaggle the name of this kernel is "yolov3-rsna-starting-notebook "
3) To improve the robustesse, use image augmentation
@videxjr Před 5 lety
Use Alexey's implementation of darknet (GitHub). For small and very fast network I would suggest tiny YoloV3, I am achieving 60fps at 1060 (6GB, but it takes 1-2GB) when images are resized to 608x608. For small details increasing input resolution might help (and different augmentations) and changing the anchors accordingly.
The speed of tiny Yolo comes at the cost of some more false positives, and in general lower mAP, but that can be improved by proper training.
@abirreza7183 Před 3 lety
In 2020 google introduced Panoptic-DeepLab , which gives better performance for autonomous driving.
@tanle5863 Před 6 lety
Wonderful. What’s the tracking method you use with these demo?
@KarolMajek Před 6 lety ⁺³
There's no tracking at all
@weirjwerijrweurhuewhr588 Před 4 lety
@@KarolMajek How do you add tracking to the results of a YOLO3 model? I found out about DeepSort, but can't find many good tutorials on that.
@KarolMajek Před 4 lety
@@weirjwerijrweurhuewhr588 one more time - there's no tracking in this video.
If you want tracking check IoU tracker or Nvidia DeepStream which is ultra fast
@bear79426 Před 5 lety
I have a question why fps is very low but the frame is very smooth, how is this done?
@KarolMajek Před 5 lety
Video is processed frame by frame and then output is exported as video 30fps
@josaxytube Před 5 lety
I guess this is not a real time result.
@KarolMajek Před 5 lety
Real time = 30fps, anything below is too slow
@KarolMajek Před 5 lety
Small model, TensorRT, Titan RTX
@KarolMajek Před 5 lety
I think it's not that easy, but in general many problems can be solved by putting money
@u00xclub Před 5 lety ⁺¹
Mask RCNN is probably the best looking, it might not be the best, but good looking to my eyes..
@KarolMajek Před 5 lety
Check DeepLabv3 inplace abn
@wasimnadaf11 Před 5 lety
I think mask R-CNN is more promising and robust here, it effectively mask out every object in the scene whereas deeplab xception was lacking in some frames
@whatohyou01 Před 4 lety
I think mask R-CNN is precompiled? I ran mask R-CNN on 1 picture of dog and it took 1min 25sec to detect it along with 3 other objects.
@KarolMajek Před 4 lety
It's no online/reLtime method. Here it was computed frame by frame offline and then merged into a video.
If you are looking for online method check YOLACT
@june9713 Před 5 lety
Jesus.. dont question driectly!! first, look at the video!! there is fps, gpu and many of informations!! crazy!!
@mustafagunay5739 Před 5 lety
Well, only for the right side ones. But nobody actually triggered about it, they asked it nicely.
@KarolMajek Před 5 lety
No problem, at all
@mustafagunay5739 Před 5 lety ⁺¹
Thanks man, i appreciate your work.
@jepark6586 Před 4 lety
왜 혼자 급발진하누 ㅋㅋㅋ
@targetroot211 Před 3 lety
Polska to jest jednak potęga widze w te sieci neuronowe
@KarolMajek Před 3 lety
Bynajmniej! Wiele jeszcze przed nami
@targetroot211 Před 3 lety ⁺¹
@@KarolMajek Nieno żarty żartami ale ostatnio przeczytałem że Polacy mają jednych z najlepszych specjalistów w tej dziedzinie. Problem chyba bardziej z ich wykorzystaniem pewnie.
@KarolMajek Před 3 lety
@@targetroot211 w Polsce przede wszystkim wypracowujemy technologię za granicę. Możliwości lokalne, także w instytutach badawczych są mocno ograniczone niestety.
@stevenhong7099 Před 3 lety
Thank you!

Další v pořadí

Automatické přehrávání