C4W3L01 Object Localization

Sdílet
Vložit
  • čas přidán 29. 08. 2024
  • Take the Deep Learning Specialization: bit.ly/2IpmuHg
    Check out all our courses: www.deeplearni...
    Subscribe to The Batch, our weekly newsletter: www.deeplearni...
    Follow us:
    Twitter: / deeplearningai_
    Facebook: / deeplearninghq
    Linkedin: / deeplearningai

Komentáře • 59

  • @submagr
    @submagr Před 4 lety +8

    As soon as I find a video by Prof Andrew on a topic I am looking for, I know this topic is so done for good.
    Thanks, Prof for these wonderful lectures. I can't be enough grateful.

    • @oscarcarson6215
      @oscarcarson6215 Před 3 lety

      pro trick : you can watch series on Flixzone. Been using it for watching lots of of movies lately.

    • @landynbyron5728
      @landynbyron5728 Před 3 lety

      @Oscar Carson yup, been watching on Flixzone} for since december myself :)

    • @kendrickkillian2669
      @kendrickkillian2669 Před 3 lety

      @Oscar Carson Definitely, I've been using flixzone} for since november myself :D

    • @davionkye9161
      @davionkye9161 Před 3 lety

      @Oscar Carson yea, I have been using flixzone} for months myself =)

    • @kingbilly7986
      @kingbilly7986 Před 3 lety

      @Oscar Carson Definitely, I've been watching on Flixzone} for years myself :D

  • @morsemo6966
    @morsemo6966 Před 5 lety +5

    I am a total beginner(even for python).
    I couldn't understand the courses here one month ago. Then I took about 1 month to go around and try most of the popular algorithms examples (with GPU linux server). Then I come back and watch the courses. Now I could be more confident to continue the course journey with Andrew.

    • @lokeshbatthula7306
      @lokeshbatthula7306 Před 5 měsíci

      Hello brother could you kindly share the resources you learned before taking this course

  • @dineshbh3837
    @dineshbh3837 Před 3 lety

    for object detection usually use 'blob ' , contors to separate objects from background and classify that slits

  • @constructor365
    @constructor365 Před 6 lety +5

    Since the loss function for the case when y1 = 0 includes only one term in contrast to the case when y1 = 0, isn't it kind of encouraging the network to predict that there is no object(background) over the other cases?

    • @QuanNguyen-vo2xh
      @QuanNguyen-vo2xh Před 4 lety

      Not really. When calculating the loss, it only ignores bx, by, bh, bw, c if the ground-truth value (y) = 0, not when your prediction (y hat) = 0. So if your model tries to predict y hat = 0 more, the loss function will still consider and compare your predicted bx, by, bh, bw, c with their ground-truth values even though y hat = 0.

  • @haroldsu1696
    @haroldsu1696 Před 6 lety +11

    really awesome lecture!

  • @valentino8625
    @valentino8625 Před 2 lety

    Clear and Simple!!! Awesone lecture

  • @ElectricCircuitsLAB-ProfHazemA

    Thnaks a lot for useful and easy presentation

  • @user-vo9ov4dh8b
    @user-vo9ov4dh8b Před 4 lety +1

    In last part of this video, he said we can use softmax, squared error, logistic regression loss. if I use that, I think there will be three different type loss. And then how should I do back prop? Just calculate loss matched each output neuron's loss fucntion?

    • @abubakarali6399
      @abubakarali6399 Před 3 lety

      Have you find the answer?

    • @user-vo9ov4dh8b
      @user-vo9ov4dh8b Před 3 lety

      @@abubakarali6399 I found that yolo outputs multiple tensors. In other word, in last part in yolo there is 3 different type layers (may be more) which is p, box positions, clasees

  • @harshitbad
    @harshitbad Před 4 lety +2

    Waah chacha kya tutorial diye hain!!

  • @TankNSSpank
    @TankNSSpank Před 6 lety +3

    Great introduction.

  • @harishkumaranandan5946
    @harishkumaranandan5946 Před 4 lety +1

    Hi everyone I am confused about the bbox part. How does the feature vector stack in final FC layer spit out some arbitrary 4 number that are bbox parameters even before backpropagation and L2 loss part. Are these initial bbox co-ordinates the one of the feature vector that has the object in it? Lets say in the case of localizing a car the final FC layer before softmax will have learnt high level features like car wheels, windshield etc and at the end these are stacked. Having said the bbox of the whole car will be different than the bbox of high level features like wheel, windshield etc. I am confused in this part of predicting the initial bbox of the whole car even though it might not be accurate initially bathos does the bbox of high level feature vector match the bbox of whole object. correct me if i were wrong somewhere.

  • @tamerzah
    @tamerzah Před 3 lety

    Nice and clear explanation

  • @RahulMahajanGoogle
    @RahulMahajanGoogle Před 6 lety +3

    This is very good information & helpful. Thanks.

  • @darinhitchings7104
    @darinhitchings7104 Před 2 lety

    this is super good content thanks so much

  • @sudarsansudar2120
    @sudarsansudar2120 Před 5 lety +1

    What will be bx,by,bh,bw value in the output vector if multiple classes are present in the picture.

    • @QuanNguyen-vo2xh
      @QuanNguyen-vo2xh Před 4 lety

      This video is more about the basic idea of how people encode the training data so he only talks about the case of 1 class in each image. You will have to duplicate this array to support the multiple classes. He talks more about that in the following videos of the course.

    • @Nishchay-fk5lr
      @Nishchay-fk5lr Před 6 měsíci

      see in next video the next problem is that onlyy!!!!!

  • @iiirannn1
    @iiirannn1 Před 5 lety

    great lecture

  • @sandipansarkar9211
    @sandipansarkar9211 Před 3 lety

    nice explanation .need to watch agaian

  • @computer-sci8457
    @computer-sci8457 Před 4 lety

    bounding box data as input, while training a model is give after convolution operation , am I right ?, I have little confusion . :)

  • @praleen_
    @praleen_ Před 3 lety

    Hi! I am wondering why the background is not included in the vectors!

    • @bright5967
      @bright5967 Před 3 lety +1

      I looked at it as just training with 3 classes and if it cant detect any of them, then it's a background

  • @ssverma80
    @ssverma80 Před 4 lety

    awesome sir

  • @keweml3544
    @keweml3544 Před 3 lety

    Awesome!

  • @seroshmannan
    @seroshmannan Před 2 lety

    wonderful

  • @saikrishnadyavarasetti7833

    If there 'n' of objects in an image, then how the softmax output will be? will be same [pc, bx, by, bw,bh,c1,c2,c3]? How the output will be?

    • @raviiit6415
      @raviiit6415 Před 5 lety

      c1,c2.. extended to the no. of classes

  • @jasmineshaik4371
    @jasmineshaik4371 Před 4 lety

    If there is a pedestrian and car in a frame ??? Is it applicable

  • @skaterope
    @skaterope Před 4 lety

    makes sense

  • @priyankasn4709
    @priyankasn4709 Před 6 lety +1

    how we have given input image to each neuron

    • @dbzkidkev2
      @dbzkidkev2 Před 6 lety

      you feed in pixels to the input neurons

    • @RobertLugg
      @RobertLugg Před 6 lety

      Hi Priyanka, this is a good lecture, but I suggest you start with some that are more fundamental. Here is a nice video: czcams.com/video/2-Ol7ZB0MmU/video.html I really like the way Luis explains things so you may start with a few of them first.

  • @AbhishekSinghSambyal
    @AbhishekSinghSambyal Před 5 lety

    Can you name any training set which has the same classes and bounding boxes values to try this approach?

    • @Nishchay-fk5lr
      @Nishchay-fk5lr Před 6 měsíci

      literally all sensors lidar, radar all of them use this approach!!!

  • @swapnilgautam5252
    @swapnilgautam5252 Před 3 lety

    but how do we get bx and by value ?

    • @EranM
      @EranM Před 2 lety

      pixle of middle object / max pixel, both for x and y

  • @faridalijani1578
    @faridalijani1578 Před 4 lety

    how could one program "don't care" as an output of an image which contains no object?

    • @francoisplessier9913
      @francoisplessier9913 Před 4 lety +1

      Pay attention at 8:59 . When pc_train==0, the loss function is calculated differently: only the object prediction pc_pred is used. So it really "doesn't care" about the values of bw_pred, bh_pred, etc... as there are not in the formula!

    • @valentinfontanger4962
      @valentinfontanger4962 Před 3 lety

      in the formula, "don't care" is defined as the "1| _oobj"

  • @coolamigo00
    @coolamigo00 Před 5 lety +1

    could Pc or C2 be between 0 and 1

    • @robinmanchanda2884
      @robinmanchanda2884 Před 3 lety

      Pc will be the probability of having an object or not, so this neuron will work as logistics regression only, hence the output can be betwwen 0 and 1. For classes, the concept is almost same and it's like the output of softmax.

  • @guardrepresenter5099
    @guardrepresenter5099 Před 5 lety

    How pc variable know without calculating class labels.Because Andrew say if pc 0 the other variable dont care.But how pc know i am 0 or 1?????????

  • @muhammedbuyukknac2777
    @muhammedbuyukknac2777 Před 6 lety

    You can check out my repository over object localization for SINGLE object. It is a ready-to-run repository.
    github.com/MuhammedBuyukkinaci/Object-Classification-and-Localization-with-TensorFlow

  • @samuelmatheson9655
    @samuelmatheson9655 Před 3 lety

    Gonna make TCAS for blind people

  • @wtfJonKnowNothing
    @wtfJonKnowNothing Před 4 lety

    Everyone : Awesome ,very nice , great !!!
    Me : That's why he is Andrew