What is Back Propagation
Vložit
- čas přidán 21. 06. 2023
- Learn about watsonx→ ibm.biz/BdyEjK
Neural networks are great for predictive modeling - everything from stock trends to language translations. But what if the answer is wrong, how do they “learn” to do better? Martin Keen explains that during a process called backward propagation, the generated output is compared to the expected output, and then the error contributed by each neuron (or “node”) is examined. By adjusting the node’s weights and biases, error is reduced and thus the overall accuracy improved.
Get started for free on IBM Cloud → ibm.biz/sign-up-now
Subscribe to see more videos like this in the future → ibm.biz/subscribe-now
Back propagation to neural networks is what negative feedback is to closed loop systems. The understanding come pretty much naturally to the people which studied automation and control engineering.
However - many articles tend to mix thing up. In this case back propagation and gradient descent. Back propagation is the process of passing the error back through the layers and using it to recalculate the weights. Gradient descent is the algorithm used for recalculation. There are other algorithms for recalculation of the weights.
thanks for getting me through my exams cause the script from my professor helps literally nothing in understanding deep learning. Cheers mate
Thanks for this Great explanation MARK 😃
Brilliantly simplified explanation for a fairly complex topic. Thanks, Martin!
marvin u mean, smh
I can't get enough of you brilliant videos. Thank you for making what it seemed to me before as complicated easy to understand . Could you please post a video about loss functions and gradient decent?
What did he even taught actually?
Lovely man..... excellent videos..all complexities eliminated. thanks a lot 😊
The gradient is passed backward using the chain rule from calculus. The gradient is just a multivariable form of the derivative. It is an actual numerical quantity for each "atomic" part of the network; usually a neuron's weights and bias.
Very well explained 🎉
Thank you for the information.
Could you please tell if the the BP is only available and applicable for Supervised models, as we have to have a pre computed result to compare against!!
Certainly, unsupervised models could also use this theoretically but does / could it effect in a positive way?
Additionally how the comparison actually performed?
Especially for the information that can't be quantised !
Awesome.....awesome superb explanation sir
Remember the chain rule that you learned in high school.Well that’s what is used in Backpropogation
Great video! 😊
Thanks Mardnin!
A great video!
thanks marvin
Thank you!
Hi Marklin!
Thank you for such a great explanation.
(btw, I know your name is Martin. 😂 )
Off topic: what drawing board setup do these IBM videos use ?
it's really great.
I'd like to know, too.
I find it ,he is drawing on the Glass
Isn't Back Propagation used to lower the computation needed to adjust the weights? I understand that doing so in a "forward" fashion is much more expensive than in a "backward" fashion?
When doing the Loss Function hove is the "Correct" output given? Is it training data and the compared an other data file with desired outcomes? In the example of "Martin" how does the neural network get to know that your name was not Mark?
5:36 Actually, convergence is does not necessarily mean the network is able to do its task reliably. It just means that its reliability has reached a plateau. We hope that the plateau is high, i.e. that the network does a good job of predicting the right outputs. For many applications, NNs are currently able to reach a good level of performance. But in general, what is optimal is not always very good. For example, a network with just 1 layer of 2 nodes is not going to be successful at handwriting recognition, even if its model converges.
Hi, you seem to have good knowledge on this, can I ask you a question please. Do you know if neural networks will be good for recognizing handwritten math expressions? (digits, operators, variables, all elements seperated to be recognized individually). I need a program that would do that and I tried a neural network, it is good for images from dataset but terrible for stuff from outside the dataset. Would you have any tips? I would be really greatful
In IBM we trust ✊😔
why you didn't explain how the network updates the weight
Wait ... the video is online for four hours ... and still no question how he manages to write mirrored?
There’s a separate video for that
🤦♂
Ha, that's so true. Here you go: ibm.biz/write-backwards
Just flip the video horizontally
Thanks still I need to understand how technically does it happen
It’s done by calculating the derivatives of the y hats with respect to the weights, and the function done backwards in the network applying the chain rule of calculus
ANY CHANCE TO GIVE 1000 LIKES ???😩