BACKPROPAGATION algorithm. How does a neural network learn ? A step by step demonstration.

Defend Intelligence

zhlédnutí 64 610

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 29. 03. 2020
It is my first video in English I hope it is ok. I will start to do on my CZcams channel more expert video in English.
In this first video we details the backpropagation algorithm, really used in Deep Learning to train supervised neural network.
Instagram : / defend.intelligence
Twitter : / dfintelligence
The Blog Post of Matt Mazur : mattmazur.com/2015/03/17/a-st...
Věda a technologie

Komentáře • 81

@sawmill035 Před 2 lety ⁺¹⁷
I actually kind of laughed at around 1:38 when he was like "All you need to know is addition, subtraction, multiplication
...
and partial derivatives."
Lol, you really had all those 3rd graders in the first half, not gonna lie.
@Antagon666 Před 2 lety ⁺²
After watching your video I was finally able to derive the equation myself .
Thank you!
@Tomcatdude1 Před 2 lety ⁺¹⁴
For anyone in the future trying to look for how todo backprop easily:
For the first set of weights, we use -(expected - given) * (given * (1 - given)) * output of previous node. expected and given in respect to this video would be values using o2 or o1 (depending on which weight you're working on), and output of previous node in this would be output of j2 or j1 (again depending on which weight it's attached to). This is our final gradient, so we can multiply this by the learning rate and subtract that from the weight to get our updated weight.
For the rest of the weights in any hidden layer: We take the two (-(expected - given) * (given * (1 - given))) we just computed in step 1 and multiply them by the two weights they were used to update(so if we're updating w4 we use the two weights connected to j2). We then multiply this by (given * (1 - given)) for the given value after the activation function (so for w4 we'd use the output of j2 for given). Finally, we multiply this by the input the current weight is affecting (so for w4, we'd use i2). This is our final gradient, so we can multiply this by the learning rate and subtract that from the weight to get our updated weight.
@elyseekoulnodji4340 Před 2 lety
I spent one month reading books to understand how it works as I'm bigginer and thanks to that video I got the concept in 15mn. very good job, keep on
looking forward to learn more and apply it real world problem
@ammaramirkhan Před 2 lety ⁺¹
Thank you very much for this great video! I have watched a lot of videos before finally landing on this video. Unfortunately all others have seemed to just shy away from explaining the real math behind back-propagation. They just cover the basic idea or update the weight for the output layer only. This is the first video I have seen that explains the actual math in updating the hidden layers too.
@hosseinebrahimian9603 Před 3 lety
cette video ma bien aide a comprendre BP algorithm. merci a vous!
@mauriciobonetti8152 Před 2 lety
Thanks for this amazing video!
@sawomirslusarczyk5299 Před 2 lety ⁺³
in the Compute 02 line in the formula there should be w7 instead of w5 and w8 instead of w6. Regards Slawek
@Dhanush-zj7mf Před 3 lety ⁺¹⁴
I spotted two mistakes but please tell me if I am wrong . At 10:51 "dEo1/d(out o1)" should be "0.80-0" (which is the output_produced-desired_output) which evaulates to 0.80 and you wrote "-0.18" in that place so please once check it and tell me if I am wrong😊😊
@Coder-0 Před 8 měsíci
I believe you are both wrong you were on the right track though at 6:42 it says desired output-produced output (t-a).
@Dhanush-zj7mf Před 8 měsíci
@@Coder-0you forgot to apply chain rule. You r Diffententing wrt a. You have t-a inside so you should multiply by -1.
@alexandervega3463 Před 11 měsíci
good explanation helped to understand some inner details from a basic neural network.
@LionKimbro Před rokem
Great video! I wrote a Python project to carry out and visualize the manipulations, for learning, from this very video.
I think I noticed a mistake at 7:30 in the video -- I think you mean "O2" in the chain rule on the right, rather than "O1." But easy enough to account for. Again, thank you very much for this video! This is the most straight-forward description of how to apply back-propagation that I've found yet.
@LionKimbro Před rokem
8:55 -- Also, isn't the derivative of in(O2) just w8 itself?
@lhyd7hak Před 2 lety
Thanks for a helpful video.
@citoyennumero4434 Před 2 lety
Je ne parle pas anglais, mais étonnamment, j'ai compris ce que vous disiez !!
@DefendIntelligence Před 2 lety
Merci 😊
@123Shunde321 Před rokem
Super helpful, thanks 👍
@CARNEIRAUM Před 9 měsíci
Very nice!
@samiswilf Před 2 lety
Excellent video
@nasgaroth1 Před 3 lety ⁺⁵
There is at least one mistake, but as overall how its work is quite good presented. In sake of correctness you should check number once again. In w dIno2 / dW8 you wrote 0.61 but in dEtotal /dW8 you wrote 0.52. Best regards
@Antagon666 Před 2 lety
Nobody cares about the results, if the formula is correct
@FPChris Před 2 lety ⁺¹
I care about the results. I want to do it all on paper so I can confirm the results went I rewrite it to code.
@darshshah7155 Před 4 měsíci
yes exactly. also bugs me when the result i get are different than the video. @@FPChris
@camillagiuliani1109 Před rokem
Thanks!
@Emoups Před 3 lety ⁺²
Bonjour et un grand merci pour la Vidéo, je cherchais un example vulgarisé et c'est parfait.
Concernant les "Bias" est ce que l'on applique aussi une correction ou on ne s'occupe que des "weights" ?
@nopana_ Před rokem
C'est une vielle question, mais on doit appliquer aussi une correction sur les "biases" de ce que je sais car ils influent aussi le résultat de manière importante ^^- (edit: oui c'est très important d'apporter le changement sur les biases aussi)
@kritsaphongphuthibpaphaisi1509 Před rokem
This is great
@mircopal Před 3 lety
thank you
@gauravonkar6172 Před 2 lety ⁺²
9:06 There should be 0.61 instead of 0.52
@juliano3251 Před 2 lety
Nice video. More videos on english would be cool :)
@gabupouet4221 Před 4 lety ⁺¹⁹
C'est vraiment dommage, je suivais cette chaine pour le simple fait que c'était en français. Des trucs en anglais sur le sujet, il y en a par tonne.
@DefendIntelligence Před 4 lety ⁺²
Hello ! Pas de crainte, je vais continuer en français je voulais juste tenter l'expérience sur ce sujet précis :).
@gabupouet4221 Před 4 lety
@@DefendIntelligence , je vous en remercie beaucoup
@abdel8502 Před 3 lety
@@DefendIntelligence Et du coups, tu peux la refaire en français ?
@karimkondua1736 Před 3 lety
Ça fait travailler l anglais la prononciation est bien mais bon quand on part de loin c est vrai que le français 😅😅 c est plus pratique
@yusrahsumtally707 Před 9 měsíci
At 11:13, could you explain how did you get 0.16? just the values
@jotsinghbindra8317 Před 5 měsíci
NICE VIDEO BRO
@jobrufsite2818 Před 3 lety ⁺⁶
could you please explain why you did not update the biases and how the biases are updated in back propagation?
@nanobert2747 Před rokem
Please can you make a video on support vector machines and ROC AUC
@nafayurrehman4699 Před 7 měsíci
Why you didn't update biases?
@K9Megahertz Před rokem ⁺¹
At 8:10 I dont follow where the -1 came from. Anyone care to shed some light?
@theolaurent3716 Před 4 lety ⁺¹
petite question pour le learning rate qui est de 0,8 dans la formule, tu l'a choisi par défaut ou tu l'a calculé plus tôt?
@DefendIntelligence Před 4 lety
Hello Théo, Non je l'ai juste choisi de manière aléatoire pour démontrer l'intérêt de l'exercice. On verra dans les prochaines vidéos comment ajuster toutes ces variables précieuses en Deep Learning (nombre de layer, nombre de neurones, epochs, learning rate etc..)
@defaultdefault3995 Před 2 lety
In back propagation you didn't update the bias weights. Do they stay constant throughout the whole training?
@meobliganaponerunnom Před rokem
No. Biases are also parameters so they should be updated.
@drjarf Před 2 lety ⁺¹
The forward propagation is well explained but the backpropagation isn^t. The example has errors.
@darshshah7155 Před 4 měsíci
your (d out j2)/(d inp j2) value at 11:09 is wrong. See at 3:45 the value of sigmoid of j2 is 0.61. If you calculate 0.61(1 - 0.61) you will get 0.2379 instead of what you got 0.16. Please fix that. Its bugs me after calculating for so long my answer is not matching the answer in the video.
@theodelsol6498 Před 4 lety ⁺¹
Salut, je crois qu'il y a juste une petite erreur sur ta diapo lorsque tu récupères la dérivée partielle de Etotal par rapport à W8. OutJ2 est de 0,61 or dans le calcul il a la valeur de W8 soit 0,52. Peut-être une incompréhension de ma part, sinon super vidéo même en anglais ! :)
@DefendIntelligence Před 4 lety ⁺¹
Oui il y a une petite erreure :/ . Merci !
@geogeo14000 Před 3 lety
Génial c'est pile que je cherchais, les vidéos sont super propres et claires, en + du contenu en français qui + est ! bravo et merci
@geogeo14000 Před 3 lety
et un subscriber de gagné ofc ^^
@DefendIntelligence Před 3 lety ⁺¹
Merci beaucoup !! Et bienvenu sur la chaine :)
@faelslimane1699 Před 3 lety
Pourquoi lors du calcul du "nouveau poids" on multiplie la dérivée par 0.80? Merci
@noa4953 Před rokem
c'est le learning rate, ie la "force" avec laquelle on déplace les poids. Le gradient en lui-même n'est qu'une direction, on choisit arbitrairement cette valeur.
@hiramegl Před rokem
Great tutorial! I wonder how do you backpropagate for bias values b1 and b2. Great job!
@dmitrysakharnikov1358 Před 2 lety
Thanks for this video, but unfortunately unclear how to update bias values while training.
@FireBurn256 Před rokem ⁺¹
Bias values should not be upgraded. They are here just to push the ending results towards the okayish values for outputs.
It is the weights for biases that should be updated, and they are updated the same way the other links (I think).
@nemuccio1 Před 4 lety ⁺²
The numbers don't add up. From the graph :
(j1 = i1. w1 + i2.w2+b1)
'w2' corresponds to 0.13 and not 0.25.
0.25 appended to w3, as shown in the graph.
w5 is 0.67 and not 0.84! I have a lot of trouble understanding.
@DefendIntelligence Před 4 lety
Yes there is a mistake here. Consider the value in the formula :). Sorry about that.
@pctan9455 Před 2 lety
J1 = 0.5 but diagram show 0.4976
@tristeub997 Před 2 měsíci
Pourquoi la faire en anglais ??? T.T
@jeremyh9841 Před 10 měsíci
Je ne comprend rien c'est quoi e ?
@paultruffault7278 Před 2 lety
La partie simple du problème est longuement & bien expliqué , mais la backward propagation c'est vite fait mal fait. Comme si tu n'avais pas toi même compris la problématique. Tu m'as m'as plus induit en erreur qu'autre chose ...
@rossloubassou8755 Před 4 lety
mais ce n'était pas censé être une chaine en français?
@DefendIntelligence Před 4 lety ⁺¹
C'est l'unique vidéo en anglais :). Je voulais tenter une vidéo en anglais.
@rossloubassou8755 Před 4 lety
@@DefendIntelligence svp, je travaille depuis quelques temps grace à vous sur deep learning mais j'ai quelques soucis...comment je peux vous contacter directement?
@chqara Před rokem ⁺³
a lot of mistakes 👎
@patrickgerard5524 Před 3 lety
en français, je suivais mais en anglais c'est plus possible
@yvesdky6826 Před 3 lety ⁺¹
J'ai manqué un épisode ou quoi ?!
De l'anglais !!!
🤔Dois-je peut-être m'abonner aux chaînes de geek qui ont encore le Français comme langue de diffusion ?
@DefendIntelligence Před 3 lety ⁺¹
C’est la seule vidéo de la chaîne en anglais 😊😊
@yvesdky6826 Před 3 lety
ha ok j'ai eu peur 😅
@gedTech16 Před 2 lety
bjr , ta casser l'emniance avec l'anglais
@abdelobaid7681 Před 2 lety
There are mistakes in your calculations. Check again.
@alexandreivanov1417 Před 2 lety
mec j'ai rien compris je te jure
@MegaBaye Před 3 lety
non mais non ! ! le but de ce sujet est complètement rater.... y'a presque pas de vidéo en français sur le sujet et y'as en des milliard en anglais !!!! fait le en français mon ami
@DefendIntelligence Před 3 lety ⁺¹
Cest la seule en anglais tu m’excusera 😅
@MegaBaye Před 3 lety
@@DefendIntelligence bien alors ....en tout cas tu fais un travail super !!!! il manque juste le français !!!
@sgrimm7346 Před rokem
math is wrong i the first minute....this is useless
@lamineouldslimane9130 Před 2 lety
never make again a video in English plz
@renatomauro6300 Před rokem ⁺⁴
Why not? Because his english is not perfect? No, its isn't. But he makes a valuable video! I loved the video.

Další v pořadí

Automatické přehrávání

Coder un réseau de neurones convolutifs de classification d'image avec Python et Tensorflow.