Kolmogorov-Arnold Networks: MLP vs KAN, Math, B-Splines, Universal Approximation Theorem

Sdílet
Vložit
  • čas přidán 4. 06. 2024
  • In this video, I will be explaining Kolmogorov-Arnold Networks, a new type of network that was presented in the paper "KAN: Kolmogorov-Arnold Networks" by Liu et al.
    I will start the video by reviewing Multilayer Perceptrons, to show how the typical Linear layer works in a neural network. I will then introduce the concept of data fitting, which is necessary to understand Bézier Curves and then B-Splines.
    Before introducing Kolmogorov-Arnold Networks, I will also explain what is the Universal Approximation Theorem for Neural Networks and its equivalent for Kolmogorov-Arnold Networks called Kolmogorov-Arnold Representation Theorem.
    In the final part of the video, I will explain the structure of this new type of network, by deriving its structure step by step from the formula of the Kolmogorov-Arnold Representation Theorem, while comparing it with Multilayer Perceptrons at the same time.
    We will also explore some properties of this type of network, for example the easy interpretability and the possibility to perform continual learning.
    Paper: arxiv.org/abs/2404.19756
    Slides PDF: github.com/hkproj/kan-notes
    Chapters
    00:00:00 - Introduction
    00:01:10 - Multilayer Perceptron
    00:11:08 - Introduction to data fitting
    00:15:36 - Bézier Curves
    00:28:12 - B-Splines
    00:40:42 - Universal Approximation Theorem
    00:45:10 - Kolmogorov-Arnold Representation Theorem
    00:46:17 - Kolmogorov-Arnold Networks
    00:51:55 - MLP vs KAN
    00:55:20 - Learnable functions
    00:58:06 - Parameters count
    01:00:44 - Grid extension
    01:03:37 - Interpretability
    01:10:42 - Continual learning
  • Věda a technologie

Komentáře • 99

  • @josephamess1713
    @josephamess1713 Před 24 dny +68

    The fact this video is free is incredible

  • @AdmMusicc
    @AdmMusicc Před 7 dny +3

    You're on a mission to make the best and friendliest content to consume deep learning algorithms and I am all in for it.

  • @edsonjr6972
    @edsonjr6972 Před 24 dny +10

    Your videos are literally the only ones with 1hr+ I would ever watch on CZcams. Keep going mate, extremely high quality content 👏🏽👏🏽

  • @bensimonjoules4402
    @bensimonjoules4402 Před 2 dny +1

    Amazing content, thanks! I'm very excited about the continual learning properties of these networks.

  • @mohamedalansary2542
    @mohamedalansary2542 Před 24 dny +15

    Clearly explained and very valuable content as always Umar. Thank you!

  • @nokts3823
    @nokts3823 Před 23 dny +6

    Thanks a lot for making this accessible for people outside the field, for which reading and understanding these papers is quite tough. Thanks to you I'm able to stay slightly more up to date with the crazy quick developments in ML!

  • @xl0xl0xl0
    @xl0xl0xl0 Před 14 dny +4

    Wow this was a super clear an on-point explanation. Thank you, Umar.

  • @MrNathanShow
    @MrNathanShow Před 24 dny +3

    The intro of a basic linked up linear layers was so well done and really makes this introduction friendly!

  • @franciscote-lortie8680
    @franciscote-lortie8680 Před 19 dny +3

    Incredibly clear explanations, the flow of the video is also really smooth. It’s almost like you’re telling a story. Please keep making content!!

  • @goldentime11
    @goldentime11 Před 15 dny +2

    Thanks Umar for such a wonderful tutorial! I've been eyeing this paper for a while!

  • @andreanegreanu8750
    @andreanegreanu8750 Před 5 dny +1

    Very clear, well explained, top notch!

  • @manumaminta6131
    @manumaminta6131 Před 24 dny +2

    Your videos help me (a grad student) really understand difficult, often abstract concepts. Thank you so much... I'll always support your stuff!

  • @odysy5179
    @odysy5179 Před 13 dny +2

    Fantastic explanation!

  • @MuhammadrizoMarufjonov-os5fv

    Thanks for including prerequisites

  • @AlpcanAras
    @AlpcanAras Před 22 dny +2

    This is life changing, in my opinion. Thank you for the efforts on the videos!

  • @ozgunsungar9370
    @ozgunsungar9370 Před 11 dny +1

    awesome, easy to follow even person dont know anything :)

  • @anirudh514
    @anirudh514 Před 24 dny +4

    Thanks for the crystal clear explaination!!

  • @stacks_7060
    @stacks_7060 Před 23 dny +1

    One of the best math videos I’ve watched on CZcams

  • @anmolmittal9
    @anmolmittal9 Před 16 dny +1

    This is really great! Power to you!!🚀

  • @luigigiordanoorsini5980
    @luigigiordanoorsini5980 Před 17 dny +1

    Ho appena letto la piccola bio del tuo canale, spero di non essere offensivo dicendo che adesso capisco perché il tuo ottimo inglese mi sembrasse comunque molto familiare.
    Ad ogni modo ti ringrazio enormemente per il tuo contributo hai spiegato tutta la teoria in un modo, a mio avviso, estremamente chiaro e soprattutto coinvolgente.
    Ti prego continua così, di nuovo un enorme grazie e complimenti per il tuo contributo alla scienza

    • @umarjamilai
      @umarjamilai  Před 17 dny +1

      Grazie a te per aver visitato il mio canale! Spero di pubblicare più spesso, anche se per fare contenuti di qualità ci vogliono settimane di studio e preparazione. In ogni caso, spero di rivederti presto! Buon weekend

    • @luigigiordanoorsini5980
      @luigigiordanoorsini5980 Před 17 dny

      @@umarjamilai Avevi già guadagnato un iscritto adesso hai guadagnato un fan.
      Ahahahahah

  • @MuhammadMuzzamil-ki4he
    @MuhammadMuzzamil-ki4he Před 22 dny +1

    Thank you for such great and detailed explanation.

  • @jeunjetta
    @jeunjetta Před 22 dny +2

    I think KAN will be the catalist of a significant tipping point in science.
    I want to apply this to power system grids and replace existing dynamic models with ones made from PMU data using KAN

  • @artaasadi9497
    @artaasadi9497 Před 14 dny +1

    that is very useful, informative and interesting! Thanks a lot!

  • @user-pu4oc9ek9u
    @user-pu4oc9ek9u Před 24 dny +1

    Hello Umar, this video is my best birthday gift I have ever received, thanks a lot :)

  • @JONK4635
    @JONK4635 Před 23 dny +1

    Extremely clear explanation and content here! Very helpful. I am happy that you came from PoliMI as well :) keep it up!

  • @kmalhotra3096
    @kmalhotra3096 Před 17 dny +1

    Hats off, what an awesome video!!!

  • @ansonlau7040
    @ansonlau7040 Před 17 dny +1

    Thankyou Jamil, what a cool video

  • @ScottzPlaylists
    @ScottzPlaylists Před 24 dny +2

    High quality explanations.. Thanks.

  • @arupsankarroy8722
    @arupsankarroy8722 Před 21 dnem +2

    Sir, you are great..💙💙

  • @johanvandermerwe7687
    @johanvandermerwe7687 Před 24 dny +1

    I saw this paper on papers with code, and thought to myself I wonder if Umar Jamil will cover this.
    Thanks for your effort and videos!

  • @enricovompa1876
    @enricovompa1876 Před 24 dny +2

    Thank you for making this video!

  • @zaevi6855
    @zaevi6855 Před 24 dny

    crazy that it took me an hr video to understand that its the (control points) being trained on the spline graph vs weights with MLPs and CNNs, thank you!

  • @bankayxy00
    @bankayxy00 Před 19 dny +1

    Thank you so so much for this amazing content.

  • @user-il1hu5xp2x
    @user-il1hu5xp2x Před 24 dny +1

    What funny, is that i predicted your next video will be on KAN, after i see you in github.
    I WILL WATCH THIS VIDEO, AS I FEEL THIS WILL BE THE FUTURE OF NEUR NETWORK, THANK YOU FOR YOUR WORK AND CONTENT ❤

  • @lethnis9307
    @lethnis9307 Před 23 dny +1

    Your explanations are the best, thank you so much😘🤗

  • @howardmeng256
    @howardmeng256 Před 19 dny +2

    Amazing video! Thanks a lot !

  • @prathamshah2058
    @prathamshah2058 Před 22 dny +1

    Thank-you so much for explaining the paper, it is so easy to understand now, btw can you also make a hands on video with the kan package developed by mit which is based off pytorch.

  • @wolfie6175
    @wolfie6175 Před 4 dny +1

    Good video, quality content.

  • @sergiorego6321
    @sergiorego6321 Před 24 dny +1

    Phenomenal! Thank you :)

  • @hajaani6417
    @hajaani6417 Před 24 dny +1

    You’re fantastic, mate.

  • @coolkaran1234
    @coolkaran1234 Před 24 dny +2

    You are savior, without you mortals like me would be lost in the darkness!!!

  • @GUANGYUANPIAO
    @GUANGYUANPIAO Před 11 dny +1

    awesome explanation

  • @user-wy1xm4gl1c
    @user-wy1xm4gl1c Před 13 dny +1

    This is awesome!

  • @samadeepsengupta
    @samadeepsengupta Před 24 dny +2

    Great Content !!

  • @RiteshBhalerao-wn9eo
    @RiteshBhalerao-wn9eo Před 9 dny +1

    Amazingg explanation !

  • @danielegiunchi9741
    @danielegiunchi9741 Před 22 dny +1

    brilliant video!

  • @dhackmt
    @dhackmt Před 16 dny +1

    i loved it sir .

  • @vaadewoyin
    @vaadewoyin Před 21 dnem +1

    Cant wait to watch this, saved! Will comment again when i actually watch it..😅

  • @JuliusSmith
    @JuliusSmith Před 16 dny

    Excellent video, thanks! At the end, I _really_ wanted to see an illustration of the relatively "non-local" adaptation of MLP weights. Can that be found somewhere?

  • @p4ros960
    @p4ros960 Před 14 dny +1

    bruh so good. Keep it up!

  • @seelowst
    @seelowst Před 22 dny

    Having a such good teacher is so adorable, i wish i could be your students.

    • @umarjamilai
      @umarjamilai  Před 22 dny +1

      哪里哪里啊,谢谢你的赞成!

    • @seelowst
      @seelowst Před 22 dny

      @@umarjamilai 太棒了,您还会中文👍

    • @umarjamilai
      @umarjamilai  Před 22 dny +1

      @@seelowst 我就是刚刚从中国来的,在中国主了4年了,现在回欧洲了。

    • @seelowst
      @seelowst Před 22 dny

      @@umarjamilai 我从没离开过我的城市,我希望像您一样👍

  • @pabloe1802
    @pabloe1802 Před 19 dny

    An implementation video will be awesome

  • @ezl100
    @ezl100 Před 19 dny

    thanks Umar. Very nice explanation. Just 2 questions :
    1 - Does it mean we can specify different knots per edge?
    2 - I am not understanding how the backpropagation will work. Let's say we calculate the gradient from h1. It will update phi 1,1 and phi 1,2 but how the learning process will impact the knots to the desired value?

  • @akramsalim9706
    @akramsalim9706 Před 24 dny +1

    awesome bro.

  • @ai__76
    @ai__76 Před 24 dny +1

    amazing

  • @fatemeshams9758
    @fatemeshams9758 Před 8 dny +1

    awesome👍

  • @faiqkhan7545
    @faiqkhan7545 Před 23 dny +1

    Umar bhai you the great

  • @subhamkundu5043
    @subhamkundu5043 Před 23 dny

    Hey @Umar, great content as always. Looking forward to a KAN implementation video from scratch. Also I think in 31:01 there is a minor language mistake. I think it will be for using a quadratic Bspline curve rather than quadratic Bezier curve

  • @satviknaren9681
    @satviknaren9681 Před 22 dny +1

    Please do post more ! please do more videos !

  • @user-jb3ht1wq5l
    @user-jb3ht1wq5l Před 18 dny +1

    THANK YOU

  • @daleanfer7449
    @daleanfer7449 Před 24 dny +1

    刚好期盼这个!

    • @umarjamilai
      @umarjamilai  Před 24 dny

      期待你的评价😇

    • @daleanfer7449
      @daleanfer7449 Před 24 dny +1

      ❤很好的内容,有考虑做inverse rl的内容吗❤

  • @shubhamrandive7684
    @shubhamrandive7684 Před 22 dny

    Great explanation. What app do you use to create slides ?

    • @umarjamilai
      @umarjamilai  Před 22 dny

      PowerPoint + a lot a lot a lot a lot a lot of patience.

  • @plutophy1242
    @plutophy1242 Před 15 dny +1

    this video is so amazing!!!!!!!

  • @bzzzzz1736
    @bzzzzz1736 Před 23 dny +1

    thank you

  • @fouziaanjums6475
    @fouziaanjums6475 Před 24 dny +1

    Hi, can you please make a video on multimodal LLMs, fine tuning it for custom dataset...

  • @emiyake
    @emiyake Před 23 dny

    Thanks!

  • @user-hd7xp1qg3j
    @user-hd7xp1qg3j Před 24 dny +2

    Could you please next explain multi modal llms, techniques like Llava, llava plus, llava next?

  • @baba42kachari
    @baba42kachari Před 19 dny

    Thanks

  • @user-sy6xn7nq7s
    @user-sy6xn7nq7s Před 13 dny

    There are continuous but indiferable points in the spline, right? What are you going to do?

  • @Kishan31468
    @Kishan31468 Před 24 dny +1

    Thanks man. Next xLSTM please.

  • @DiegoSilva-dv9uf
    @DiegoSilva-dv9uf Před 23 dny

    Valeu!

  • @willpattie581
    @willpattie581 Před 23 dny

    One thing I didn’t catch: how are the functions tuned? If each function consists of points in space and we move around the points to move the B spline, how do we decide to move the points? Doesn’t seem like backprop would work in the same way.

    • @umarjamilai
      @umarjamilai  Před 23 dny +1

      The same way we move weights for MLPs: we calculate the gradient of the loss function w.r.t the parameters of these learnable functions and change them in the opposite direction of the gradient. This is how you reduce the loss.
      We are still doing backpropagation, so nothing changed on that front compared to MLPs.

  • @routerfordium
    @routerfordium Před 21 dnem

    Thank you for the great video! Can you (or anyone) help understand why you need to introduce the basis functions b(x) in the residual activation functions?

  • @MrAloha
    @MrAloha Před 24 dny +2

    Wow! 🙏

  • @Engrbilal143
    @Engrbilal143 Před 24 dny

    Time to implement it

  • @rohitjindal124
    @rohitjindal124 Před 24 dny

    Sir I have been a huge fan of your videos and have watched all of them . I am currently in my second year BTech and really passionate about learning ml sir if possible can work under you I don’t want any certificate or anything just want to see observe and learn

  • @jeremykothe2847
    @jeremykothe2847 Před 23 dny

    fwiw I took a MLP solution for MNIST, substituted KAN for the MLP layers and no matter what I did (adding dimensions etc) it couldn't solve it. My intuition is that KANs only work well for approximating linear-ish functions, not irregular, highly discontinuous ones like image classification would need. But perhaps I just screwed it up :D

  • @ChukwuemekaAmblessedchinenye

    can you make tutorial video on model like Perplexity that use website live search

  • @ScottzPlaylists
    @ScottzPlaylists Před 24 dny +1

    Please explain DSPy

  • @suman14san
    @suman14san Před 24 dny +1

    Please add a payment option

    • @umarjamilai
      @umarjamilai  Před 24 dny +8

      Your love and support is enough! Have a great weekend!

    • @Patrick-wn6uj
      @Patrick-wn6uj Před 24 dny +1

      @@umarjamilaiJust woow

  • @pratishdewangan132
    @pratishdewangan132 Před 18 dny +1

    In search of gold i found a diamond

  • @einsteinsapples2909
    @einsteinsapples2909 Před 24 dny

    Your explenations are great. I think though, you should take breaks to blow your nose maybe, because you were sniffing a lot. It will make you videos more enjoyable.

  • @ln_exp1
    @ln_exp1 Před 22 dny

    Interesting

  • @kiffeeify
    @kiffeeify Před 15 dny

    Thanks!

  • @alfredmanto5487
    @alfredmanto5487 Před 22 dny

    Thanks