The mathematics behind Shapley Values

A Data Odyssey

20,169 views

Added 26 March 2023
Shapley values are a fair way to divide the value of a game amongst its players. We explain the mathematics behind the Shapley value formula. To understand why it is fair, we also discuss the Shapley value axioms that the formula is derived from. The formula may seem scary but you will find it has an intuitive explanation.
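For reference, the standard form of the Shapley value formula is sketched below. The symbols are assumptions chosen to match the notation used in the comments: $p$ is the number of players, $N$ the set of all players, $S$ a coalition that does not contain player $i$, and $\mathrm{val}(S)$ the value that coalition earns. The video may use different symbols.

```latex
\[
\phi_i \;=\; \sum_{S \subseteq N \setminus \{i\}}
\frac{|S|!\,(p-|S|-1)!}{p!}\,
\bigl(\mathrm{val}(S \cup \{i\}) - \mathrm{val}(S)\bigr)
\]
```

Each term is the marginal contribution of player $i$ to coalition $S$, weighted by the share of join orders in which $i$ joins exactly that coalition.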
*NOTE*: You will now get the XAI course for free if you sign up (not the SHAP course)
SHAP course: adataodyssey.com/courses/shap...
XAI course: adataodyssey.com/courses/xai-...
Newsletter signup: mailchi.mp/40909011987b/signup
Read the companion article (no-paywall link): towardsdatascience.com/from-s...
Medium: / conorosullyds
Twitter: / conorosullyds
Mastodon: sigmoid.social/@conorosully
Website: adataodyssey.com/

Comments • 62

@adataodyssey 4 months ago
*NOTE*: You will now get the XAI course for free if you sign up (not the SHAP course)
SHAP course: adataodyssey.com/courses/shap-with-python/
XAI course: adataodyssey.com/courses/xai-with-python/
Newsletter signup: mailchi.mp/40909011987b/signup
@abstractqqq 11 months ago ⁺¹¹
Good job sir. Very few in the ML (industry) will go this deep. Seeing more like-minded people always feels great.
@adataodyssey 11 months ago ⁺¹
Thank you! I always feel a bit uncomfortable if I don't have some sort of an understanding of the theory :)
@HenryCagnini 4 months ago ⁺¹
That's a great video to explain the concept of Shapley values. Many thanks!
@adataodyssey 4 months ago
Thanks Henry! I’m glad you found it useful :)
@santizdr 21 days ago
Best channel to dig deep into XAI.
A video about the state of the art of XAI applied to LLMs would be great.
@adataodyssey 19 days ago ⁺¹
Thanks Santi! I will consider this, but my interests are more in computer vision at the moment.
@umarkhan-hu7yt 3 months ago
Dear Odyssey, you are doing great. Keep going and hit hard on all XAI models for the layman.
@adataodyssey 3 months ago
Thank you Umar! Will do :)
@mugiwxrx6282 11 months ago ⁺¹
Thank you sir, that seems clearer in my mind!
@adataodyssey 11 months ago
I’m glad it helped!
@v-ba 1 month ago
Great explanation, thank you very much
@adataodyssey 29 days ago
Thanks!
@shadmohammed618 4 months ago
Great explanation, thanks very much. 🙂
@adataodyssey 4 months ago
No problem Shad! I’m glad you found it useful
@muhammadawais581 1 day ago
hats off to you for such a nice explanation.
@adataodyssey 22 hours ago
Thanks Muhammad!
@miguelgarciaortegon 3 months ago
Great explanation, thank you!
@adataodyssey 3 months ago ⁺¹
I'm glad you found it useful Miguel :)
@silver_soul98 2 months ago
Bro that was a nice explanation. thanks so much.
@adataodyssey 2 months ago
No problem :) I’m glad it was useful
@elenagolovach384 11 months ago
Thanks very much
@adataodyssey 11 months ago
No problem, Elena!
@dennisestenson7820 6 months ago ⁺¹
I really don't like that this subject is presented and studied as "games" when the underlying math is so incredibly enlightening and important.
@adataodyssey 6 months ago ⁺⁵
This is a term that comes from "game theory". Rest assured that the "games" it deals with are very serious! Perhaps the example I've chosen is a bit silly but I was hoping that it would help the target audience relate to the concepts :)
@lakshman587 7 months ago
The time machine thing really got me hahaha!!
I was wondering how can individual values be calculated!!
Thanks for clear explanation!!
@adataodyssey 7 months ago ⁺¹
No problem Lakshman! Are there any other related concepts you're interested in learning about?
@lakshman587 7 months ago
@@adataodyssey I would like to learn about ChatGPT, for example how transformers work.
@adataodyssey 7 months ago ⁺¹
@@lakshman587 This is a bit out of my comfort zone tbh. My content is more aimed towards computer vision and explainable AI. I was considering doing a tutorial on how you can use the GPT API though!
@lakshman587 7 months ago
@@adataodyssey OK, no problem. Can we have a video about how diverse counterfactuals work under the hood? We are currently using the DiCE package from the interpretml repo.
I would like to know how these counterfactuals are generated!
@adataodyssey 7 months ago ⁺¹
@@lakshman587 will look into that!
@yelancho 1 month ago
I appreciate it a lot, Prof Odyssey! Shapley values are now a clearer concept in my mind!
@adataodyssey 1 month ago ⁺¹
Thanks Ye! I'm glad you found it useful :)
@smithanair787 1 year ago ⁺²
Great video! Can you make a video on how exactly Kernel SHAP and TreeSHAP work?
@adataodyssey 1 year ago ⁺¹
Thank you Smitha! I will consider that. But, to be honest, it will take me some time to fully understand the algorithms first.
@smithanair787 1 year ago
@@adataodyssey Thank you!
@adataodyssey 1 year ago
@@smithanair787 by the way, the course goes into a bit more detail on the difference between kernelSHAP and treeSHAP. Otherwise you might find this article helpful: towardsdatascience.com/kernelshap-vs-treeshap-e00f3b3a27db
@Empobaer 1 year ago ⁺⁴
Great video, the formula was well explained! I do have a question, though. What is the intuition behind P(C1-C0) = 2/6 at 7:11? Intuitively, I would have thought there is only one way player 1 can start a new coalition, so the weight should be 1/6. I see that if we use the formula for the weights, we obtain (p-|S|-1)! = 2 and thus (1*2)/6 = 1/3, but I still miss the intuition behind why we need the (p-|S|-1)! term. Where am I thinking wrong?
@adataodyssey 1 year ago ⁺¹
Great question! Keep in mind that to receive the full value of the game all 3 players need to participate. So, after P1 joins, there are 2 ways for the full coalition to form -- P2 joins then P3 or P3 joins then P2. In other words, there will be 2 scenarios where P1 makes a marginal contribution to a team of no players. Does that make sense?
@adataodyssey 1 year ago ⁺¹
If not, then this article may help. It calculates the values in a slightly different way which may be more intuitive to you.
www.analyticsvidhya.com/blog/2019/11/shapley-value-machine-learning-interpretability-game-theory/
@Empobaer 1 year ago
@@adataodyssey Thank you for your fast reply, indeed the blog post was very useful for an intuitive understanding! Anyways your series on SHAP and Shapley values was a very helpful introduction. Now I only need to fully understand Kernel SHAP, which will probably take a bit longer :)!
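The counting argument in this thread can be checked by brute force. The sketch below is not from the video: it lists every order in which three players (labelled P1, P2, P3 as in the replies) can join, counts the orders in which P1 joins a given coalition, and compares that share with the factorial weight quoted in the question. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

players = ("P1", "P2", "P3")
orderings = list(permutations(players))  # all 3! = 6 join orders


def predecessors(order, player):
    """Players who joined before `player` in this join order."""
    return frozenset(order[:order.index(player)])


def weight_by_counting(player, coalition):
    """Share of join orders in which `player` joins exactly `coalition`."""
    target = frozenset(coalition)
    hits = sum(1 for order in orderings if predecessors(order, player) == target)
    return Fraction(hits, len(orderings))


def weight_by_formula(coalition, p=len(players)):
    """Factorial weight |S|! (p - |S| - 1)! / p! for a coalition S."""
    s = len(coalition)
    return Fraction(factorial(s) * factorial(p - s - 1), factorial(p))


# Compare both weights for every coalition P1 could join.
for coalition in [(), ("P2",), ("P3",), ("P2", "P3")]:
    print(coalition, weight_by_counting("P1", coalition), weight_by_formula(coalition))
```

The empty coalition gets weight 2/6 because two join orders start with P1, which is where the (p-|S|-1)! factor comes from: it counts the ways the remaining players can join afterwards.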
@hasnainayub2369 3 months ago
Great explanation! I have a question though. Why don't the shap values for each feature (from the waterfall plot) add up to the predicted output at that particular observation?
@adataodyssey 3 months ago
They do if you also add the average prediction across all the instances in the dataset:
f(x) = E[f(x)] + sum(shap values)
You can see the average prediction, i.e. E[f(x)], on the bottom of the waterfall plot :)
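The additivity property in this reply can be illustrated on a toy example. The sketch below does not use the SHAP package and is not its algorithm: it computes exact Shapley values for a hypothetical three-feature model, assuming the value of a feature subset is the average prediction when those features are fixed to the instance and the rest are taken from a small, made-up background dataset.

```python
import math
from itertools import combinations
from math import factorial


def model(x):
    # Hypothetical model with an interaction between features 1 and 2.
    return 2.0 * x[0] + x[1] * x[2] + 1.0


# Made-up background data (assumption) and the instance to explain.
background = [(0.0, 1.0, 2.0), (1.0, 0.0, 1.0), (2.0, 3.0, 0.0), (1.0, 2.0, 2.0)]
x = (3.0, 1.0, 4.0)
p = len(x)


def value(S):
    """Average prediction with features in S fixed to x, others from background rows."""
    total = 0.0
    for row in background:
        mixed = tuple(x[j] if j in S else row[j] for j in range(p))
        total += model(mixed)
    return total / len(background)


def shapley(i):
    """Exact Shapley value of feature i under the value function above."""
    others = [j for j in range(p) if j != i]
    phi = 0.0
    for size in range(p):
        weight = factorial(size) * factorial(p - size - 1) / factorial(p)
        for S in combinations(others, size):
            phi += weight * (value(set(S) | {i}) - value(set(S)))
    return phi


phis = [shapley(i) for i in range(p)]
base = value(set())    # average prediction over the background, i.e. E[f(x)]
prediction = model(x)  # f(x) for the explained instance
print(base, sum(phis), prediction)
print(math.isclose(base + sum(phis), prediction))
```

With every feature fixed, the value function returns f(x), and with none fixed it returns the average prediction, so the Shapley values must sum to the difference between the two.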
@hasnainayub2369 3 months ago
Got it ! Thanks mate @@adataodyssey
@adityababel3998 8 months ago
Hello, can you make a video explaining the calculation of TreeSHAP??
@adataodyssey 8 months ago
Hi Aditya, this is already on my list of videos to do! I want to make a video about both Kernel SHAP and TreeSHAP that goes more in-depth into the algorithms.
@TheOzpad 1 year ago ⁺¹
Lekker vid
@mathieucordier9248 9 months ago
Are SHAP values suitable for imbalanced datasets? I ask because one assumption is that every combination of players is equally likely (at 6:00).
@adataodyssey 9 months ago
That's a good question! I haven't really thought about that.
In ML we don't assume equal chance for all feature values. We use the empirical distributions of the features. So you don't have to worry about that assumption for unbalanced features.
For unbalanced targets, I'd say you should be fine as long as the model is still making accurate predictions. E.g. if it is always predicting one class then the SHAP values won't be meaningful.
@QuantizedFields 7 months ago
This is a very good explanation. However, I found it a bit confusing when you were referring to "Player-1" as "You". Because it is not clear who I am from the animation, am I "Player-1" or "Player-2" ? It would be better if you simply refer to the animation/picture and say "Player-1" or "Player-2" instead of "You". Thanks for your great work!
@adataodyssey 7 months ago
Thanks for the input Daniya! When it comes to technical content, it is difficult to strike a balance between making it interesting and easy to understand. My goal was to get the audience engaged but I see how this can be confusing.
@abdelbaki8625 1 month ago
What is the article reference for this information? I need it urgently for my studies, please.
@adataodyssey 29 days ago
https://towardsdatascience.com/from-shapley-to-shap-understanding-the-math-e7155414213b?sk=329a1f042a0167162487f7bb3f0ffd46
@cheeseybox 6 months ago
Ur a legend
@adataodyssey 6 months ago
Coming from you cheese vision, I take that as a great compliment!
@hasnainayub2369 3 months ago
Could you please explain P(C1-C0) = 1/3 ? (at 7:12). The rest is very well explained.
@adataodyssey 3 months ago
Remember, to get the prize money all players must eventually join the coalition. So, there are 2 ways that P1 can contribute to a coalition with no players (i.e. C0):
- P1 joins then P2 then P3
- P1 joins then P3 then P2
So they make the marginal contribution C1 - C0 in 2/6 = 1/3 ways the coalition of 3 players can form.
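The same number follows from the factorial weight mentioned in an earlier comment. As a worked check, assuming $p = 3$ players and the empty coalition $S = \emptyset$ (i.e. C0):

```latex
\[
\left.\frac{|S|!\,(p-|S|-1)!}{p!}\right|_{S=\emptyset,\;p=3}
= \frac{0!\cdot 2!}{3!}
= \frac{2}{6}
= \frac{1}{3}
\]
```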
@hasnainayub2369 3 months ago
@@adataodyssey Thank you so much! How did I not see that :/
@adataodyssey 3 months ago
@@hasnainayub2369 don't stress! It took me forever to understand
@abdelbaki8625 1 month ago ⁺²
I don't understand
@seanjohn6956 1 month ago
just stick to the explanations no need for the jarring adlibs
@adataodyssey 1 month ago
That's boring...