Silhouette Score for clustering Explained | Silhouette (clustering)- Validating Clustering Models

Sdílet
Vložit
  • čas přidán 18. 05. 2021
  • Silhouette Score for clustering Explained | Silhouette (clustering)- Validating Clustering Models
    #SilhouetteScore #UnfoldDataScience
    Hello ,
    My name is Aman and I am a Data Scientist.
    About this video:
    In this video, I speak about Silhouette Score and explain step by step how Silhouette Score works for cluster validation. I explain how to validate clusters and how to measure goodness of clusters. I explain the mathematical formula of Silhouette Score and intuition behind it. Below points are discussed in this video:
    1. Silhouette Score for clustering
    2. Validation on K-means clusters
    3. Cluster validation techniques
    4. How to measure goodness of clusters
    5. Unsupervised machine learning accuracy
    About Unfold Data science: This channel is to help people understand basics of data science through simple examples in easy way. Anybody without having prior knowledge of computer programming or statistics or machine learning and artificial intelligence can get an understanding of data science at high level through this channel. The videos uploaded will not be very technical in nature and hence it can be easily grasped by viewers from different background as well.
    If you need Data Science training from scratch . Please fill this form (Please Note: Training is chargeable)
    docs.google.com/forms/d/1Acua...
    Book recommendation for Data Science:
    Category 1 - Must Read For Every Data Scientist:
    The Elements of Statistical Learning by Trevor Hastie - amzn.to/37wMo9H
    Python Data Science Handbook - amzn.to/31UCScm
    Business Statistics By Ken Black - amzn.to/2LObAA5
    Hands-On Machine Learning with Scikit Learn, Keras, and TensorFlow by Aurelien Geron - amzn.to/3gV8sO9
    Ctaegory 2 - Overall Data Science:
    The Art of Data Science By Roger D. Peng - amzn.to/2KD75aD
    Predictive Analytics By By Eric Siegel - amzn.to/3nsQftV
    Data Science for Business By Foster Provost - amzn.to/3ajN8QZ
    Category 3 - Statistics and Mathematics:
    Naked Statistics By Charles Wheelan - amzn.to/3gXLdmp
    Practical Statistics for Data Scientist By Peter Bruce - amzn.to/37wL9Y5
    Category 4 - Machine Learning:
    Introduction to machine learning by Andreas C Muller - amzn.to/3oZ3X7T
    The Hundred Page Machine Learning Book by Andriy Burkov - amzn.to/3pdqCxJ
    Category 5 - Programming:
    The Pragmatic Programmer by David Thomas - amzn.to/2WqWXVj
    Clean Code by Robert C. Martin - amzn.to/3oYOdlt
    My Studio Setup:
    My Camera : amzn.to/3mwXI9I
    My Mic : amzn.to/34phfD0
    My Tripod : amzn.to/3r4HeJA
    My Ring Light : amzn.to/3gZz00F
    Join Facebook group :
    groups/41022...
    Follow on medium : / amanrai77
    Follow on quora: www.quora.com/profile/Aman-Ku...
    Follow on twitter : @unfoldds
    Get connected on LinkedIn : / aman-kumar-b4881440
    Follow on Instagram : unfolddatascience
    Watch Introduction to Data Science full playlist here : • Data Science In 15 Min...
    Watch python for data science playlist here:
    • Python Basics For Data...
    Watch statistics and mathematics playlist here :
    • Measures of Central Te...
    Watch End to End Implementation of a simple machine learning model in Python here:
    • How Does Machine Learn...
    Learn Ensemble Model, Bagging and Boosting here:
    • Introduction to Ensemb...
    Build Career in Data Science Playlist:
    • Channel updates - Unfo...
    Artificial Neural Network and Deep Learning Playlist:
    • Intuition behind neura...
    Natural langugae Processing playlist:
    • Natural Language Proce...
    Understanding and building recommendation system:
    • Recommendation System ...
    Access all my codes here:
    drive.google.com/drive/folder...
    Have a different question for me? Ask me here : docs.google.com/forms/d/1ccgl...
    My Music: www.bensound.com/royalty-free...

Komentáře • 163

  • @shaneglean217
    @shaneglean217 Před 2 lety +20

    The explanation is so concise and simple, even in spite of the fact that english isn't your 1st language. Amazing work my dude.

  • @chrisrichardson8864
    @chrisrichardson8864 Před 2 lety +5

    Clear and right to the point! Thank you for your work! Much appreciated!

  • @nicolaevasiliu9430
    @nicolaevasiliu9430 Před rokem

    Amazing explanations! THANK YOU !!!
    You are an excellent teacher !!!

  • @Pannafreestyle
    @Pannafreestyle Před rokem +1

    Sir you are the best teacher for DS ! Keep up the great work

  • @isaacyekoniya8067
    @isaacyekoniya8067 Před 3 lety +11

    This is simple and well explained, thank you Aman!

  • @JuliaCh47
    @JuliaCh47 Před 2 lety +2

    Thank you so much, best explanation I've seen on the Internet!

  • @KHakimin
    @KHakimin Před 3 lety +3

    This is gold! thanks for putting it very simple to understand

  • @YouTribalChief
    @YouTribalChief Před rokem +1

    What a gem of an explanation you give bhai! Truly you make things so simple and understandable!
    You deserve more!

    • @UnfoldDataScience
      @UnfoldDataScience  Před rokem

      Thanks a lot. Pls share with friends as well. Have a nice weekend ahead.

  • @the.quantum.paradox
    @the.quantum.paradox Před 2 lety

    Very simple and easy to understand!

  • @Felipe90820
    @Felipe90820 Před 4 měsíci

    Amazing explanation! Congrats and thank you!

  • @dbapple
    @dbapple Před 7 měsíci

    thank you, straight to the point!

  • @aaryalad7775
    @aaryalad7775 Před měsícem

    I'm glad i found a teacher like you

  • @samiayasmin5782
    @samiayasmin5782 Před rokem

    Thank you so much. Simple and clean explanation.

  • @victorcardenas9621
    @victorcardenas9621 Před 2 lety

    Simple but powerful, thanks and greetings from Colombia!

  • @Anon11229
    @Anon11229 Před 3 lety +1

    Too good...Way better than online courses that take thousands and thousands of money
    keep up the hard work

  • @RoopaNandu-k8b
    @RoopaNandu-k8b Před 24 dny

    Aman, your explanation is really good. Very precise and in detail

  • @sanjithaamarathunga9949
    @sanjithaamarathunga9949 Před 3 lety +2

    Thank you! Simple and well explained!

  • @Krishna-pm8ty
    @Krishna-pm8ty Před rokem

    Excellent Amen. Perfect explanation👏👏👏

  • @mmmommm237
    @mmmommm237 Před 2 lety

    Great, ty!

  • @uchennanwosu5327
    @uchennanwosu5327 Před 2 lety

    Excellent illustration and explanation.

  • @optimalsolutionprogramming1299

    Thank you for your help. Really great explanation!

  • @ravisavaliya4333
    @ravisavaliya4333 Před 2 lety

    at first it was very difficult for me to understand this, but you explained this in a very easy way....THANKYOU AMAN SIR FOR YOUR EASY AND INTUITIVE EXPLANATION!!!

  • @kushik.naveen
    @kushik.naveen Před 2 měsíci

    This is so useful for understanding & revising. Amazing work! Thankyou!

  • @muhammedthayyib9202
    @muhammedthayyib9202 Před rokem

    Clearly explained!

  • @sangeethaagoudar2175
    @sangeethaagoudar2175 Před 2 lety

    Thank You so much for this video sir

  • @keinestudiere-chillzone633

    Straight to the point. Thank you

  • @krishnabhadke6161
    @krishnabhadke6161 Před 2 lety

    very well explained sir thank you

  • @mdyounusahamed6668
    @mdyounusahamed6668 Před rokem

    very concise and simple explanation.

  • @RezoanurRahman
    @RezoanurRahman Před rokem

    You explained it so well even a 5th grader would get it. Its so easily explained!.

  • @Suno-99
    @Suno-99 Před 3 měsíci

    beautiful explanation, keep it up

  • @MyerNore
    @MyerNore Před 8 měsíci

    Clearer than my professor, for sure

  • @omniscienceisdead8837
    @omniscienceisdead8837 Před 7 měsíci

    you are the best bro!

  • @vallimuthaiyah5098
    @vallimuthaiyah5098 Před 3 lety +1

    Thank you sir for very clear explanation with example.. 👍

  • @akashpb4044
    @akashpb4044 Před 2 lety

    Awesome ...plain and simple 👍🏼

    • @UnfoldDataScience
      @UnfoldDataScience  Před 2 lety +1

      Glad to hear that Akash 🙂, please share with others as well who could be benefited from such content.

  • @chrisx742
    @chrisx742 Před 6 měsíci

    Thank you.

  • @lhadz7290
    @lhadz7290 Před 9 měsíci

    Thank you so much for Sir, I am so glad I came across your video. I look forward to learn more from you. Thank you again.

  • @liqs_xd
    @liqs_xd Před 11 měsíci

    Great video, thanks for your time!

  • @anshuraghav5620
    @anshuraghav5620 Před 2 měsíci

    Great video

  • @veyselaytekin8734
    @veyselaytekin8734 Před rokem

    thank you

  • @parvsharma8767
    @parvsharma8767 Před 3 lety +1

    Thanks bro for explaining so perfectly

  • @sourabhpramanik3957
    @sourabhpramanik3957 Před 2 lety

    Its really a simple and good explanation for anybody to understand...thanks for making this simple explanation video.

  • @blackyogurt
    @blackyogurt Před 2 měsíci

    Great Explanation . thanks

  • @amoghbharadwaj9252
    @amoghbharadwaj9252 Před 2 lety

    beautifully explained!!! Thank u so much

  • @diyap7343
    @diyap7343 Před 2 lety +1

    This is amazing!

  • @sajjaddehghani3280
    @sajjaddehghani3280 Před 2 lety

    Thanks for the very simple and easy to understand explanation

  • @user-hs6gz4yn1u
    @user-hs6gz4yn1u Před 2 měsíci

    Ur Explanation is simple and understandable,TQ sir

  • @nandankakadiya1494
    @nandankakadiya1494 Před 3 lety +2

    Amazing explaination thank you sir

  • @shaikhuzma786
    @shaikhuzma786 Před 8 měsíci

    Tqsm sir so well explained ❤

  • @legolife3012
    @legolife3012 Před 3 lety

    Thanks a lot for this simple explenation

  • @user-dd3mf2sp7s
    @user-dd3mf2sp7s Před 2 lety

    Great explanation, Thank you!

  • @almonddonut1818
    @almonddonut1818 Před 2 lety

    Thank you!!

  • @stevemungai3542
    @stevemungai3542 Před 2 lety

    Crystal clear

  • @paulholzapfel1856
    @paulholzapfel1856 Před 2 lety

    Perfect explanation. Thank you Aman

  • @kavyasharma4738
    @kavyasharma4738 Před 2 lety

    how can i thank you brother . you explained this topic so easily.

  • @mohe4ever514
    @mohe4ever514 Před 2 lety

    Very well explained!

  • @theone-jz2rf
    @theone-jz2rf Před rokem

    keep up the good work!!!

  • @puodzius
    @puodzius Před 2 lety +1

    Very good explanation, thanks

  • @sandipansarkar9211
    @sandipansarkar9211 Před 2 lety

    finished watching

  • @venkataprashanth9831
    @venkataprashanth9831 Před 2 lety

    great and simple explanation, a big clap to ur efforts

  • @anisadillap.7131
    @anisadillap.7131 Před 2 lety

    oke i understand, thankyou sir ...

  • @nandinik7816
    @nandinik7816 Před 2 lety

    Always to the point...and you dont waste any time actually...

  • @megalaramu
    @megalaramu Před 3 lety

    Hi Aman, I do have a general question in clustering -I have heard of people do clustering in the dataset initially and then train separate model for each cluster. my question is lets take an example of telecom churn - there will be many services like internet, Phone, Dish etc. is there anyway we can cluster like internet users in one cluster, phone service in one cluster like that.

  • @SACHINKUMAR-px8kq
    @SACHINKUMAR-px8kq Před 2 lety

    Thanks you so much sir

  • @Chaseaction-outdoor-activities

    thank you!!!

  • @dr.zliiiiii
    @dr.zliiiiii Před 2 lety

    nice video

  • @PranavVashisth
    @PranavVashisth Před 2 lety

    verry well explained !!!!

  • @kamal_douma
    @kamal_douma Před 6 měsíci

    very very good explnation

  • @user-ur2en1zq4f
    @user-ur2en1zq4f Před rokem

    thanks sir

  • @dosendaring
    @dosendaring Před 2 lety

    This is very good explanation, thank you Sir....

  • @beprepare8128
    @beprepare8128 Před 2 lety

    Very well explained sir

  • @MohdRashid-hi7cp
    @MohdRashid-hi7cp Před 11 měsíci

    Very nice explanation

  • @jeanhwang18
    @jeanhwang18 Před rokem

    Thank you so much for your explaination. I have a small question: after I got point i ' s silhouette coefficient, I calculate other two points in cluster A, then I take the average of the three silhouette coefficients, let's say it's NO.1. What we can learn from No.1? If No.1 is close to +1, it's good, it means data points in cluster A are most similar to each other, right?

  • @bangarrajumuppidu8354
    @bangarrajumuppidu8354 Před 2 lety +1

    excellent

  • @salilshukla08
    @salilshukla08 Před 2 lety

    thanks

  • @shaheerrehman3151
    @shaheerrehman3151 Před rokem

    Thank you. Much love to you from Pakistan

    • @UnfoldDataScience
      @UnfoldDataScience  Před rokem

      Thanks Shaheer, pls share with friends if you like the content.

  • @prashanths4455
    @prashanths4455 Před 2 lety +1

    super bro

  • @vishwass5103
    @vishwass5103 Před 3 lety

    Hi Aman thanks for this explanation. Please explain about assumptions of k means clustering

    • @UnfoldDataScience
      @UnfoldDataScience  Před 3 lety

      Assumption at high level is, you data has clusters and centriods.

  • @iftikhar58
    @iftikhar58 Před 2 lety

    clear

  • @gnaneshgn8341
    @gnaneshgn8341 Před 3 lety +1

    Nice video Sir thank you. please make an video of EDA sir ..!

  • @handsomemehdi3445
    @handsomemehdi3445 Před rokem

    Thank u for simpler explanation! A question: In our case, it seems that for both clusters, b should be the same unless we have more than two clusters, (or since those clusters are mutually nearest) doesn't it?

    • @lkjhlkjh1954
      @lkjhlkjh1954 Před rokem

      I think that’s why we do not calculate silhouettes score for less than 2 cluster

  • @bipulnath4602
    @bipulnath4602 Před 3 lety +1

    Amazing explaination..plz make a video on Dunn index.

  • @neajinnovations8587
    @neajinnovations8587 Před 9 měsíci

    I got all the concept clear, but please when is it appropriate to use silhouette distance? it when you have mixed data? or mix data is best for elbow method?

  • @ranajaydas8906
    @ranajaydas8906 Před 3 lety +1

    Sir please make a video on PCA . It would be very helpful

  • @Live_life_with_no_regrets
    @Live_life_with_no_regrets Před 5 měsíci

    is there any way to run it in SPSS? Is it correct if I use the average of the Euclidean distance of the cluster?

  • @christopherlegarda5164
    @christopherlegarda5164 Před 4 měsíci

    This guy is god!

  • @sampathvinaykumarreddymajj790

    Topic is very well explained sir ..
    But should we learn DB Index, Dunn Index, Jaccard score ... or only this Silhouette score is enough

  • @TheGentlemanGR
    @TheGentlemanGR Před 2 lety

    So which score is better? -0.2 or 0.21 ?

  • @subhasisdutta3711
    @subhasisdutta3711 Před 3 lety +2

    you have said "Min distance of A1,A2,A3 is the value of b ". Here you are talking about mean value of three or the which one has the minimum value among three , we'll chose it. Please clear it once, if possible

    • @UnfoldDataScience
      @UnfoldDataScience  Před 3 lety +2

      HI Subhasis, A1, A2 etc are "mean" Distance from various other clusters ok. Now let's take att these A1 TO AN values and take the minimum from this set. This mimum is b.

    • @subhasisdutta3711
      @subhasisdutta3711 Před 3 lety

      @@UnfoldDataScience thanq for the clarification. Have a blessed life ahead

  • @mahipatil369
    @mahipatil369 Před rokem

    hello sir.. great video i understand this topic very well,, thank you so much...
    just want to ask did you make video of implementation of silhoutte score in python?? if yes plz provide link..

    • @UnfoldDataScience
      @UnfoldDataScience  Před rokem

      Thanks a lot. Its very simple to do in python - see this link - stackoverflow.com/questions/59919627/how-to-calculate-the-silhouette-score-for-each-cluster-separately-in-python

  • @parikshitgurjar5545
    @parikshitgurjar5545 Před 3 lety +1

    hello sir, Plese make a video on the explanation and implementation of ppf,pdf,cdf in python with simple explanation and understanding, I am very confused in this.

  • @himanshumangoli6708
    @himanshumangoli6708 Před 2 lety

    As we have find cohesion and separation for only one point in one cluster A, so we have to find Silhouette score for all points for all points in same cluster so just to ensure that points are correctly clustered ??
    Please explain??

  • @pruthvi491
    @pruthvi491 Před 2 lety +1

    How to get silhouette score for spectral clustering

  • @arisurya9867
    @arisurya9867 Před 2 lety

    Nice video sir. But i have problem when calculate cohesion in cluster just has one data point. How i should set value of cohesion? 0 or 1?

  • @gretchenbekker2371
    @gretchenbekker2371 Před 2 lety

    How do I get the amount of data points within the clusters ?

    • @UnfoldDataScience
      @UnfoldDataScience  Před 2 lety

      Levels can be found out after clustering then we can count using pandas.

  • @nerdymath6
    @nerdymath6 Před rokem

    is 0.4 is good silhouette score?

  • @abeerhamid
    @abeerhamid Před měsícem

    How can i read the graph of silhouette score? Most important thing you didn't tell

    • @UnfoldDataScience
      @UnfoldDataScience  Před měsícem

      Positive Scores (close to 1): Samples are well-clustered, and there is good separation between clusters.
      Scores around 0: Samples are on or near the boundary between clusters, indicating potential overlap.
      Negative Scores: Samples are likely in the wrong cluster.

  • @dariomelconian9502
    @dariomelconian9502 Před rokem

    Is 0.39 a bad score?

  • @sameeransari9129
    @sameeransari9129 Před rokem

    apki english samajhne me time lg jata hai.. ki aap kya bolna chahte ho

  • @sameeransari9129
    @sameeransari9129 Před rokem

    Sir aap hindi me samjhaya karo toh zyada aacha samjha payoge...

  • @sameeransari9129
    @sameeransari9129 Před rokem

    apki english me confuse ho jate hai log ki aap kya bolna chahte ho...isko negitive comment ki tarah mt lena...aap samjhate aacha ho pr agar hindi me samjhaoge toh hm logo ko clear samaj ayega