Live Day 6- Discussing KMeans,Hierarchical And DBScan Clustering Algorithms

Sdílet
Vložit
  • čas přidán 24. 08. 2024

Komentáře • 87

  • @AmirAli-id9rq
    @AmirAli-id9rq Před 2 lety +16

    ek number session ... in easy terms ... BIAS is the inability of ML algorithm to capture the 100 percent or exact relationship. To understand bias one must think why do we need a ML in first place. In mathematics or physics we have absolute relationship or formula between dependent and independent variables like s=ut+1/2 at2 (std 7 Physics) or SI = P*R*T so for computing cases like we have absolute formula we don't need any ML algo. ML try to do the same i.e. estimate a formula, let say I want to calculate the purchasing power (P) so I train a model with different variables like income,age, family income and m model fetches a formula P = wo+ b1*income+b2*age + b3* family income..... So this formula is not absolute or universal as its derived by a specific ML algo for specific data but let say by miracle we derive a formula that exactly calculates the purchasing power with 100 percent accuracy so for that model bias is 0 as the model accurately captures the relationship..... Variance ---- Talking about variance, in short way the difference in fits between data set is called variance , imagine we used that same miracle formula in test data and data fits 100 percent as in we get 100 percent accuracy(for different test set) then we can say that the variance is 0 which means the ML formula is perfect or let say when use the same miracle formula in test set we get 50% accuracy which means the bias was low but variance is high as formula didnt work well with unseen (test) data... SO in an imaginary world if bias is 0 and variance is also 0 then my friend you have discovered a formula not an estimation .... In a practical world we aim for a model with low bias and low variance..... Subscribe Krish Channel if this helped

  • @parth.mandaliya
    @parth.mandaliya Před 2 lety +26

    A humble request to you @Krish, make next live streams on Deep Learning.

    • @Ishaheennabi
      @Ishaheennabi Před 2 lety +3

      ya

    • @prashantkandarkar8993
      @prashantkandarkar8993 Před 2 lety +2

      Yes

    • @kkevinluke
      @kkevinluke Před 2 lety +2

      I would EDA, cuz that is more applicable in the job scenarios, i.e. it depends on the role, but generally, most roles, require strong EDA knowledge, so, I would go for EDA 7 days. next,

    • @parth.mandaliya
      @parth.mandaliya Před 2 lety +1

      @@kkevinluke looks like your opinion won. And I also agree with you.

  • @rahulalladi2086
    @rahulalladi2086 Před 2 lety +10

    I got placed at tiger analytics
    Credit goes to u krish
    Your videos helped me to crack the interview

    • @rafibasha4145
      @rafibasha4145 Před 2 lety +1

      Hi Rahul ,congrats .please share interview quesions

  • @geekyprogrammer4831
    @geekyprogrammer4831 Před 2 lety +15

    Good Evening Krish. Your contents is absolutely a gold mine. Please arrange Deep Learning sessions next :)

  • @pollypravir5378
    @pollypravir5378 Před 2 lety +3

    Thanks

  • @akhilbez88
    @akhilbez88 Před rokem

    You are the best teacher that I have in my life in this domain,thanks a lot to share this kind of knowledge...

  • @vikascbr
    @vikascbr Před 2 lety +1

    Good morning krish.. You have really made my foundation very strong before that I was null in statistic and machine learning since from non technical background.. Now I can read very high level books and could really understand.. You are really great value addition to my learning path..

  • @yusmanisleidissotolongo4433
    @yusmanisleidissotolongo4433 Před 5 měsíci

    Excellent, just excellent. Thanks

  • @gayanath009
    @gayanath009 Před 5 měsíci

    Super Explanation as always. hats off

  • @kaustubhkapare807
    @kaustubhkapare807 Před 2 lety +2

    Thank You

  • @gummalasaiteja961
    @gummalasaiteja961 Před 2 lety +1

    1.75 speed is he best way to watch and lot of information covered in less time

  • @kumarnityanand4731
    @kumarnityanand4731 Před 2 lety +1

    Excellent and knowledge gaining session and every second spend was gain. Thanks alot 😊 keeping helping and sharing the knowledge & concepts 💐💐💐

  • @krishnadhawalapure
    @krishnadhawalapure Před rokem

    you are one of the best teachers any student can have..❤

  • @KamalSingh-rt2bb
    @KamalSingh-rt2bb Před 2 lety

    Hello sir I started every morning with a new session of machine learning. And last 6 days teach me a lot about machine learning algorithms. Thank you very much for this playlist.

  • @rafibasha4145
    @rafibasha4145 Před 2 lety +4

    Please cover XGboost'GBM and catboost in live videos so we can understamd learn better

  • @shubhamgupta09
    @shubhamgupta09 Před rokem +5

    Hi Sir, At 1:11:00, I think you had mistakenly spoken the wrong terms for High Bias & low Bias. It should be like for High Bias-> Not perform well, Low Bias-> Perform well. We use Low Bias & low variance for the Generalized Model as it performs well. Correct me if I am wrong.

    • @ashutoshmishra6920
      @ashutoshmishra6920 Před rokem

      Pata hai bsdk galti se boldiye sir iske liye comment krne ki jarurat nai thi gyaan mat chodo

  • @abhishekpatil1106
    @abhishekpatil1106 Před 2 lety

    First thing First !
    Great session 👏 👌 👍

  • @Dovahkiin7994
    @Dovahkiin7994 Před rokem

    Thanks for this great Tutorial.

  • @harshitsamdhani1708
    @harshitsamdhani1708 Před 8 měsíci

    Thank you for the lecture

  • @tanwilliam7351
    @tanwilliam7351 Před 2 lety +2

    Yes DEEP LEARNING NEXT!

  • @sandipansarkar9211
    @sandipansarkar9211 Před 2 lety +1

    finished watching

  • @piyushsonekar1225
    @piyushsonekar1225 Před rokem

    thanks! really want know about exact definition of bias & var
    great teaching

  • @gh504
    @gh504 Před 2 lety +1

    Amazing explanation thank you sir

  • @ridoychandraray2413
    @ridoychandraray2413 Před rokem

    Krish Naik Sir is Awesome

  • @AmirAli-id9rq
    @AmirAli-id9rq Před 2 lety +2

    at 1:11:31 , I guess its wrong if the model captures the good relationship(between dependent and independent variable) in data then it has low bias not high bias. Low bias means that model output the formula is flexible (low bias) to capture the relationship , high bias means that the accuracy is low and model is unable to capture the actual data points .. please verify guys

  • @akarkabkarim
    @akarkabkarim Před rokem

    Thank your sir Krish

  • @navalsehgal1015
    @navalsehgal1015 Před 9 měsíci

    Keep it up.

  • @rafibasha4145
    @rafibasha4145 Před 2 lety +2

    Please start mock interview sessions as well

  • @LearningWithNisa
    @LearningWithNisa Před 7 měsíci

    Hello sir, you are doing great job. do you have any video related to OPTIC clustering?

  • @ramdasprajapati7884
    @ramdasprajapati7884 Před rokem

    Beautiful sir....

  • @user-yc7zi3gy9v
    @user-yc7zi3gy9v Před rokem

    Hello sir take care of your health

  • @raghavsharma8512
    @raghavsharma8512 Před 2 lety

    superb.....!!

  • @pankajkumarbarman765
    @pankajkumarbarman765 Před 2 lety

    Thank you so much sir❤️

  • @ishwarsalunke1838
    @ishwarsalunke1838 Před rokem

    Depends on the data points

  • @cloudengineer1348
    @cloudengineer1348 Před 2 lety +2

    Hi Krish, Are you planning to take ML (Deep Learning) session?

  • @rohanwaghulkar3551
    @rohanwaghulkar3551 Před 10 měsíci

    sir pls make video on homogeneity, completeness, V-measure and Davies-Bouldin Index

  • @md.ishtiakrashid1523
    @md.ishtiakrashid1523 Před 6 měsíci

    The video was very good. But how to calculate the feature importance after k-means clustering?

  • @kkevinluke
    @kkevinluke Před 2 lety

    Hello @Krish, thank you for the explanations. Please do an extensive depth in EDA sessions next. I appreciate your efforts very much, thanks again.

  • @minhaoling3056
    @minhaoling3056 Před 2 lety +2

    will you do deep learning series?

  • @rakeshliparefms2
    @rakeshliparefms2 Před rokem

    Hi krish sir its learning from you.
    Can you please detailed video of Principle components analysis

  • @mainakseal5027
    @mainakseal5027 Před 11 měsíci

    east or west naik sir is suppper duper best

  • @pankajgoikar4158
    @pankajgoikar4158 Před rokem

    You are just amazing Sir. 😊

  • @darshanvala9224
    @darshanvala9224 Před 2 lety

    10 out of 10

  • @mdyounusahamed6668
    @mdyounusahamed6668 Před rokem

    Please make some videos on soft clustering algorithm (ex. Fuzzy C Means)

  • @rahulaher3874
    @rahulaher3874 Před rokem

    10/10 rating

  • @sejalkale67
    @sejalkale67 Před 2 lety

    A humble request to you @Krish,make next live session streams on Machine learning practice and practicals

  • @sandeepagarwal8566
    @sandeepagarwal8566 Před 2 lety +2

    Yes Deep learning course

  • @harshgupta3641
    @harshgupta3641 Před 2 lety

    This video is incredible, and very well explained . But if we have more than one feature in our dataset, should we make the feature selection first and then perform the elbow test?

  • @kkevinluke
    @kkevinluke Před 2 lety +1

    Is the silhouette score applicable to hierarchical clustering? as some clusters are within other clusters. How do we differentiate a(i) from b(i) then?

  • @amritakaul87
    @amritakaul87 Před 2 lety

    @KRISHNAIK SIR, KINDLY PROVIDE THE DBSCAN VIDEO LINK

  • @zahrasiraj766
    @zahrasiraj766 Před 2 lety

    sir can you make an urgent lecture on cluster labeling problem ?? document cluster labeling thing ? and what if we enhance this issue as hierarchical cluster labeling thing ?

  • @dukesoni5477
    @dukesoni5477 Před 2 lety

    Mil gya bhai ml padhna ka channel ekdum maja aagya sir

  • @ishwarsalunke1838
    @ishwarsalunke1838 Před rokem

    Silhouette score

  • @rafibasha4145
    @rafibasha4145 Před 2 lety

    Please let me know on which kind of data like linear ,non linear etc which algorithm works better

  • @chitrranshia7765
    @chitrranshia7765 Před 2 lety +1

    Quick qq. High bias meaning better accuracy. ??

  • @ankan54
    @ankan54 Před 2 lety

    What are the type of Biases can there be in a dataset? how to answer this question ?

  • @anubhabsaha3760
    @anubhabsaha3760 Před rokem

    Andrew NG of INDIA==Krish Naik Sir

  • @BhavyaArora-co2wd
    @BhavyaArora-co2wd Před 3 měsíci

    Could someone share github link which is being referenced at 51:51?

  • @ALLINONEMEDIA33
    @ALLINONEMEDIA33 Před měsícem

    Can I've the git hub link here please 😵‍💫

  • @dataanalyst1012
    @dataanalyst1012 Před 2 lety

    Hello sir. Do you, by any chance, know about the assumptions of k means cluster analysis in the case of large variance?

  • @paneercheeseparatha
    @paneercheeseparatha Před rokem

    K means clustering is not mathematically clear. The line you're drawing connecting the two centroids is ok, but how does that perpendicular line drawn. means how is that perpendicular line decided? Also for any new point, will that line be used to classify for k nearest neighbours is to be used?

  • @user-mo4xq3zp2j
    @user-mo4xq3zp2j Před 4 měsíci

    Sir can you please provide the github link?

  • @dataanalyst1012
    @dataanalyst1012 Před 2 lety

    In k means clustering, is there an assumption in numbers of observations and variables? Would having variables greater than observation affect the results of clustering and make it less accurate?

  • @hamzasabir6480
    @hamzasabir6480 Před rokem

    Hello Krish! How it is possible to have 3 centroids when k=2 is specified as you told at 32:00 while introducing kmeans plus?

  • @sridharbajpai420
    @sridharbajpai420 Před 11 měsíci

    51:27 k means cant do cluster like this , kmeans created convex pattern in data

  • @harshavardhansvlkkb2290

    10/10

  • @arpita0608
    @arpita0608 Před rokem

    I don't understand after knowing the clusters we draw the histogram in hierarchical clustering and you are showing we need to draw a parallel like and the number of vertical lines it intersects will be number of clusters?? I mean we already drawing the histogram based on the clusters. Doesn't make sense what you told.

  • @user-wg4ms3xk3p
    @user-wg4ms3xk3p Před 9 měsíci

    how to find eps and impis in dbsan

  • @tom-shellby
    @tom-shellby Před 2 lety

    Sir, if low bias - high variance is overfitting and high bias - high variance is underfitting , then what is high bias - low variance ?

    • @shubhamnaik9555
      @shubhamnaik9555 Před rokem

      That is practically not possible because u will not get a model that performs bad on training data but somehow performs well on test data.

  • @sidindian1982
    @sidindian1982 Před 2 lety

    silhouette Code is dam tough to understand Sir 😞

  • @siddhantkohli5063
    @siddhantkohli5063 Před 2 lety

    Sir pls make a video ON pea

  • @parthshah5482
    @parthshah5482 Před rokem

    silhoit score

  • @basavarajag1901
    @basavarajag1901 Před rokem

    can i know the matrial link ?

  • @shreyasnatu3599
    @shreyasnatu3599 Před 2 lety

    anyone knows where I can get data science/ml internships? I am in third yr of comp eng

  • @deepsarkar2003
    @deepsarkar2003 Před 2 lety

    Where is the Github link for this?

  • @RumiAnalytics2024
    @RumiAnalytics2024 Před 2 lety

    I didnt find the githuub link sir