Hierarchical Cluster Analysis [Simply explained]

Sdílet
Vložit
  • čas přidán 5. 07. 2024
  • What is Hierarchical Cluster Analysis? And how is it calculated?
    A hierarchical cluster analysis is a clustering method that creates a hierarchical tree of objects to be clustered (Dendrogram). The tree represents the relationships between the objects and shows how the objects are clustered at different levels.
    ► Load sample data set
    datatab.net/statistics-calcul...
    ► Online Calculator Hierarchical Cluster Analysis
    datatab.net/statistics-calcul...
    ► Hierarchical Cluster Analysis Tutorial
    datatab.net/tutorial/hierarch...
    ► E-BOOK
    datatab.net/statistics-book
    00:00 What is Hierarchical Cluster Analysis?
    00:31 Example of Hierarchical Cluster Analysis
    00:50 Calculate hierarchical cluster analysis
    06:32 Calculate hierarchical cluster analysis online

Komentáře • 55

  • @4chanFootballMemes
    @4chanFootballMemes Před 4 měsíci +26

    I loved learning about "Heyrakikal" clustering

  • @amobindubuisi2631
    @amobindubuisi2631 Před měsícem +3

    this is an extremely good material. top-notch. never seen something so easily explained as done on this content.

  • @odiakaolika5715
    @odiakaolika5715 Před 3 měsíci +7

    You just made my evening with your simple explanation.

    • @datatab
      @datatab  Před 3 měsíci

      Glad it was helpful and many thanks for your feedback! Regards Hannah

  • @ozgurogur1297
    @ozgurogur1297 Před rokem +2

    I found it very understandable and simple. thanks a lot!

  • @alexfrancois
    @alexfrancois Před rokem +1

    Beautifully explained, thanks! 🙏 Incredibly clear.

  • @Motivasi.Quotes
    @Motivasi.Quotes Před 12 dny

    such a very good vidio. Thank u so much for your explanation

  • @masteroftheworld001
    @masteroftheworld001 Před 10 měsíci

    well explained thank you so much

  • @shawnkim6287
    @shawnkim6287 Před 10 měsíci

    thank you so much. you clarified a lot!!!!
    😀

  • @osmancetinkaya8930
    @osmancetinkaya8930 Před rokem +10

    How might be the sqr of 17 (16+1) =equal to 3,162 ? it must be 4,123 is not?

    • @manuelruelas3496
      @manuelruelas3496 Před 10 měsíci +4

      The error is that the x distance is 3 (from 1 to 4) not 4, so it’s the sq root of 10.

  • @nakirambau7632
    @nakirambau7632 Před 8 měsíci +2

    thank you so much, you have explained it so well

    • @datatab
      @datatab  Před 8 měsíci +1

      Glad it was helpful!

  • @saurabhjoshi3010
    @saurabhjoshi3010 Před 7 měsíci

    nicely explained

  • @manuelleitner3196
    @manuelleitner3196 Před rokem +2

    Great video, thank you!!!

  • @rodidoesburg4061
    @rodidoesburg4061 Před 5 měsíci

    How do you name the clusters? Just from left to right, so cluster 1, cluster 2, cluster 3. Or are there more methods to name a cluster?

  • @ricardorivashernandez4023
    @ricardorivashernandez4023 Před 9 měsíci

    Real good!

  • @rileyharper7679
    @rileyharper7679 Před 7 měsíci +4

    The Euclidean distance horizontal component at 2:17 should be 3 not 4 since 4 - 1 = 3. Also, the manhattan distance should be 4 and the maximum distance should be 3 for the same reason.

    • @playbros332
      @playbros332 Před 6 měsíci +1

      I agree they are wrong, but shouldn't it be square root of 17, which is 4.12?

    • @fabianr9394
      @fabianr9394 Před 6 měsíci

      Because you go 3 steps to the right and 1 up; so sqrt(3^2 + 1^2)​@@playbros332

  • @matheusdelima1743
    @matheusdelima1743 Před rokem +1

    Great content. I'm a fan :)

    • @datatab
      @datatab  Před rokem +1

      Glad it was helpful and many thanks for your nice feedback! Regards Hannah

    • @iqraahmad130
      @iqraahmad130 Před rokem

      youre kinda cute

  • @ibethdiaztapia1033
    @ibethdiaztapia1033 Před 21 dnem

    hi. it should 3 - 1 for euclidean as the formula is square of XB1-XA1

  • @samuraixyz22
    @samuraixyz22 Před rokem +2

    I would like to CZcams tutorials like this. Do you have recommendations on what softwares to use?

  • @ibrahimabubakarzango9803
    @ibrahimabubakarzango9803 Před 3 měsíci +1

    Pls endeavour to avoid making mistakes thanks for comment section i could have got it so difficult to comprehend. That aspect of sqrt of 17 is terrible. But u did well and this video is good too

    • @datatab
      @datatab  Před 3 měsíci

      Hi thanks for youre feedback! We try to avoid mistakes, sorry for that and for the resulting trouble! Regards, Hannah

    • @Oladayo1
      @Oladayo1 Před 3 měsíci

      well, that's because it's the sqrt of 10 not sqrt of 17. The mistake was using 4 instead of 3

  • @luisamar8214
    @luisamar8214 Před 2 měsíci

    How you calculate the distances between Lisa, Joe with the others?? you have a group of positions not just one... how do you do that? thankss!

    • @datatab
      @datatab  Před 2 měsíci

      Hi, in this case you would first calcualte the center between Lisa and Joe and then the diestance from this center to one other Person. Regards Hannah

  • @nazhifmuh.kasyfan2148
    @nazhifmuh.kasyfan2148 Před 2 měsíci +1

    I would like to ask, is Hierarchical Cluster Analysis always associated with the Euclidean Distance? Thank you

    • @datatab
      @datatab  Před 2 měsíci

      Hi many thanks for your question, Hierarchical Cluster Analysis (HCA) is not always associated with the Euclidean distance. While Euclidean distance is commonly used, HCA can work with various distance metrics depending on the nature of the data and the analysis goals.
      Here are some common distance metrics used in HCA:
      - Euclidean Distance: This is the straight-line distance between two points in a multi-dimensional space. It's one of the simplest and most widely used distance metrics.
      - Manhattan Distance (also known as City Block or L1 distance): This is the sum of absolute differences between coordinates. It can be suitable when diagonal movement isn't meaningful.
      - Cosine Similarity: This measures the cosine of the angle between two vectors, commonly used in text analysis and other contexts where vector magnitude might vary.
      - Mahalanobis Distance: It accounts for correlations in data by incorporating the covariance matrix, making it suitable for data with different scales and correlations among variables.
      - Minkowski Distance: A generalization of Euclidean and Manhattan distances, with a parameter 'p' to control the degree of the norm.
      - Correlation-based Distance: This distance uses the correlation between data points rather than absolute differences. It's common in gene expression analysis or other contexts where relationships between variables matter more than absolute values.
      I hope this was helpful : ) Regards Hannah

  • @python4ncert202
    @python4ncert202 Před rokem

    Nice video!
    I want to know the name of algorithm that you have used here to explain hierarchical clustering.

  • @maxwellspyk494
    @maxwellspyk494 Před rokem +1

    hi where can i find the elbo method

    • @datatab
      @datatab  Před rokem

      Oh sorry, it will be there soon!!!

  • @s.h.a6472
    @s.h.a6472 Před 5 dny

    خدا خیرت بده بانو

  • @mahidahmed7
    @mahidahmed7 Před rokem +1

    klaaastarrrrss

  • @fredh3152
    @fredh3152 Před 2 měsíci +1

    i love your accent

  • @ahmad3823
    @ahmad3823 Před 2 měsíci +1

    4-1=3 though!

  • @user-vo4ew1gx
    @user-vo4ew1gx Před rokem +4

    Excellent explanation. Why it takes too long to create a new video?

    • @datatab
      @datatab  Před rokem +2

      Good question! : ) We need almost two weeks to prepare the topic and to create the slides! Regards Hannah

    • @user-vo4ew1gx
      @user-vo4ew1gx Před rokem

      @@datatab i hope it will be fast :)

  • @PaulKam1997
    @PaulKam1997 Před rokem +1

    is and not und at 3:15

  • @asrarbw
    @asrarbw Před 3 měsíci

    Claaaastars 😂

  • @abdulaziznazarov9661
    @abdulaziznazarov9661 Před 4 měsíci +1

    i think you have a mistakes with calculating