Hierarchical Cluster Analysis in SPSS (SPSS Tutorial Video #29) - Dendrogram

  • Added on 9 July 2024
  • In this video I describe how to conduct and interpret the results of a Hierarchical Cluster Analysis in SPSS. I especially emphasize using Ward's method to create more evenly sized clusters, and spend a lot of time on how to interpret a dendrogram, with the goal of identifying a reasonable number of clusters to include in a final clustering solution.
    This SPSS tutorial series is designed to teach you the basics of how to analyze and interpret the results of data using SPSS. I will cover everything from the very basics of the main windows within SPSS, to manipulating data, to running and interpreting meaningful analyses like t-tests, ANOVA, regression, and many more, and visualizing results.
    Link to Hierarchical Cluster Analysis Video: • Hierarchical Cluster A...
    Link to K-Means Cluster Analysis Video: • K-Means Cluster Analys...
    Link to Two Step Cluster Analysis Video: • Two Step Cluster Analy...
    The data file used in this video can be found here: drive.google.com/file/d/1-Bbn...
    Video tutorial and walkthrough of the data file used in this video: • Introduction to Data F...
    Playlist of videos covering INTUITION for statistics and data science: • Data Intuition
    All the SPSS tutorial videos are in this playlist: • SPSS Tutorials
    Learn more about who I am and why I'm doing this here: • Data Demystified - Who...
    Follow me at:
    LinkedIn: / jeff-galak-768a193a
    Patreon: / datademystified
    Website: www.jeffgalak.com/datademystified
    Equipment Used for Filming:
    Nikon D7100: amzn.to/320N1FZ
    Softlight: amzn.to/2ZaXz3o
    Yeti Microphone: amzn.to/2ZTXznB
    iPad for Teleprompter: amzn.to/2ZSUkNh
    Camtasia for Video Editing: amzn.to/2ZRPeAV
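
The workflow described above (Ward's method, then reading the dendrogram to pick a number of clusters) can be sketched outside SPSS. This is a minimal pure-Python illustration, not the SPSS procedure itself: the function names, the toy data, and the "largest jump in merge cost" rule for choosing the number of clusters are all assumptions made for the example. The jump rule is a common heuristic that roughly mirrors looking for the longest unbroken stretch in a dendrogram.

```python
from itertools import combinations


def ward_merges(points):
    """Agglomerate points with Ward's criterion.

    Returns one (members_a, members_b, cost) tuple per merge, in merge order.
    """
    # Each active cluster: id -> (member indices, centroid, size).
    clusters = {i: ([i], list(p), 1) for i, p in enumerate(points)}
    next_id = len(points)
    merges = []
    while len(clusters) > 1:
        best = None
        for a, b in combinations(clusters, 2):
            _, ca, na = clusters[a]
            _, cb, nb = clusters[b]
            sq_dist = sum((x - y) ** 2 for x, y in zip(ca, cb))
            # Ward's criterion: increase in within-cluster sum of squares
            # that merging clusters a and b would cause.
            cost = na * nb / (na + nb) * sq_dist
            if best is None or cost < best[0]:
                best = (cost, a, b)
        cost, a, b = best
        members_a, ca, na = clusters.pop(a)
        members_b, cb, nb = clusters.pop(b)
        centroid = [(na * x + nb * y) / (na + nb) for x, y in zip(ca, cb)]
        clusters[next_id] = (members_a + members_b, centroid, na + nb)
        merges.append((members_a, members_b, cost))
        next_id += 1
    return merges


def suggest_k(merges):
    """Heuristic (an assumption): stop just before the largest jump in cost.

    Needs at least two merges, i.e. at least three points.
    """
    costs = [cost for _, _, cost in merges]
    gaps = [costs[i + 1] - costs[i] for i in range(len(costs) - 1)]
    i = max(range(len(gaps)), key=gaps.__getitem__)
    # After merges 0..i have been applied, len(merges) - i clusters remain.
    return len(merges) - i


def cut_tree(n_points, merges, k):
    """Replay merges until k clusters remain; return one label per point."""
    labels = list(range(n_points))
    for members_a, members_b, _ in merges[:n_points - k]:
        target = labels[members_a[0]]
        for i in members_b:
            labels[i] = target
    return labels


# Hypothetical toy data: two visibly separate groups of three cases each.
points = [(1.0, 1.0), (1.2, 0.9), (0.9, 1.1),
          (5.0, 5.0), (5.1, 4.8), (4.9, 5.2)]
merges = ward_merges(points)
k = suggest_k(merges)
labels = cut_tree(len(points), merges, k)
print(k, labels)
```

On real data the jump in merge cost is rarely this clean, so the suggested number is a starting point for judgment rather than an answer.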

Comments • 40

  • @danasfar 2 years ago +2

    This video is SO clear! As short as possible with everything I need to know. THANKS a lot!

  • @herbhupfer1295 11 months ago

    Really good job of explaining a lot of options. You spoke VERY clearly which is so important in understanding the nuances of these procedures.

  • @radosawpuakowski6982 2 years ago +3

    Excellent tutorial. In the first two minutes you answered my questions and solved my problem. My compliments on your didactic skills.

  • @pietroserrano8468 2 years ago

    Thank you, sir. I'm from Portugal and your videos helped me with a Data Analysis test. You're such a good teacher!!!

  • @dsavkay 3 months ago

    Thank you for the explanation!! Amazing

  • @4Basileus 5 months ago

    Nice video! Thanks a lot!

  • @akibreza450 2 years ago +1

    Thank you this was really helpful.

  • @mahamanoumar1802 3 years ago +2

    thank you Sir

  • @FriendlyOne_ 3 months ago

    Thank you so much

  • @white1145 1 year ago

    tysm!!!

  • @kristingibson3883 1 year ago

    This is such a helpful video! I am curious as to why you used individual items instead of creating a scale from the items for the cluster analysis? Thanks in advance for your help!

  • @Charlatan_11235 1 year ago

    Great job, thanks so much for these videos. I have a question, though. Can you point to any authors I can cite to justify how I pick my clusters? Thanks so much in advance!

  • @horanghae4649 2 years ago

    Hello, sir! I am currently working on my thesis and this tutorial helps a lot! May I ask, how can I identify the cluster with high similarities of data by inspecting the dendrogram? Is it possible, or should I apply some calculations to address this? Thank you so much!

  • @tinageorge9065 1 year ago

    This is a wonderful video! Thank you. During a discussion with a statistician, they asked me what the cluster analysis fit was. How do I defend (with numbers or a stat) the number of clusters I chose?

  • @sigurd805 3 years ago +3

    Hello Jeff! Thanks for a great video. I'm currently trying to interpret the results of a dendrogram. I'm using a data set provided by my professor, who informed us there should be 7 clusters for our K-means clustering. However, I wanted to check if that would match a dendrogram following your instructions, but it doesn't seem to add up. Despite the lack of information, do you have an idea about what might cause the difference?

    • @DataDemystified 3 years ago +1

      Hi, thanks! It's really hard to know without seeing the data/dendrogram, but a first guess would be the method you're using. I suggest using Ward's Method, but it's possible your professor prefers something else for perfectly good reasons. That would be the first place I would look. Good luck!

  • @marie-eveclement280 3 years ago +1

    Thank you for the videos! Very helpful! Can I enter a binary variable into a cluster analysis?

    • @DataDemystified 3 years ago

      Glad you're finding them useful! You can, but not in a hierarchical analysis. For that, you'll need Two-Step Cluster Analysis (see my video on that here: czcams.com/video/rh0Hv0SUIZk/video.html)

  • @heynadia 1 year ago

    Should we include the dependent variable in the analysis?

  • @missioneconomics7490 3 months ago

    Can cluster analysis be applied to secondary data?

  • @the_tons 6 months ago

    Is the Ward method available in the standard SPSS package?
    What version of SPSS was used in this video? TY❤

  • @warisk7980 1 year ago

    thx

  • @susanachieng5419 2 years ago

    Hello sir, can you do the same using K-means?

  • @mehdimajidi8755 5 months ago

    How do I remove the gray background from the dendrogram? I want to use the dendrogram in my article. Is there a way to export only the cluster lines?

  • @farshadjahangiri5492 2 years ago

    Hi there. If I want to cluster 30 stations with 7 parameters and a time series covering 9 years, what can I do? Can you help me? This will be a 3-dimensional matrix.

  • @muhammadqasim-bm7oj 2 years ago

    Dear Researcher, kindly guide me: how can I cluster the questionnaire line items of a large data set, with more than 1000 observations?
    I have 79 final questionnaire line items, and I now want to cluster them into distinct latent variables. Kindly guide me on how I can cluster the line items. Thanks in anticipation.

  • @ehteshamali2893 2 years ago

    Hello sir. I watched the video. I still do not understand how you deduced from the dendrogram that we have 2 clusters.

  • @jeremysayer8680 2 years ago

    Thank you, your video is very informative, and definitely deserves more views. I was wondering about the clustering of variables together - not on a case-by-case basis, but on a variable-by-variable basis. I assume that, in SPSS, this would give you an indication of the overall clustering of similar variables together, and how similar the data points in one variable are, relative to the others. Is that right? So in your example, it would show groups of similar dimensions of "importance", and their hierarchical linkage to the other dimensions of "importance"?

    • @DataDemystified 2 years ago +1

      I think you're describing a factor analysis. See my video on that here: czcams.com/video/8uVEEGeg45s/video.html

    • @jeremysayer8680 2 years ago

      @DataDemystified Thank you, I’ve also watched your excellent video on factor analysis; and you’re right, that is very good for looking at the correlations between variables, specifically. But in terms of the actual hierarchical cluster analysis, I was wondering what outcome is generated in SPSS when you are performing a hierarchical cluster analysis and under the cluster options you select cluster>variables, as opposed to cluster>cases. Most people talk about clustering the cases, but I was wondering what the outcome is if you cluster the variables when performing a hierarchical cluster analysis.

    • @DataDemystified 2 years ago

      @jeremysayer8680 Got it! Full disclosure: I've never used the cluster->variables approach in hierarchical cluster analysis. I always use PCA/Factor analysis to group variables. However, looking at it a bit more closely, it appears that hierarchical cluster (variables) applies the same basic approach to grouping variables that it does to grouping values. For instance, the dendrogram shows groupings of variables in a hierarchical manner (sub and superordinate groups). So, it seems like a nice way to eyeball groups of variables...however I have not seen this approach used anywhere in the social sciences, so I suspect it isn't particularly robust.

    • @jeremysayer8680 2 years ago

      ​@@DataDemystified Thanks, that makes sense. As you allude to, I've also seen quite different clusters of (latent) variables being spotted between the two techniques, but it’s valuable as a support tool. Thanks for all the time you’ve taken to respond!

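The idea discussed in the thread above, clustering variables rather than cases, can be sketched in a few lines. This is only an illustration under stated assumptions: the dissimilarity `1 - |Pearson r|`, the average-linkage rule, the function names, and the toy "importance" ratings are all made up for the example and are not taken from SPSS or from the video.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def cluster_variables(data, k):
    """Group the variables in `data` (name -> list of values) into k clusters."""
    names = list(data)
    # Assumed dissimilarity: strongly correlated variables count as "close".
    dist = {
        frozenset(pair): 1 - abs(pearson(data[pair[0]], data[pair[1]]))
        for pair in combinations(names, 2)
    }

    def avg_link(pair):
        # Average-linkage distance between two clusters of variable names.
        ca, cb = pair
        total = sum(dist[frozenset((a, b))] for a in ca for b in cb)
        return total / (len(ca) * len(cb))

    clusters = [[name] for name in names]
    while len(clusters) > k:
        ca, cb = min(combinations(clusters, 2), key=avg_link)
        clusters.remove(ca)
        clusters.remove(cb)
        clusters.append(ca + cb)
    return clusters


# Hypothetical importance ratings from six respondents on four items.
ratings = {
    "price": [1, 2, 3, 4, 5, 6],
    "cost": [2, 2, 3, 5, 5, 6],
    "style": [5, 1, 4, 2, 6, 3],
    "looks": [5, 2, 4, 1, 6, 3],
}
groups = cluster_variables(ratings, 2)
print(groups)
```

As the replies note, factor analysis is the more established way to group variables, so a grouping like this is best treated as a quick visual aid.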
  • @thesky7418 2 years ago

    Hello sir, thank you very much for your great lesson. I would like to ask: I have a data list of fungal species and their host plants. Can I do a single cluster analysis for both to understand the relationship between fungal species and plant species?
    Thank you.

    • @DataDemystified 2 years ago

      Happy to help. I don't have a good answer for you as I don't really know anything about fungi and plants. I would need a much better sense of the data and what you're trying to accomplish. Good luck!

  • @asdadadadsasda 2 years ago

    Dear Prof., thanks for all the fantastic videos. I would like to perform a cluster analysis with binary data (0 or 1). What algorithm would you recommend for a hierarchical analysis? And could you make a video on how to perform the silhouette method in SPSS? Thank you very much.

    • @DataDemystified 2 years ago

      Two step is the way to go.

    • @asdadadadsasda 2 years ago

      @DataDemystified Could you explain it a little bit more or share a link, etc.?

  • @dagnawzeleke8638 2 years ago

    Could you analyze the data I give you?