Video není dostupné.
Omlouváme se.

Shapiro-Wilk test

Sdílet
Vložit
  • čas přidán 18. 08. 2024
  • The Shapiro-Wilk test to test for deviations from normality. Also includes an introduction to Q-Q plots, and how they can be used to graphically assess normality.

Komentáře • 16

  • @logantyner2214
    @logantyner2214 Před 3 lety +6

    The way you broke this down makes it incredibly easy to understand. thank you!

  • @langleymcentyre2754
    @langleymcentyre2754 Před 5 měsíci

    Thank you for making this video it really clarified the concepts for me

  • @krahoc
    @krahoc Před 3 lety +6

    Remember Shapiro-Wilk's test should not be used for populations greater than 50, so basically it's quite useful for small populations even if it's a bit sensitive.
    Anyway, great video, maybe you should develop a bit the p-value part explaining how you could interpret it as a confidence indicator about H0.

    • @user-hu1sn5or7j
      @user-hu1sn5or7j Před 3 lety +1

      Why such a limitation? Which test is best for a population greater than 50?

    • @krahoc
      @krahoc Před 3 lety +3

      @@user-hu1sn5or7j the limitation is inheritant to the test method. In their paper, Shapiro and Wilk use a table to compare the W value to a tabulated critical W, but the table goes only from 3 to 50.
      There is a way to "extend" the test by tabulating yourself the critical values with Monte-Carlo simulation, but that's beyond the Shapiro-Wilk's paper. ;)
      I don't really know what's behind the scenes of the R calculations but I think that's what is done here.

    • @diazjubairy1729
      @diazjubairy1729 Před 3 lety

      @@krahoc but i've read recent paper talking about shapiro-wilk test is good enough up until 1000 datas

    • @krahoc
      @krahoc Před 3 lety +1

      @@diazjubairy1729 if you compute the critical W values by Monte-Carlo simulation, you may use it with any amount of values but the more data you test the less discriminant it will be, just like every normality test. That's the curse of big samples :')

    • @waytolegacy
      @waytolegacy Před 4 dny

      Just wondering, if it is not good for more than 50 samples, then why shouldn't we just simply use the skewness to asses the normality. I mean If -0.5 < skewness < 0.5 then Yes else No.

  • @keerthanavivin450
    @keerthanavivin450 Před 2 lety

    Great video! Understood the concept perfectly. A big thanks.

  • @aliakbarsafdari4084
    @aliakbarsafdari4084 Před 3 lety

    This was an excellent video, thanks.
    So, my dataset has columns with very varying counts of values (from 8 to >250).
    Can I use this test as a rough estimation of normality for each column? Or should I prepare Q-Q plots for each?
    Also, if I wish to seek the statistical significance between these columns (they measure the same parameter), what test would you recommend?
    Thanks again.

  • @mobileentertainment212

    could we get more explanation on the numerator? on the actual derivation of the variable a

  • @tomp4925
    @tomp4925 Před rokem

    Is there a formula to calculate "a"?

  • @HuzMusFinanceData
    @HuzMusFinanceData Před 3 lety

    Thank you for this Video.. In simple words, explain if the P-value is < 0.05 so Data is normal or Abnormal?

    • @swastikpatro6436
      @swastikpatro6436 Před 2 lety +2

      Normal

    • @HuzMusFinanceData
      @HuzMusFinanceData Před 2 lety

      @@swastikpatro6436 Thank you. ❤

    • @vikramjoshi3452
      @vikramjoshi3452 Před 7 měsíci

      If p < 0.05, the data are not normally distributed. Remember, the null hypothesis for Shapiro.test() is that the data are normally distributed.