The Right Way to Detect Outliers - Outlier Labeling Rule (part 1)

  • Added 18 Aug 2024
  • I demonstrate what is arguably the most valid way to detect outliers in data that roughly follow a normal distribution: the outlier labeling rule. I also point out that 2.2 is a more appropriate multiplier than the more common 1.5.
    The formulae I use in the video are:
    Upper = Q3 + (2.2 * (Q3 - Q1))
    Lower = Q1 - (2.2 * (Q3 - Q1))
    The references in the video are:
    Tukey, J. W. (1977). Exploratory Data Analysis. Reading, MA: Addison-Wesley.
    Hoaglin, D. C., Iglewicz, B., and Tukey, J. W. (1986). Performance of some resistant rules for outlier labeling. Journal of the American Statistical Association, 81, 991-999.
    Hoaglin, D. C., and Iglewicz, B. (1987). Fine tuning some resistant rules for outlier labeling. Journal of the American Statistical Association, 82, 1147-1149.
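
The two formulae in the description translate into a few lines of code. The following is a minimal Python sketch and is not taken from the video. The function names, the sample scores, and the quartile method (`method="inclusive"`) are assumptions for illustration. Statistical packages compute quartiles in slightly different ways, so the fences may differ a little from what other software reports.

```python
import statistics


def outlier_labeling_fences(values, multiplier=2.2):
    """Return (lower, upper) fences from the outlier labeling rule.

    Lower = Q1 - multiplier * (Q3 - Q1)
    Upper = Q3 + multiplier * (Q3 - Q1)
    """
    # statistics.quantiles with n=4 returns [Q1, median, Q3].
    # The "inclusive" method is an assumption; other software may use
    # a different quartile definition.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - multiplier * iqr, q3 + multiplier * iqr


def label_outliers(values, multiplier=2.2):
    """Return the values that fall outside the fences."""
    lower, upper = outlier_labeling_fences(values, multiplier)
    return [v for v in values if v < lower or v > upper]


# Hypothetical scores, made up for this example.
scores = [12, 14, 15, 15, 16, 17, 18, 19, 20, 48]
lower, upper = outlier_labeling_fences(scores)
print("Lower fence:", lower)
print("Upper fence:", upper)
print("Labeled outliers:", label_outliers(scores))
```

Passing `multiplier=1.5` or `multiplier=3` gives the other variants of the rule that come up in the comments. Negative scores need no special handling, since the fences depend only on the quartiles.
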

Comments • 30

  • @ftothel794 9 years ago +27

    I don't think you fully comprehend HOW HELPFUL YOU AND YOUR VIDEOS ARE!!!!
    Thank you so much!!

  • @joannaelson6166 2 years ago

    I don't think you fully comprehend HOW HELPFUL YOU AND YOUR VIDEOS ARE!!!! (still true)

  • @imchillin2k7 9 years ago +2

    If you haven't already brought out a book, please do!!!!
    This helped me way more than any book I've tried to decipher. Thank you!!!

  • @parksongkun 11 years ago +1

    Great! Thank you for the nice solution for detecting outliers systematically!

  • @nikkivanluxemburg344 8 years ago +11

    What about non-normal distributions?

    • @deadlypalms 5 years ago +1

      God I hope I find this video soon - my data is totally non parametric!

  • @salve_rex9256 9 years ago +4

    You mentioned in the video a method for non-normally distributed data. Can you give some clues?

  • @MitsosDA 7 years ago +1

    Do you have, and can you test for, outliers when dealing with categorical data? Thank you.

  • @caaanyoudigit 3 years ago

    Hmm...why are my box plots and histograms blue instead of the color in the video?

  • @lenka4497 6 years ago

    thank you so much...your videos are very helpful :)

  • @lauraguerrero876 9 years ago

    Thanks. Very useful.

  • @vanessamadrazo5179 10 years ago

    Hello,
    Thanks for creating this video! :) Is there a reference citation for the Outlier Labeling Rule?

    • @how2stats 10 years ago

      The references are in the description of the video.

  • @dcincoltesccartofii 11 years ago

    If I have multiple scale scores from the same sample, which variable do I put in? The total raw score? The individual questions? Do I put them all in, or do I do this for each scale separately?

  • @aminaalaqal 9 years ago

    thanks a lot sir it was a helpful video

  • @MichelleSoekoe 11 years ago

    What multiplier does SPSS use to calculate outliers? I know the norm is either 1.5 or 2.2, but I need to calculate my outliers at 3 times the IQR to the left and right of the first and third quartiles. Is there a way to change this setting in SPSS so that I can get a box-and-whiskers plot that shows the outliers at 3 times the IQR instead of whatever SPSS uses as its standard multiplier?

  • @ruth1351 3 years ago

    Hey, thanks for the video. May I ask, what if the score is negative?

  • @Schatsie525 8 years ago

    Hey, every time I run a linear regression, my model keeps giving me new casewise diagnostics. My KS test is under 0.2. Can you please help me? We have already created dummy variables and taken the natural log (LN) of our independent variable.

  • @assoc.prof.dr.hamidmohsinj5254

    Thank you very much.
    Does anyone know the name of this method (for example: Mahalanobis, the variance-covariance matrix, the Fast-MCD algorithm, etc.)?

  • @datsme888 11 years ago

    Please tell us which software package you used to create your data (normal and non-normal).

  • @esunder2003 12 years ago

    thanks for the info.

  • @tabi151214 8 years ago

    My data has academic achievement as the dependent variable and intelligence as the independent variable. Should I check for outliers in achievement or in intelligence?

    • @how2stats 8 years ago

      +tabi151214 Both variables need to be examined for outliers.

  • @humtum7983 7 years ago

    Can you please share the sample data with us?

  • @co20b 8 years ago

    What if my data is not normally distributed?

    • @how2stats 8 years ago

      +Onomato Poet Consider using bootstrapping for your test statistic

    • @co20b 8 years ago

      +how2stats How does that work?
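
The thread gives no details on the bootstrapping suggestion, so the following is only a minimal sketch of one common form, the percentile bootstrap. It resamples the data with replacement many times, recomputes the statistic each time, and reads a confidence interval off the sorted results. The function name, the default settings, and the sample data are assumptions for illustration, and the statistic (here the mean) stands in for whatever test statistic is being analysed.

```python
import random
import statistics


def bootstrap_ci(values, stat=statistics.mean, n_resamples=2000,
                 alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for `stat` (a sketch)."""
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    n = len(values)
    # Draw n values with replacement, compute the statistic, repeat.
    estimates = sorted(
        stat(rng.choices(values, k=n)) for _ in range(n_resamples)
    )
    # Take the alpha/2 and 1 - alpha/2 percentiles of the estimates.
    low_index = int((alpha / 2) * n_resamples)
    high_index = int((1 - alpha / 2) * n_resamples) - 1
    return estimates[low_index], estimates[high_index]


# Hypothetical skewed sample, made up for this example.
sample = [1, 2, 2, 3, 3, 3, 4, 5, 9, 21]
low, high = bootstrap_ci(sample)
print("Approximate 95% interval for the mean:", low, high)
```

The interval is built from the data's own resampled distribution, so it does not rely on the statistic being normally distributed.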

  • @Romeowasbleeding1 12 years ago

    I really really appreciate these videos. If you would like some constructive criticism, you tend to smack your lips a lot, it might be a good idea to pay attention to that while recording. Thanks once again!

  • @how2stats 11 years ago

    Whatever you're doing your statistical analysis on. If you're only going to analyse the total score, then you only need to look for outliers there. But if you plan on doing analyses at the item level, then you should look for outliers there too. I doubt you'll find outliers at the item level.

  • @speedshift2971 1 year ago

    who came up with the outlier formula? maybe we've been throwing out important data this whole time!