How to resolve Class Imbalance in R

Sdílet
Vložit
  • čas přidán 17. 09. 2020
  • Download the R script at:
    github.com/mariocastro73/ML20...

Komentáře • 17

  • @barbaraellensf2128
    @barbaraellensf2128 Před 3 lety +3

    I am searching for hours on how to do oversampling in R, your explanation was the only one that worked for me kkkkkk !!!!! Thank youuuuu very much , you saved my semester

  • @jrichalot
    @jrichalot Před rokem +1

    Thanks. Love how simple you made it look like. It's great to have base R explanation before using a package to do the same.

  • @lauraeli2286
    @lauraeli2286 Před rokem +1

    Thanks once again Mario - your videos are always concise!

  • @domillima
    @domillima Před rokem +1

    Amazing tutorial. Great job. You gained a subscriber! Thanks for the content and keep it up!

  • @gabrielmurarideandrade5755

    Thank you! Helped me in econometrics class. From Brasil!

  • @katerina_lina
    @katerina_lina Před rokem

    Yogur videos are extreamly useful!❤

  • @floh2209
    @floh2209 Před 3 lety +1

    Very helpful, thanks!

  • @praneshbiswas4335
    @praneshbiswas4335 Před 2 lety +1

    Thank you very much for the hands-on tutorial on binary imbalanced issue. Could you please also do a video tutorial on solving class imbalance issue on multiclass problem where there are 5 or 6 classes to identify and data is not balanced among them. Thanks again for the video.

  • @fellamechouar2255
    @fellamechouar2255 Před 2 lety

    thank you so much , you are the best of the quartier !

  • @poojamahesh8594
    @poojamahesh8594 Před 3 lety +1

    This was very helpful

  • @julietlozano7197
    @julietlozano7197 Před 2 lety +1

    Thank you very much, it has been very helpful, but I have a question, in your opinion which one is better and why?

    • @MarioCastroPonce
      @MarioCastroPonce  Před 2 lety +2

      It depends on the amount of data available. If you have thousands of observations and the imbalance is not as extreme as 20%/80%, do nothing. Otherwise, try everything to see the impact of imbalance. Sometimes the right answer is learning more stuff about your data. But there's not clear-cut answer here...

  • @transportation-talk
    @transportation-talk Před 3 lety +1

    Isn't sampling essentially changing the original dataset? If we predict the Survived class on these sampled data, would that mean anything for the original dataset?

    • @MarioCastroPonce
      @MarioCastroPonce  Před 3 lety +1

      It's not exactly changing the dataset but the proport of some observations of the dataset. It's not guaranteed that it should work but it does most of the time.

  • @popi20101
    @popi20101 Před 2 lety

    I have multiclass to resample and the result is not equal, what should i do?

  • @poojamahesh8594
    @poojamahesh8594 Před 3 lety

    But on doing this, there so many duplicated variables are being created...this is effecting me.. I'm getting NAs on running machine learning code..(on using lda model)

  • @poojamahesh8594
    @poojamahesh8594 Před 3 lety

    But on doing this, there so many duplicated variables are being created...this is effecting me.. I'm getting NAs on running machine learning code..(on using lda model)