Video není dostupné.
Omlouváme se.

Preprocessing Data in R for ML with "caret" (2021)

Sdílet
Vložit
  • čas přidán 19. 08. 2024
  • Subscribe to RichardOnData here: / @richardondata
    Patreon: / richardondata
    GitHub: github.com/Ric...
    In this video I provide a beginning to a multi-part tutorial series on machine learning in R using the "caret" package. We will begin with pre-processing of a dataset to get it into a format appropriate for the machine learning pipeline, as well as identifying zero or near zero variance predictors. The beauty of this package is that it is truly a one stop shop for all of your machine learning needs.
    There are a few sources from which this tutorial draws influence and structure. The first is the GitHub documentation on "caret" from its creation, Max Kuhn. The second is a very well-written and comprehensive tutorial by author Selva Prabhakaran on Machine Learning Plus. Third is a helpful resource for dealing with class imbalance, as we often find with classification problems.
    - GitHub documentation from Max Kuhn: topepo.github....
    - Tutorial by Selva Prabhakaran: www.machinelea...
    - Tutorial on "caret" with class imbalances: shiring.github...

Komentáře • 20

  • @wivineblekic4928
    @wivineblekic4928 Před 2 lety +1

    Thank you ! It's hard to find clear information about preprocessing and you were a life savior for me and my research !

  • @DataProfessor
    @DataProfessor Před 3 lety +2

    Great video on Caret, Happy New Year Richard!

    • @RichardOnData
      @RichardOnData  Před 3 lety

      Thank you Chanin! Happy new year to you and your family as well!

  • @faustin289
    @faustin289 Před 3 lety +2

    This is a very good and informative series.
    Keep teaching us.

    • @RichardOnData
      @RichardOnData  Před 3 lety

      Thank you, I will! "tidymodels" series is coming too.

  • @fullsurr3465
    @fullsurr3465 Před 3 lety +1

    Richard, thanks for your R-enthusiasm! On my particular learning step your tutorials really help! I had been implementing all ISLR exercises using book methods + the same using caret since the middle of the book. Caret seems user-friendly. I hope this feeling is not deceptive and it will help with "Applied Predictive Modeling"
    Looking forward to furthering the caret series!

    • @RichardOnData
      @RichardOnData  Před 3 lety +1

      Yup, I'm a huge fan of the book Applied Predictive Modeling and I think it's just about the best R machine learning book out there... glad that this tutorial (and hopefully the future ones) could be a helpful supplement to that book!

  • @picassoofai4061
    @picassoofai4061 Před 2 lety

    Man Very Very Good Content, You must love what you are doing.

  • @empowercode
    @empowercode Před 3 lety +1

    Hey! I just found your channel and subscribed, love what you're doing!
    I like how clear and detailed your explanations are as well as the depth of knowledge you have surrounding the topic! Since I run a tech education channel as well, I love to see fellow Content Creators sharing, educating, and inspiring a large global audience. I wish you the best of luck on your CZcams Journey, can't wait to see you succeed! Your content really stands out and you've put so much thought into your videos!
    Cheers, happy holidays, and keep up the great work :)

    • @RichardOnData
      @RichardOnData  Před 3 lety

      Thank you so much! I'm glad to have you onboard and that you find my explanations clear! In an ideal world I'd like to have as many as 100 videos up this year; we'll see what happens!

    • @empowercode
      @empowercode Před 3 lety

      @@RichardOnData Yup, glad to be able to follow your journey!

  • @hectormotsepe1581
    @hectormotsepe1581 Před 3 lety +2

    Thanks Richard

  • @DrgreenSlime
    @DrgreenSlime Před 3 lety +3

    Are you planning on showcasing purrr sometime in the future? Love your videos btw!

    • @RichardOnData
      @RichardOnData  Před 3 lety +2

      YES! That will be coming probably some time in March or so. "furrr" too.

  • @setevezesdoismenos1
    @setevezesdoismenos1 Před rokem

    Great channel, great videos! Thank you!

  • @rajeshshigdel1472
    @rajeshshigdel1472 Před 2 lety

    Simply awesome

  • @DW2WD
    @DW2WD Před 3 lety +1

    good

  • @bosscs
    @bosscs Před 3 lety

    hey Richard, do u do tutoring? I just have my thesis and I need to validate my dataset, I need some help in some details that need an expert to help me decide. input and output tables are ready to be used.