Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science Training | Edureka

Sdílet
Vložit
  • čas přidán 19. 05. 2024
  • ( Data Science Training - www.edureka.co/data-science-r... )
    This Edureka Random Forest tutorial will help you understand all the basics of Random Forest machine learning algorithm. This tutorial is ideal for both beginners as well as professionals who want to learn or brush up their Data Science concepts, learn random forest analysis along with examples. Below are the topics covered in this tutorial:
    1) Introduction to Classification
    2) Why Random Forest?
    3) What is Random Forest?
    4) Random Forest Use Cases
    5) How Random Forest Works?
    6) Demo in R: Diabetes Prevention Use Case
    Subscribe to our channel to get video updates. Hit the subscribe button above.
    Check our complete Data Science playlist here: goo.gl/60NJJS
    #RandomForest #Datasciencetutorial #Datasciencecourse #datascience
    How it Works?
    1. There will be 30 hours of instructor-led interactive online classes, 40 hours of assignments and 20 hours of project
    2. We have a 24x7 One-on-One LIVE Technical Support to help you with any problems you might face or any clarifications you may require during the course.
    3. You will get Lifetime Access to the recordings in the LMS.
    4. At the end of the training you will have to complete the project based on which we will provide you a Verifiable Certificate!
    - - - - - - - - - - - - - -
    About the Course
    Edureka's Data Science course will cover the whole data life cycle ranging from Data Acquisition and Data Storage using R-Hadoop concepts, Applying modelling through R programming using Machine learning algorithms and illustrate impeccable Data Visualization by leveraging on 'R' capabilities.
    - - - - - - - - - - - - - -
    Why Learn Data Science?
    Data Science training certifies you with ‘in demand’ Big Data Technologies to help you grab the top paying Data Science job title with Big Data skills and expertise in R programming, Machine Learning and Hadoop framework.
    After the completion of the Data Science course, you should be able to:
    1. Gain insight into the 'Roles' played by a Data Scientist
    2. Analyse Big Data using R, Hadoop and Machine Learning
    3. Understand the Data Analysis Life Cycle
    4. Work with different data formats like XML, CSV and SAS, SPSS, etc.
    5. Learn tools and techniques for data transformation
    6. Understand Data Mining techniques and their implementation
    7. Analyse data using machine learning algorithms in R
    8. Work with Hadoop Mappers and Reducers to analyze data
    9. Implement various Machine Learning Algorithms in Apache Mahout
    10. Gain insight into data visualization and optimization techniques
    11. Explore the parallel processing feature in R
    - - - - - - - - - - - - - -
    Who should go for this course?
    The course is designed for all those who want to learn machine learning techniques with implementation in R language, and wish to apply these techniques on Big Data. The following professionals can go for this course:
    1. Developers aspiring to be a 'Data Scientist'
    2. Analytics Managers who are leading a team of analysts
    3. SAS/SPSS Professionals looking to gain understanding in Big Data Analytics
    4. Business Analysts who want to understand Machine Learning (ML) Techniques
    5. Information Architects who want to gain expertise in Predictive Analytics
    6. 'R' professionals who want to captivate and analyze Big Data
    7. Hadoop Professionals who want to learn R and ML techniques
    8. Analysts wanting to understand Data Science methodologies
    For more information, Please write back to us at sales@edureka.co or call us at IND: 9606058406 / US: 18338555775 (toll free).
    Instagram: / edureka_learning
    Facebook: / edurekain
    Twitter: / edurekain
    LinkedIn: / edureka
    Customer Reviews:
    Gnana Sekhar Vangara, Technology Lead at WellsFargo.com, says, "Edureka Data science course provided me a very good mixture of theoretical and practical training. The training course helped me in all areas that I was previously unclear about, especially concepts like Machine learning and Mahout. The training was very informative and practical. LMS pre recorded sessions and assignmemts were very good as there is a lot of information in them that will help me in my job. The trainer was able to explain difficult to understand subjects in simple terms. Edureka is my teaching GURU now...Thanks EDUREKA and all the best. "

Komentáře • 98

  • @edurekaIN
    @edurekaIN  Před 5 lety +2

    Got a question on the topic? Please share it in the comment section below and our experts will answer it for you. For Data Science Training Certification Curriculum, Visit our Website: bit.ly/37q65Oc

  • @NishantKumar-ir2cn
    @NishantKumar-ir2cn Před 7 lety +1

    thanks shivani for such a pretty explaination of random forest...

  • @francinagoh2541
    @francinagoh2541 Před 3 lety

    I enjoy this youtube very much. The explaination is very clear and easy to understand. Thank you!

  • @el7ke
    @el7ke Před 6 lety +1

    This was very helpful, thank you!

  • @Arunkumar-gx2je
    @Arunkumar-gx2je Před 4 lety +1

    Thank you so much mam.. i really enjoying and it is clear picture of random forest and decision tree.. I really thankful to u. Keep posting your videos mam..

  • @sonalikapanda8398
    @sonalikapanda8398 Před 6 lety +1

    Thanks for the wonderful video.Its really helpful.

  • @mayurgobade9519
    @mayurgobade9519 Před 6 lety +1

    nice teaching
    i recommended to everyone please watch this video
    random forest best
    it help in interview to explain everything about random forest

    • @edurekaIN
      @edurekaIN  Před 6 lety

      Hey Mayur, thank you for watching our video. We are delighted to know that you found it useful. Do subscribe to us and stay connected with us. Cheers :)

  • @babarabbasi8688
    @babarabbasi8688 Před 5 lety

    A very informative and concise tutorial indeed. I didn't know about RF before watching this video but now, I have a clear idea how to apply it on my data set. Thanks

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Babar, we are glad you loved the video. Do subscribe and hit the bell icon to never miss an update from us in the future. Cheers!

  • @mayurgobade9519
    @mayurgobade9519 Před 6 lety +2

    out of all lectures this session is very good
    i thanks to lady.she explain each and everything in detail
    she run the code step by step.
    thanks once agian to edureka to provide such kind of knowledge

    • @edurekaIN
      @edurekaIN  Před 6 lety

      Thank you for watching our video. Do subscribe, like and share to stay connected with us. Cheers :)

  • @VijayKumar-to3hy
    @VijayKumar-to3hy Před 6 lety +24

    Hi Mariana...hope you are clear..!! ;)

  • @pradnyaasolkar9116
    @pradnyaasolkar9116 Před 5 lety

    Very well explained. Cannot find any better video explaining random forest so easily and in detail than this one! Thank you Shivani and Eudeka.. Happy learning!

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Pradnya, we are glad you feel this way. Do subscribe and hit the bell icon to never miss an update from us in the future. Cheers!

  • @brahmaiahchowdarym1913
    @brahmaiahchowdarym1913 Před 5 lety +1

    Nice Explanation... Thanks edureka!

  • @yadavthapa2838
    @yadavthapa2838 Před 6 lety +1

    Very nice and easily explained...

  • @nikller
    @nikller Před 6 lety +1

    Very useful, thanks!

  • @motherofmicrobes
    @motherofmicrobes Před 6 měsíci

    very clear explanation, thank you 😊

  • @rajbir_singh0517
    @rajbir_singh0517 Před 5 lety

    Great video great explanation. Some code are not matching with video and R studio but overall it is great insight

  • @jolaoduwole4523
    @jolaoduwole4523 Před 6 lety +3

    Yeah! This tutorial is super useful, helpful and interesting to me. Keep it up

    • @edurekaIN
      @edurekaIN  Před 6 lety

      Thank you for watching our video. Do subscribe, like and share to stay connected with us. Cheers :)

  • @geospatialdatascientist

    thank you so much. the instruction is quite clear and really helpful to me.

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Thuy Doan, we are glad you loved the video. Do subscribe to the channel and hit the bell icon to never miss an update from us in the future. Cheers!

  • @vivekdaga1880
    @vivekdaga1880 Před 5 lety +1

    Very well explained.

  • @AmarjeetKumar-to9ub
    @AmarjeetKumar-to9ub Před rokem +2

    Thank You Ma'am 😊

  • @bharatsharma2907
    @bharatsharma2907 Před 6 lety +3

    Best explanation Available on CZcams abt Random Forest !!

    • @edurekaIN
      @edurekaIN  Před 6 lety +1

      Hey Bharat, thank you for appreciating our work. Do subscribe to our channel and stay connected with us. Cheers :)

  • @ahamedimad4554
    @ahamedimad4554 Před 5 lety

    I've checked almost 19 video tutorials on this topic truly I didn't see anything like this..this is a Cristal explanation. Thanks a ton edureka and thanks Shivani.
    Could you please share the codes.

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Ahamed, we are glad our video made you feel this way. Do subscribe and hit the bell icon to never miss an update from us in the future. Please mention your email ID over here and we will send the files to you. Cheers!

  • @PawanSingh-iu5mi
    @PawanSingh-iu5mi Před 6 lety +1

    very good explanation to Random Forest algorithms and its implementation example.

    • @edurekaIN
      @edurekaIN  Před 6 lety

      Hey Pawan! We are happy to see you browse through our channel and watch the videos. Look through the videos and tell us how you liked it. Thanks :)

  • @mahimsd7645
    @mahimsd7645 Před 6 lety +1

    gr8 work Shivani .......
    I'm highly impressed with ur explanations , clarity and way of explaining........ gr8.... gr8.... gr8

    • @edurekaIN
      @edurekaIN  Před 6 lety

      Hey Mak, it's great to see avid learners like you on our channel watching multiple videos. Do browse through other videos on our channel and let us know how you liked it. Any suggestions are welcomed :)

  • @georgiamajdalani7680
    @georgiamajdalani7680 Před rokem +1

    Very helpful. Thank you.

    • @edurekaIN
      @edurekaIN  Před rokem

      You're welcome 😊 Glad it was helpful!!

  • @sokcintye732
    @sokcintye732 Před 5 lety

    Hi, thank you for a clear presentation about random forest. I am wondering of you have ther videos about random forest for time to event model?Thanks.

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hi Tye, thanks for the compliment! We don't have that specific video, however you can check out this content on Random forest: www.edureka.co/blog/random-forest-classifier/

  • @f5057
    @f5057 Před 6 lety +1

    Very good explanation but more further formulas and technical and scientific explanation is needed. Thank you

    • @edurekaIN
      @edurekaIN  Před 6 lety

      You can check out our Data Science course if you are truly looking to master the technology. Hope this helps :)

    • @f5057
      @f5057 Před 6 lety

      edureka! Could you provide me the link of the course please, I appreciate it thank you very much

  • @constantineallen5267
    @constantineallen5267 Před 3 lety +1

    Cool demonstration, congrats!

  • @ayyasamy8730
    @ayyasamy8730 Před 5 lety

    Nice explanation !!

  • @ndagijimanafrankaimeerodri8893

    Thank you very much for such a detailed lecture!

    • @edurekaIN
      @edurekaIN  Před rokem

      You're Welcome 😊 Glad it was helpful!! Keep learning with us..

  • @improve2101
    @improve2101 Před 6 lety +1

    best explaination.....tnx

    • @edurekaIN
      @edurekaIN  Před 6 lety

      Thank you for watching our video. Do subscribe, like and share to stay connected with us. Cheers :)

  • @niranjans4248
    @niranjans4248 Před 5 lety

    What can you do to improve the model accuracy for random forest and was the number of variables selected for each tree built in this forest 3 as calculated or only 2?

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Niranjan, It is always a better idea to apply ensemble methods to improve the accuracy of your model. There are two good reasons for this: a ) They are generally more complex than traditional methods. b) The traditional methods give you a good base level from which you can improve and draw from to create your ensembles. Hope this helps!

  • @PRIYANSHU_NEGI_108
    @PRIYANSHU_NEGI_108 Před 5 lety

    Please make a video on spam detection in twitter using Random Forest .

  • @trandangan7177
    @trandangan7177 Před 6 lety +1

    It is a great tutorial! Do you have data sources and codes then we can practice easily?

    • @edurekaIN
      @edurekaIN  Před 6 lety

      Hey Tran, yes we do. Mention your email address and we will send it over. Cheers :)

  • @AdityaGupta-fz8mr
    @AdityaGupta-fz8mr Před 5 lety +1

    while explaining how random forest works you told that we split the features but in the example that you gave split on training set, so which is correct ?

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Aditya, you have to use split on the training set.

  • @svanishree2614
    @svanishree2614 Před 5 lety

    Its really useful...Tnku

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Vanishree, we are glad you loved the video. Do subscribe and hit the bell icon to never miss an update from us in the future. Cheers!

  • @sayantanmukherji4568
    @sayantanmukherji4568 Před 6 lety +1

    Great explanation.
    This is what quality teaching is.
    Very much cleared with the concepts now.
    Just similar with the diabetes data do you guys have a heart attack/disease patients data.
    If yes then can I be provided with that?

    • @edurekaIN
      @edurekaIN  Před 6 lety

      Hey Sayantan, thanks for the wonderful feedback! We're glad we could be of help.
      Please share your email address and we will send it. Cheers!

    • @sayantanmukherji4568
      @sayantanmukherji4568 Před 6 lety

      mukherjee.sayantan96@gmail.com
      Thank you in advance

    • @edurekaIN
      @edurekaIN  Před 6 lety

      We have shared it with you, Sayantan. Do subscribe to our channel to stay posted on upcoming videos. You can also check out our complete training here: www.edureka.co/data-science. Hope this helps. Cheers!

  • @AbhishekVigg
    @AbhishekVigg Před 5 lety

    How were the subsets divided in the first step of the Random Forest Algorithm? Is there a parameter that was used to decide on these subsets?

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Abhishek, Each tree gets the full set of features, but at each node, only a random subset of features is considered.
      Hope this helps!

  • @rahulm2028
    @rahulm2028 Před 5 lety

    hi can u pls send one health insurance claims dataset to find the fraud claims using random forest

  • @theforester_
    @theforester_ Před 2 lety +1

    nice.

  • @narenenagares7459
    @narenenagares7459 Před 6 lety

    How does Random Forest work if there are more than two classes or multi-classification, let's say 3 outcomes?

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Narene, "A good multi-class classification machine learning algorithm involves the following steps:
      Importing libraries
      Fetching the dataset
      Creating the dependent variable class
      Extracting features and output
      Train-Test dataset splitting (may also include validation dataset)
      Feature scaling
      Training the model
      Calculating the model score using the metric deemed fit based on the problem
      Saving the model for future use"
      Hope this helps!

  • @Tony-rb3pd
    @Tony-rb3pd Před 2 lety

    I know this video is a bit old, but how can i obtain the dataset? i would like to follow the example but i cant without it. Do you have it in an external site?

    • @edurekaIN
      @edurekaIN  Před 2 lety

      Good to know our contents and videos are helping you learn better . We are glad to have you with us ! Please share your mail id to send the data sheets to help you learn better :) Do subscribe the channel for more updates : ) Hit the bell icon to never miss an update from our channel : )

  • @jay-rathod-01
    @jay-rathod-01 Před 5 lety

    Cool vid

  • @diverse4985
    @diverse4985 Před 5 lety

    could any one explain about subset? so if we have 500 tree, the number of subset will be 500, right?

    • @edurekaIN
      @edurekaIN  Před 5 lety +1

      Hey Bison, Each tree gets the full set of features, but at each node, only a random subset of features is considered. Hope this helps!

  • @noobshady
    @noobshady Před 7 lety

    how can we use tuneRF to optimize the model?

    • @edurekaIN
      @edurekaIN  Před 7 lety +1

      +qυαятєямαɨиє, thanks for checking out our tutorial!
      Below is a summary of how tuneRF works:
      a. Set mtry to the default value of sqrt(p) for classification, and p/3 for regression (where p = total number of variables)
      b. Compute the out-of-bag (OOB) error (say error_default) for a Random Forest with mtry set to the default value found above
      a. Look to the left: set mtry = default value/stepFactor. For instance, if stepFactor=1.5 and your default starting value is 8, mtry would be set to be 8/1.5=5.33, rounded up to the be an integer, which gives 6
      b. Compute the OOB error, say error_left
      a. Look to the right: set mtry = default value*stepFactor. To continue with my example, mtry would be set to be 8*1.5=12
      b. Compute the OOB error, say error_right
      i. If (error_default < error_right) OR (error_default < error_left), the best mtry is the default value
      ii. If the previous condition is not met, but the delta between errors_default and error_right/error_left is less than the improve parameter, the best mtry is the default value
      iii. Without any loss of generality, if the condition is not met, and if error_right < error_left, and if (error_default-error_right) > improve, set mtry to be mtry_right (12). From now on, always go to the right
      If 4.iii. is verified, iterate: set mtry to be mtry_right*stepFactor (in my example, 12*1.5=18), compute the OOB error and compare it with the error obtained at the previous step (in my example, for mtry=12). If the error new error is smaller, and if the gain in error reduction is enough (i.e, >improve), select the new mtry and continue to repeat these steps, otherwise stop and return the current mtry as the best mtry
      The smaller stepFactor you set (e.g., 1.1, 1.2), the more values of mtry you try (fine search), the bigger stepFactor you set (e.g., 2, 2.5), the less values you try (rough search). Also, with low values of improve, the search will continue longer.
      Hope this helps. Cheers!

    • @noobshady
      @noobshady Před 7 lety +1

      thank you for your response and you great tutorial

  • @irahcabangon5079
    @irahcabangon5079 Před 7 lety

    Can I use Random Forest on data with only 2 variables?

    • @edurekaIN
      @edurekaIN  Před 7 lety +3

      Hey Irah, thanks for checking out our tutorial.
      If you have only two variables then random forest is not advisable. You should go for something like decision tree or regression. Polynomial regression will work best in the mentioned case.
      Hope this helps. Cheers!

  • @aritrachatterjee8057
    @aritrachatterjee8057 Před 6 lety

    You are not explaining the key concepts like Mean decrease Gini and why should I select the high value for mean decrease gini and interpret as most important variable????

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Aritra, sorry for the delay.
      Gini Impurity signifies how pure or impure your dataset is. Root node has the highest value of gini impurity, while the leaf nodes have the least value of the gini impurity. Why? Because at root node the dataset is completely mixed and unsegregated while at leaf node the data is pure and segregated. So if the value of gini impurity is high there it means there is still a chance to further divide the tree. Hope this clarifies your doubt. For further query stay tuned for our next video on Decision Tree Using Python, This video will cover all the basics and the concepts related to decision tree.
      Hope this helps!

  • @VarunSharma-ym2ns
    @VarunSharma-ym2ns Před 5 lety

    randomForest library package is not available in my R

    • @edurekaIN
      @edurekaIN  Před 5 lety

      Hey Varun, The basic syntax for creating a random forest in R is −randomForest(formula, data).
      Hope this helps!

  • @metinmercan8139
    @metinmercan8139 Před rokem

    can you send me codes?

    • @edurekaIN
      @edurekaIN  Před rokem

      Thanks for showing interest in Edureka! Kindly share your mail id for us to share the datasheet/ source code :) Do subscribe for more videos & updates