Random Forest Classification | Machine Learning | Python

Sdílet
Vložit
  • čas přidán 3. 08. 2021
  • Random Forest Classification | Machine Learning | Python
    GitHub JupyterNotebook: github.com/siddiquiamir
    GitHub Data: github.com/siddiquiamir/Data
    About this video: In this video, you will learn how to about Random Forest Classification in Python
    Large Language Model (LLM) - LangChain
    LangChain: • LangChain Tutorial for...
    Large Language Model (LLM) - LlamaIndex
    LlamaIndex: • LlamaIndex Tutorial fo...
    Machine Learning Model Deployment
    ML Model Deployment: • ML Model Deployment us...
    Spark with Python (PySpark)
    PySpark: https: • PySpark with Python
    Data Preprocessing (scikit-learn)
    Data Preprocessing Python: • Data Preprocessing Python
    Social Media Links
    CZcams: / statswire
    Twitter (X) : / statswire
    #llm #ai #genai #generativeai #statswire #pyspark #python #pythonprogramming #pythontutorial #machinelearning #datascience #randomforest

Komentáře • 62

  • @rubenneftali6498
    @rubenneftali6498 Před 20 dny

    Thanks a lot for your excellent explanation :)

  • @mybiru
    @mybiru Před 10 měsíci

    Most underrated video...the explanation was exactly what I want... awesome 👍

    • @StatsWire
      @StatsWire  Před 10 měsíci

      Thank you so much for your kind words!

  • @Ben-qe8ju
    @Ben-qe8ju Před rokem

    Really helpful. Thanks!

  • @bandanapahi5514
    @bandanapahi5514 Před rokem +1

    Hi! It will be really helpful if you will provide the details of the datasets you have taken for the analysis.

    • @StatsWire
      @StatsWire  Před rokem

      I think you will find the dataset here: github.com/siddiquiamir/Data

  • @sunidhimalgotra2703
    @sunidhimalgotra2703 Před 7 měsíci

    you saved me thanks

  • @metinmercan8139
    @metinmercan8139 Před 2 lety +1

    what should I DO if my target variable(dependent) is continuous number? how can i do it?

    • @StatsWire
      @StatsWire  Před 2 lety +1

      You have to use random forest regressor instead of random forest classifier

  • @maishareddy
    @maishareddy Před 2 měsíci +1

    i have dataset in which the values are strings and im not able to convert them to numeric values

    • @StatsWire
      @StatsWire  Před 2 měsíci

      Can you put the error messages?

  • @user-nk4zd2nu5p
    @user-nk4zd2nu5p Před 9 měsíci

    Very helpful !! But i want to asky you how to deploy such code into arduino nano 33 ble sense board? And what code should be written for inference in arduino ide? Did yoi explained that in a video??

    • @StatsWire
      @StatsWire  Před 9 měsíci

      I have not explored that part.

  • @mazharalamsiddiqui6904

    Nice tutorial

  • @xaloftal
    @xaloftal Před 5 měsíci

    from the scikit-learn documentation, the classification_report should get the y_true first and then the y_pred and now i'm confused

    • @StatsWire
      @StatsWire  Před 5 měsíci

      For latest information, please refer the documentation:)

  • @user-cv9by1ye5l
    @user-cv9by1ye5l Před rokem

    Hello, I would like to know hot to get an accuracy like yours, I mean, how it´s possible to have that score?

    • @StatsWire
      @StatsWire  Před rokem +1

      If you have same data and if provide the same random_state number then you should get almost same result:)

  • @tomasmarly418
    @tomasmarly418 Před rokem

    What does it mean to have a gini of zero?
    how could i interpret it?

    • @StatsWire
      @StatsWire  Před rokem +1

      The Gini coefficient measures the inequality among values of a frequency distribution, such as the levels of income. A Gini coefficient of 0 reflects perfect equality, where all income or wealth values are the same, while a Gini coefficient of 1 (or 100%) reflects maximal inequality among values. For example, if everyone has the same income, the Gini coefficient will be 0. In contrast, if for a large number of people only one person has all the income or consumption and all others have none, the Gini coefficient will be 1

  • @shahriaralom4547
    @shahriaralom4547 Před rokem

    Thnks brother! I will be very greatful If I get the notebook please!

    • @StatsWire
      @StatsWire  Před rokem +1

      Thank you. I will check on my machine and upload it on my GitHub and share it with you.

    • @shahriaralom4547
      @shahriaralom4547 Před rokem

      @@StatsWire actually don't worry about it, I have copied it from the video and tried myself in jupyter notbook, Thank you anyway ☺️

  • @gabriellefiore5097
    @gabriellefiore5097 Před 7 měsíci

    Where can I find the heart rate csv to follow the example?

    • @StatsWire
      @StatsWire  Před 7 měsíci

      You can find the dataset here: github.com/siddiquiamir/Data

    • @gabbyf2906
      @gabbyf2906 Před 7 měsíci +2

      All I can find is the heart.csv but not not heart_disease.csv

  • @Mark-jm9jc
    @Mark-jm9jc Před rokem +2

    Hi. I don't see the plot.show() result like yours. I just see the blue target bar not the orange one in the plot.

    • @StatsWire
      @StatsWire  Před rokem

      Can you please try once more time by following the video.

    • @anvithaatyam3201
      @anvithaatyam3201 Před rokem +1

      Same here

    • @jurgan92
      @jurgan92 Před 8 měsíci

      try this:
      sns.countplot(x=df["target"], hue=df["target"], dodge=False)

    • @adlinaaminuddin6234
      @adlinaaminuddin6234 Před 7 měsíci

      same here. it does not show distinct classes , it sum up all classes as one

  • @MF-ye5fe
    @MF-ye5fe Před 5 měsíci

    Where can I get this dataset?

    • @StatsWire
      @StatsWire  Před 5 měsíci

      You can find it here
      github.com/siddiquiamir/Data

  • @Aiman-pn2dy
    @Aiman-pn2dy Před 8 měsíci

    From where I can get dataset?

    • @StatsWire
      @StatsWire  Před 8 měsíci +1

      Please find the dataset here: github.com/siddiquiamir/Data

    • @Aiman-pn2dy
      @Aiman-pn2dy Před 8 měsíci

      @@StatsWire may I know your linkedin ID

    • @StatsWire
      @StatsWire  Před 8 měsíci +1

      It is in my bio you can find and send me invite @@Aiman-pn2dy

  • @Thanatos2062
    @Thanatos2062 Před 2 měsíci

    bro your heart disease data set is not there in ur github pls help

    • @StatsWire
      @StatsWire  Před 2 měsíci

      Please find it here: github.com/siddiquiamir/Data/blob/master/Heart_Disease_Prediction.csv

    • @StatsWire
      @StatsWire  Před 2 měsíci

      Even this one: github.com/siddiquiamir/Data/blob/master/heart.csv

    • @Thanatos2062
      @Thanatos2062 Před 2 měsíci

      @@StatsWire thank you soo much bro 😇😇

    • @StatsWire
      @StatsWire  Před 2 měsíci

      @@Thanatos2062 You're welcome!

  • @Rolling_panda00
    @Rolling_panda00 Před 2 lety

    I am getting an error " __init__() got an unexpected keyword argument 'min_sample_split' " .
    Can you help me out and tell me how I can resolve this error?

    • @StatsWire
      @StatsWire  Před 2 lety

      This is a hyperparameter, can you check the version of scikit-learn

    • @Rolling_panda00
      @Rolling_panda00 Před 2 lety

      @@StatsWire sklearn.__version__ '1.0.2'

    • @StatsWire
      @StatsWire  Před 2 lety

      @@Rolling_panda00 Then it should not throw any error because it is a hyperparameter

    • @Rolling_panda00
      @Rolling_panda00 Před 2 lety

      @@StatsWire it's giving the same error in regression and decision tree too
      And I've tried updating but it's not as helpful as I thought.

    • @StatsWire
      @StatsWire  Před 2 lety

      @@Rolling_panda00 can you email me your jupyternotebook at statswire2@gmail.com

  • @paretare4946
    @paretare4946 Před 2 měsíci

    where is that heart diesesase file

    • @StatsWire
      @StatsWire  Před 2 měsíci +1

      You can find it here: github.com/siddiquiamir/Data

    • @paretare4946
      @paretare4946 Před 2 měsíci

      @@StatsWire thanks ♥️ really appreciate this kindness

    • @StatsWire
      @StatsWire  Před 2 měsíci

      @@paretare4946 You're welcome :)

  • @pranavpnvkeyworld2502

    Bro I need code and data set

    • @pranavpnvkeyworld2502
      @pranavpnvkeyworld2502 Před rokem

      Plz upload it bro

    • @StatsWire
      @StatsWire  Před rokem

      Hi, sorry for the delay. Please find the dataset here: github.com/siddiquiamir/Data

    • @BadboyGaming-yk1vj
      @BadboyGaming-yk1vj Před 10 měsíci +1

      @@StatsWire Which data set in the one you sent ?

    • @StatsWire
      @StatsWire  Před 10 měsíci

      @@BadboyGaming-yk1vj You can get all the dataset in the github link.

    • @gabriellefiore5097
      @gabriellefiore5097 Před 7 měsíci

      All I see is heart rate no csv that says Heart_diseases@@StatsWire