Finally! Powerful Cluster Analysis with Microsoft Excel is HERE!

Sdílet
Vložit
  • čas přidán 23. 07. 2024
  • Cluster analysis using Python in Excel is valuable to ANY professional wanting to have more impact at work using data!
    Python in Excel is a game-changer for professionals who want to delight stakeholders with insights that are just not possible with out-of-the-box Microsoft Excel features.
    Cluster analysis with Python in Excel is a perfect example.
    ☕ If you found this content useful and would like to support the channel, you can buy me a coffee: www.buymeacoffee.com/DaveOnData
    --------------------------------------------------------------------------------------------
    VIDEO CHAPTERS
    --------------------------------------------------------------------------------------------
    00:00 Intro
    00:47 The Data
    02:44 Loading Data into Python
    05:13 Scaling the Data
    07:48 Clustering with K-means
    10:28 Output the DataFrame
    11:17 Analyzing the Clusters
    --------------------------------------------------------------------------------------------
    FREE MACHINE LEARNING CRASH COURSES
    --------------------------------------------------------------------------------------------
    If you're ready to see what Python in Excel skills make possible, be sure to check out my FREE on-demand machine learning crash courses!
    Here's what you get with any of these courses:
    1️⃣ On-Demand: Watch the course when it fits your schedule
    2️⃣ PDF of all slides
    3️⃣ All Python code as a Jupyter Notebook
    4️⃣ All data so you can experiment
    5️⃣ Resources to continue your learning.
    Machine learning with Python in Excel. You've got this if you want it:
    💻 Decision Trees with Python:
    bit.ly/DecisionTreesWithPython
    💻 Tuning Decision Trees with Python:
    bit.ly/TuningDecisionTreesWit...
    💻 Logistic Regression with Python:
    bit.ly/LogisticRegressionWith...
    💻 Cluster Analysis with Python:
    bit.ly/ClusterAnalysisWithPython
    --------------------------------------------------------------------------------------------
    GET THE MICROSOFT EXCEL WORKBOOK
    --------------------------------------------------------------------------------------------
    Here's the link to the GitHub for my Python in Excel video workbooks:
    github.com/DaveOnData/PythonI...
    NOTE - You have to have access to Python in Excel to run the code!
    #pythoninexcel #pythonexcel #pythonforexcel
  • Věda a technologie

Komentáře • 20

  • @DaveOnData
    @DaveOnData  Před 5 měsíci +1

    Stuck waiting on Microsoft to grant you access to Python in Excel? Don't wait! Check out my video on how you can get started NOW: czcams.com/video/0iV4FtpSriY/video.html

  • @michaelt312
    @michaelt312 Před 5 měsíci +2

    Still new to Python and learning. Really thankful for your videos.
    But I do know that likes, comments, and shares will get this video in front of more people.

    • @DaveOnData
      @DaveOnData  Před 5 měsíci

      Thank you so much for your continued support of the channel! It is much appreciated.

  • @banihas22
    @banihas22 Před 5 měsíci +1

    Awesome series David!

    • @DaveOnData
      @DaveOnData  Před 5 měsíci +1

      Thank you! I am glad you are enjoying the content. I'm on a mission to uplevel the quality of Python in Excel content on CZcams.

  • @user-je9iq8kb1d
    @user-je9iq8kb1d Před 5 měsíci +1

    This was great. Thank you for posting this.
    I have a question - can we read and write data from local folder ? For eg: can I run pandas.read_csv() and read a csv file located at C:/User ?

    • @DaveOnData
      @DaveOnData  Před 5 měsíci +1

      Python in Excel cannot directly access the local file system, networks, or the Excel object model for security. However, you can use Power Query to source data from local files, databases, APIs, etc. For more details on Python in Excel security, check out the following Microsoft article:
      support.microsoft.com/en-us/office/data-security-and-python-in-excel-33cc88a4-4a87-485e-9ff9-f35958278327

    • @DaveOnData
      @DaveOnData  Před 5 měsíci +1

      BTW - I will have an upcoming video demonstrating sourcing 300,000 rows from SQL Server into Python in Excel via Power Query. Stay tuned!

  • @ScreamingTurtleRacing
    @ScreamingTurtleRacing Před 3 měsíci +1

    hey David, How do you handle data sets that are incomplete, for example some rows have no value for a specific column?

    • @DaveOnData
      @DaveOnData  Před 3 měsíci +1

      Great question! Assuming you can't correct the data (which is common), here are some strategies you can employ:
      1 - Use an algorithm that can handle missing values.
      2 - If only a small percentage of observations have missing data, remove the observations.
      3 - Remove the feature with missing data.
      4 - Find a “proxy feature” for the feature with missing data.
      5 - Fill in the missing data (i.e., impute the missing data).

    • @ScreamingTurtleRacing
      @ScreamingTurtleRacing Před 3 měsíci +1

      @@DaveOnData Thank you for such a complete response! I tried putting in a value that would flag them as not actual data -99999 for example, but I think that just throws the who analysis off. Next I will try removing the rows with null data, then after characterizing the clusters, I will see where the rows with null data might fit back in. sort of a post-hoc assignment to a cluster. Thanks again!

    • @DaveOnData
      @DaveOnData  Před 3 měsíci

      You are welcome. Happy data sleuthing!

  • @ScreamingTurtleRacing
    @ScreamingTurtleRacing Před 4 měsíci +1

    Hey, great video. I have very little experience in this area and I got this running in an hour or so. I am trying to cluster customer sights(lat/long) in to sales territories. The issue is that with K-means I get all different size clusters. I have done some digging, and I want to try kmedoids clustering. Is it possible to use this method in Excel/Python? Thanks

    • @DaveOnData
      @DaveOnData  Před 3 měsíci +1

      A quick scan of the libraries included with Anaconda Python (which Python in Excel uses) showed that k-medoids doesn't appear to be available at this time. The main machine learning library provided with Python in Excel is scikit-learn which does not offer k-medoids. However, you could use a local Python installation and use a package like the following: python-kmedoids.readthedocs.io/en/latest/

    • @ScreamingTurtleRacing
      @ScreamingTurtleRacing Před 3 měsíci +1

      @@DaveOnData thank you, I will try to do a local implementation of Python.

  • @FelixLizama
    @FelixLizama Před 3 měsíci +1

    hey @daveondata Is there any way that we can get the excel file so we can do this ourselves?

    • @DaveOnData
      @DaveOnData  Před 3 měsíci

      @FelixLizama - I've added a GitHub repo link to the video description where you can get the Excel workbook file.

    • @FelixLizama
      @FelixLizama Před 3 měsíci

      @@DaveOnData thanks!.. you are the best!

  • @oil_lord
    @oil_lord Před 5 měsíci +1

    When are they fully releasing python in excel for everyone

    • @DaveOnData
      @DaveOnData  Před 5 měsíci +1

      Microsoft has not announced the date for Python in Excel General Availability. However, you don't need to wait to start building skills. Check out this video on my channel for more details: czcams.com/video/0iV4FtpSriY/video.html