How To Detect And Visualize Outliers Using DAX In Power BI [2023 Update]

Sdílet
Vložit
  • čas přidán 4. 08. 2024
  • Knowing how to detect outliers in datasets and how to visualize them is important for data analysts because it helps them identify potential errors or anomalies in data that can skew statistical analyses and modeling results.
    Outliers can also provide valuable insights into unusual or unexpected patterns in the data that may be indicative of underlying trends or issues, facilitating more effective data-driven decision-making. In this video, Sam is going to demonstrate how you can detect and visualize outliers using DAX in Power BI.
    ****Video Details****
    00:00 Introduction
    01:13 Determining the logic
    03:44 Doing logic in a dynamic way
    04:00 Criteria for outliers
    05:22 Outlier detection logic
    08:10 Grouping
    09:15 Conclusion
    **** Learning Power BI? ****
    FREE COURSE - Ultimate Beginners Guide To Power BI - www.enterprisedna.co/courses/ultimate-beginners-guide-to-power-bi
    FREE COURSE - Ultimate Beginners Guide To DAX - www.enterprisedna.co/courses/ultimate-beginners-guide-to-dax
    FREE - Power BI Resources - www.enterprisedna.co/power-bi-resources
    Enterprise DNA On-Demand - app.enterprisedna.co
    Enterprise DNA Subscription - app.enterprisedna.co/pricing
    Enterprise DNA Events - www.enterprisedna.co/events
    #EnterpriseDNA #PowerBI #PowerBIDesktop #PowerBITutorial #DataVisualization #OutlierAnalysis

Komentáře • 11

  • @EnterpriseDNA
    @EnterpriseDNA  Před rokem

    Check out our FREE courses: bit.ly/3N00AJw

  • @RelaxingGamingMusic85
    @RelaxingGamingMusic85 Před rokem +1

    Great video as always Sam. :) Love the work you and the guys at EDNA are doing

    • @EnterpriseDNA
      @EnterpriseDNA  Před rokem

      Hi Damien, glad that you appreciated our content! If you haven't yet, you can subscribe to our channel to see all our upcoming Power BI and Power Platform video tutorials and announcements. Cheers!

  • @ezequielaguirre3
    @ezequielaguirre3 Před rokem +1

    Great video!!
    If I may... I think it would be more efficient if you indicate in the Outlier Sales measure, in an explicit way a "MAX": [Total Sales] >= MAX('Outliers Detection Logic'[Total Sales min]).
    And a MIN in the Non Outlier...
    Then the measure would only calculate the Total Sales for the Customers needed.

    • @EnterpriseDNA
      @EnterpriseDNA  Před rokem

      Hi Alvaro, glad that you appreciated our content! Thanks for sharing your top as well! If you haven't yet, you can subscribe to our channel to see all our upcoming Power BI and Power Platform video tutorials and announcements. Cheers!

  • @rajarshisingh2547
    @rajarshisingh2547 Před rokem +1

    Pretty cool, will implement it right away. 😋

    • @EnterpriseDNA
      @EnterpriseDNA  Před rokem

      Hi Rajarshi, glad that you appreciated our content! If you haven't yet, you can subscribe to our channel to see all our upcoming Power BI and Power Platform video tutorials and announcements. Cheers!

  • @yemunnsoe8450
    @yemunnsoe8450 Před 2 měsíci

    In statistics, data points that fall beyond 3 standard deviations (3rd stdev) from the mean are often considered outliers.

  • @ashrafadel54
    @ashrafadel54 Před 9 měsíci

    Great content! However, I have a few questions:
    1. at 6:16, will it make a difference if I choose a different column name for the `Customers` table instead of customer name? (e.g., VALUES(Customers[Customer ID])).
    what i mean by "make a difference" is this:
    when I hover over each scatter point now, will the tooltip display the customer ID instead of the customer name? also, isn't it more accurate to use ID, since there's a rare chance that names are duplicated?
    2. `Total Sales` column is found in the `Sales` table which is implicitly related to the `Customers` table via the `VALUES(Customers[Customer Name]) expression, correct?
    3. at 8:55, from what I understand, `Sales Grouping` measure is evaluated for each data point in the scatter chart. If that's the case, then for each of these points, how does SELECTEDVALUE() function correctly evaluate to `Outlier` or `Non-Outlier`, given that the `Outlier Detection Logic` table is a detached table (i.e., not connected to `Sales` or `Customers` tables)?

  • @pepper_lab
    @pepper_lab Před rokem +1

    🎉🎉🎉🎉🎉😮