What are dataflows in Power BI?

Sdílet
Vložit
  • čas přidán 7. 08. 2024
  • A brief introduction into the concept of dataflows
    Power Query - centralised online
    Post Production Note: my comment re Merging Tables - as long as the "helper" tables aren't loaded then this is fine, it's only when merging tables (entities) that are also loaded in their own right that you need Premium.
    00:15 what is a dataflow and why would you use it, plus how to create one
    01:38 Creating an empty dataflow to paste your existing Power Query tables into
    05:00 Save and Refresh your dataflow
    05:53 Connecting to a dataflow
    07:03 Signing in tip when using Multiple Tenants
    07:55 A few nice features in dataflows that aren't in Desktop Power Query yet
    For an in-depth discussion between Matthew Roche and Kasper de Jonge around dataflows then check this out
    • Everything Power BI da...
    Follow me on LinkedIn
    / wynhopkins
    Twitter
    / wynhopkins
    Access Analytic Blog
    accessanalytic.com.au/blog
  • Jak na to + styl

Komentáře • 79

  • @joaorataoo
    @joaorataoo Před 3 lety +17

    Finally, someone who has the good sense to present things simply and without jargon.
    I wonder, why does Microsoft give a different name to the same things?
    On the one hand, they seek to simplify and integrate everything, on the other hand, it seems that they are happy to shuffle the user with 3 or 4 different words for the same thing.
    Thank you so much.

    • @AccessAnalytic
      @AccessAnalytic  Před 3 lety +2

      You’re welcome Joāo, thanks for leaving a comment. Microsoft are notorious for poor naming choices and renaming things or picking clashing names for features and products.

  • @tamonwancharoensripongsa4740
    @tamonwancharoensripongsa4740 Před měsícem +1

    Thanks! It's straightforward to understand and follow.

    • @AccessAnalytic
      @AccessAnalytic  Před měsícem +1

      I appreciate you taking the time to let me know you found it useful

  • @davidkelly1634
    @davidkelly1634 Před 8 měsíci +1

    Refresh data flow and then dataset is a helpful prompt. Thanks for the content.

  • @AccessAnalytic
    @AccessAnalytic  Před 3 lety +5

    Post Production Note: my comment re Merging Tables - as long as the "helper" tables aren't loaded then this is fine, it's only when merging tables (entities) that are also loaded in their own right that you need Premium.

  • @EyiBillion
    @EyiBillion Před 6 měsíci +2

    Thanks a lot. My best video so far on Dataflow. Subscribed sharp sharp😅

    • @AccessAnalytic
      @AccessAnalytic  Před 6 měsíci

      Thank you. Welcome to the channel

    • @mickeyteller
      @mickeyteller Před měsícem +1

      Soon as I saw your sharp sharp comment, I knew you was Nigerian lol. I love to see it!

  • @DavidLiLove888
    @DavidLiLove888 Před 3 lety

    that is great, Wyn. thanks for sharing. I will use it in my job.

  • @resagratia
    @resagratia Před 2 lety +1

    "You've gotta refresh the dataflow, then refresh the dataset", looks sideways, "Hope you found that useful".
    Reminds me of Michael Scott speaking to the TV crew 😅
    Really great intro that avoids all the jargon. I really don't understand why Microsoft has bad naming convention for arguably the same products/features.

    • @AccessAnalytic
      @AccessAnalytic  Před 2 lety

      Thanks, naming things is not Microsoft’s strong point. At least they renamed dataflow entities to tables 😀

  • @antoniofelez4674
    @antoniofelez4674 Před měsícem +1

    Awesome video Wyn, can you do a video of the limitations for us pro license folks😂. Source being SharePoint as well

    • @AccessAnalytic
      @AccessAnalytic  Před měsícem +1

      Cheers. The only real limitation is no linked dataflows. So you can’t load a table and also connect to it with another query / dataflow.

  • @Sivakumarpoornima
    @Sivakumarpoornima Před 3 lety

    awesome and thank you for sharing

  • @joseluisbeltramone599
    @joseluisbeltramone599 Před rokem +1

    Thank you very much for the good video!

  • @tracylippiatt3238
    @tracylippiatt3238 Před 3 lety

    Diagram View looks great

    • @AccessAnalytic
      @AccessAnalytic  Před 3 lety

      Yeah, it will hopefully make it to desktop eventually

  • @Ahmad_Askar
    @Ahmad_Askar Před 3 lety +1

    thank you

  • @zaheerahmad3896
    @zaheerahmad3896 Před 10 měsíci +1

    GOD bless you !

  • @jwilliamson01
    @jwilliamson01 Před rokem

    thanks Wyn, great video. but wondering is it possible to create a mySQL data flow. doesn't want to work for me. I'm trying to get a schedule refresh set up that isn't dependant on a personal gateway, i.e. works without my computer needing to be switched on

    • @AccessAnalytic
      @AccessAnalytic  Před rokem

      Is the data source online? If on a network server or computer then a gateway is always needed

  • @eleftheriakoniari3392

    Hello! Thank you for this video! How many dataflows can we have per workspace, if the workspace is backed by a premium capacity?

  • @chrisc2084
    @chrisc2084 Před rokem +1

    Hi Wyn, are there limitation to how many files you can use in a dataflow. What I mean by this, I am saving numerous excel(xlsx format) files to SharePoint and the aim is to do my mashup or ETL process in the dataflow area and then merge the files as one table. We use Power BI Premium

    • @AccessAnalytic
      @AccessAnalytic  Před rokem +7

      There's no technical limit, the refresh just gets slower the more files you add. One approach is to have one dataflow that simply consolidates the files and then another that links to it and does any additional complex ETL

    • @chrisc2084
      @chrisc2084 Před rokem +1

      @@AccessAnalytic many thanks CC

  • @arturoramirez712
    @arturoramirez712 Před 4 měsíci

    Are there times where dataflows are not the best approach? Also in terms of speed, is this a slower process compared to running power query directly within the power bi file? Last question can Excel connect to a dataflow table? Thank you

    • @AccessAnalytic
      @AccessAnalytic  Před 4 měsíci

      They’re not always necessary. If you don’t need to re-use a table on multiple reports or your refresh against the data source isn’t slow then no real need. You’d just be adding an extra process and refresh step into your report update.
      Yes Excel can connect to dataflows.

  • @dataguys2192
    @dataguys2192 Před rokem

    When you refresh a dataset that uses a dataflow as a source does it run queries against the database?

    • @AccessAnalytic
      @AccessAnalytic  Před rokem

      No, the dataflow stores the data (in csv files in the background). So you need to refresh the dataflow in addition to the dataset

    • @dataguys2192
      @dataguys2192 Před rokem +1

      @@AccessAnalytic I just did a test. No impact on database when dataset is refreshed which is what I wanted. Great.

  • @eleftheriakoniari3392
    @eleftheriakoniari3392 Před 10 měsíci

    Hello, as I using dataflows more questions arise. If I have created a dataflow and then I am importing some tables to a power bi report, I have duplication of these data? Thank you in advance!

    • @AccessAnalytic
      @AccessAnalytic  Před 10 měsíci

      Yes, you have a centralised table in the dataflow and then one or more datasets will pull copies in.

    • @eleftheriakoniari3392
      @eleftheriakoniari3392 Před 10 měsíci

      How can I avoid the duplication? Creating a composite model I suppose, but what are the restrictions?

    • @AccessAnalytic
      @AccessAnalytic  Před 10 měsíci

      @eleftheriakoniari3392 I don’t see a need to avoid the duplication. Think of the data model as an in-memory cache.

    • @eleftheriakoniari3392
      @eleftheriakoniari3392 Před 10 měsíci

      ​@@AccessAnalytic What I ment is if we have duplication of data in the workspace

    • @AccessAnalytic
      @AccessAnalytic  Před 10 měsíci

      Having data in a Dataflow and in datasets is normal best approach. The duplication is not a problem

  • @javedabdool1187
    @javedabdool1187 Před měsícem

    How is it different from doing it in power query then export it i to the service and everyone can use the dataset??

    • @javedabdool1187
      @javedabdool1187 Před měsícem

      Is it like anyone can modify it?

    • @AccessAnalytic
      @AccessAnalytic  Před měsícem +1

      It’s clean table(s) that can be re-used in multiple reports and edited like any other data source on import.
      Good if you want to centralise some tables to be re-used, or you have a slow data source that you want to pull from once / occasionally and then your data model refreshes will be quicker.

    • @javedabdool1187
      @javedabdool1187 Před měsícem

      @@AccessAnalytic "It’s clean table(s) that can be re-used in multiple reports and edited like any other data source on import." for this part i could have done it in power query on pbi desktop then publish it for everyone to use right?

    • @AccessAnalytic
      @AccessAnalytic  Před měsícem +1

      Sharing the data model is generally for people to build visualisations in “thin” reports.
      Sharing tables allows people to build data models from a common source.

    • @javedabdool1187
      @javedabdool1187 Před měsícem +1

      Nice many thnx great video🎉

  • @tiago5a
    @tiago5a Před 9 měsíci

    Is it possible to use Python in the dataflow to transform data? Thanks

    • @AccessAnalytic
      @AccessAnalytic  Před 9 měsíci +1

      I don’t think so. You might like to look into Fabric ( currently in preview ) where you can write Python to cleanse data ready for Power BI to consume learn.microsoft.com/en-us/fabric/data-science/tutorial-data-science-explore-notebook?WT.mc_id=M365-MVP-5002589

    • @tiago5a
      @tiago5a Před 9 měsíci +1

      @@AccessAnalytic Ok Thanks! The problem of the Fabric it's the price...

    • @AccessAnalytic
      @AccessAnalytic  Před 9 měsíci +1

      @tiago5a - yep, it should eventually be around USD $200 per month for the cheapest version when it comes out of preview I think.

  • @martyc5674
    @martyc5674 Před 2 lety

    Hi Wyn- Im late to the party on this one-- I dont have a Pro Licence, is thata reason why i cannot create a new dataflow?

  • @bloodomen6919
    @bloodomen6919 Před 2 lety

    Hello Sir . I wish to connect an excel to a data flow. Please help.

    • @AccessAnalytic
      @AccessAnalytic  Před 2 lety

      Not possible yet.

    • @Mukeshkumar-cr3yc
      @Mukeshkumar-cr3yc Před 2 lety

      @@AccessAnalytic If we have onedrive business account can we not add excel still? via upload file(preview) option

    • @AccessAnalytic
      @AccessAnalytic  Před 2 lety

      Hi Mukesh, I’m not sure what you mean sorry.

    • @apoorvupadhyaya7899
      @apoorvupadhyaya7899 Před 2 lety

      @@Mukeshkumar-cr3yc Yes, it is possible. When creating the Dataflow, select Excel Workbook as the option. Only concern I can foresee is scheduled refreshes when the user is available (maybe leaves the company)

  • @remuslupinhp
    @remuslupinhp Před 4 měsíci

    I want to ask one thing
    Let's say I have an existing dataflow that's using a sharepoint folder to grab data files
    Now I want to replicate this dataflow to grab files from another folder
    The files in the new folder have slightly different columns, I want to adjust the query.
    And then finally, I want to use an existing Power BI file to use the new data flow
    I already have my old visuals
    Can these visuals get updated with the new dataflow?
    Wherever there is a column error I can change it easily but will this approach work or would I need to start developing all the visuals from scratch?
    Thanks

    • @AccessAnalytic
      @AccessAnalytic  Před 4 měsíci +1

      Edit your existing Power Query to point to the new dataflow. Your visuals & measures however will break if you rename the columns that are used in them. So you’d then need to go into each visual to fix them up

    • @remuslupinhp
      @remuslupinhp Před 4 měsíci

      @@AccessAnalytic Thanks a ton! You are a Godsend. One final question, the relationship model will not break will it? I can edit the visuals to refer to new columns but the relationships should not break? There is a very minor difference between the new and the old data flow, just a few columns not present in the new source file while a few new columns coming in. So that's about it. And as we want to keep both the dataflow running we don't want to mess up the existing for the new and also create a new one

    • @AccessAnalytic
      @AccessAnalytic  Před 4 měsíci +1

      @@remuslupinhp as long as the relationship columns and table names don't change then should be fine

    • @remuslupinhp
      @remuslupinhp Před 4 měsíci +1

      @@AccessAnalytic Super Thanks!

  • @tadstar
    @tadstar Před 2 lety +1

    I still struggle to see why this is better or gives you anything different from a shared dataset.. well presented video though . Thanks

    • @AccessAnalytic
      @AccessAnalytic  Před 2 lety +3

      Thanks tadstar. To build a dataset you need clean tables of data, you may have multiple datasets that use the same cleaned up table, so rather than doing the clean up multiple times you can do it once in a dataflow.
      Another reason is you may have a slow source system, and you have multiple datasets feeding off that system. Building a dataflow that pulls the tables you need and then building datasets off those rather than direct off the source can speed up your dataset refreshes and take the load off your source systems.
      Hope that helps a bit

    • @tadstar
      @tadstar Před 2 lety +1

      @@AccessAnalytic many thanks, that really does clear it up for me, I didn't consider shared clean tables .. brilliant explanation... 🙏

    • @AccessAnalytic
      @AccessAnalytic  Před 2 lety +1

      No worries

  • @danieljust295
    @danieljust295 Před 3 lety

    It wasn’t showed that the whole point is that the single dataset can use multiple data flows, so essentially model will consist of data flows only (or mostly).

    • @AccessAnalytic
      @AccessAnalytic  Před 3 lety

      That’s definitely a common scenario Daniel.

    • @McIlravyInc
      @McIlravyInc Před 4 měsíci

      How would this work if the power query pieces are built in Excel instead of power bi? Or is this a reason to use power bi over excel even if you don't use the reports/create dashboards (if only using to clean data/create worklists/export data that via VBA).
      This would reduce the need to either repeat myself in multiple tools or export cleaned data for other tools to read as it would refresh all the data in a schedule?

    • @AccessAnalytic
      @AccessAnalytic  Před 4 měsíci

      @McIlravyInc - excel can connect to dataflows. So yes centralise and re-use in your Excel and Power BI reports