Microsoft Fabric: Data Warehouse vs Lakehouse vs KQL Database

Sdílet
Vložit
  • čas přidán 27. 06. 2024
  • In Microsoft Fabric, we have the data warehouse, the lakehouse and the KQL database as options to choose from if we want to store some data, BUT which of these should YOU choose? And how should you begin to decide?
    This video steps through the data warehouse, the lakehouse and the KQL database in detail, comparing their features and limitations, so that can make the right decision for your business.
    Towards the end of the video, we look at how to combine these data stores into end-to-end architectures (medallion architecture).
    Catch up on the Power BI to Microsoft Fabric Transition Guide series here: • Power BI to Fabric Tra...
    Timeline
    0:00 Intro
    1:08 It's all delta parquet
    2:30 Lakehouse overview
    7:17 Data Warehouse oveview
    8:28 KQL Database overview
    11:05 Side-by-side comparison
    12:07 Data Types
    13:08 Languages
    15:00 Copilot + low/no code
    16:45 Ecosystems
    18:40 Security granularity options
    20:55 Typical Workloads
    22:33 Other unique features (including Git)
    23:49 Personas
    24:36 Practical applications (end-to-end examples)
    #microsoftfabric #powerbi #dataanalytics

Komentáře • 97

  • @LearnMicrosoftFabric
    @LearnMicrosoftFabric  Před 3 měsíci +11

    Hey Fabricators! If you found this video helpful, please LIKE, SUBSCRIBE and SHARE with colleagues to get the word out - THANK YOU! 🙌🙌👍👍

    • @_TheDataTrain
      @_TheDataTrain Před 3 měsíci +3

      Like the pragmatic approach. You're walking the tour in a naturally calm way enabling viewers to follow your thinking very easily.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci

      ah thanks for the feedback! were you talking about the first half in Fabric or on the tablet? Just curious to hear which style people prefer, or a mixture of the two? Thanks for watching!

    • @LiamMcCauley
      @LiamMcCauley Před 3 měsíci +3

      @@LearnMicrosoftFabric I liked the mix of both - the tablet was useful for methodically going through each point, but I also like being able to relate that to seeing the tool on screen.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci

      Good to know, thanks!!

    • @mattroberts9665
      @mattroberts9665 Před 3 měsíci +1

      @@LiamMcCauley Agreed. A mix of both was great in this video. But divvying out each section into more digestible sized videos maybe the way to go. As much as i watched this video all the way through. 30 minutes may be too long for others.

  • @pimentelallanap
    @pimentelallanap Před 3 měsíci +5

    This content is pure gold. Uncredible quality!

  • @brianszeto3418
    @brianszeto3418 Před 3 měsíci +5

    Great video! The additional data security options for data warehouse is another reason to implement the gold tier using data warehouse instead of data lakehouse, for example if you want to have a unified data warehouse within your organization but you want to control who can access what dimensions/facts based on criteria such as departments, job roles, etc.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci

      Yes exactly! Thanks for watching!! Have you been building things in Fabric yet Brian?

    • @brianszeto3418
      @brianszeto3418 Před 3 měsíci

      @@LearnMicrosoftFabric not yet but looking at getting the DP-600 cert in the next few months hopefully!

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci +5

      @@brianszeto3418 nice, good luck with that! I will probably be moving to DP-600 specific content after this series (April time) which might be useful for you 👍👍

  • @mattroberts9665
    @mattroberts9665 Před 3 měsíci +1

    I've got to say your videos are great and your delivery style is excellent. Remarkably personable.
    Keep doing what your doing. Distilling down this huge transition that some of us are going through, in that moving from the traditional Azure data services (ADF, Databricks, Synapse etc) into Fabric. Other CZcamsrs show similar material, but you're diving just that little bit deeper. Nice one.
    Matt

  • @tyronefrielinghaus3467
    @tyronefrielinghaus3467 Před 3 měsíci +1

    Will, that was a a really great video : nice and comprehensive , and the flow at the end was great : thanks!!!

  • @azwarmzafar
    @azwarmzafar Před 3 měsíci

    your content is really straightforward, to the point, well designed and perfectly communicated. thanks a million.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci +1

      Thanks a lot for your kind words! Glad you're finding it helpful 🙌

  • @deepakgupta4479
    @deepakgupta4479 Před 3 měsíci +2

    Superb Explanations...! and Great Video

  • @bkundrat
    @bkundrat Před 3 měsíci

    Outstanding. The layout of the content is extremely helpful.

  • @alexshteinberg1227
    @alexshteinberg1227 Před 3 měsíci

    Great Video! really appreciate the breakdown and the possible outcomes.👍

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci

      Thanks a lot Alex! Glad you enjoyed the breakdown! Have you been building with any of these data stores yet?

  • @SureshGuddanti
    @SureshGuddanti Před 2 měsíci

    Will, love the way you explained this much discussed topic . Thank you

  • @ClintElyonson
    @ClintElyonson Před 3 měsíci

    NIce one Will, good to see you dominating the Fabric space. well done on great videos. I'm learning from you :)

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci +1

      Hi Clint, great to see you here :) hope you're doing well, and thanks for the support 🙌

  • @carloschebair3406
    @carloschebair3406 Před 7 dny

    great info! thanks for sharing!

  • @peternguynguyen5208
    @peternguynguyen5208 Před 3 měsíci

    Very detailed explanations. Thank you

  • @fabianaltendorfer11
    @fabianaltendorfer11 Před 2 měsíci +1

    Insane, thank you so much Will.

  • @kates1736
    @kates1736 Před 2 měsíci +1

    Very helpful explanation of the topic! Thank you Will!

  • @hansvetters8026
    @hansvetters8026 Před 3 měsíci

    Great content indeed! It helps me to clear the mist in my head, many thanks for the work you are putting in this series! I am pretty much an on prem DWH architect making my feet wet to build out an ETL architecture in fabric.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci +1

      That's good to hear! Glad the videos are helping :) hopefully it's making sense for you with your background? Lots of exciting opportunities in Fabric over an on-prem DWH 😊

  • @rabihbadr54
    @rabihbadr54 Před 3 měsíci

    Very informative, thank you!

  • @juanm555
    @juanm555 Před 2 měsíci

    Awesome video man, didn't know your channel, subscribed! thx a lot

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 2 měsíci

      Welcome to the channel, lots of video to catch up on 😉 haha thanks for watching!!

  • @AdrianMcGavock
    @AdrianMcGavock Před 3 měsíci

    this is a great series, really enjoying the content... very nicely explained without unnecessary jargon or unnecessary complexity - well done sir 🍻

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci

      Awesome, glad you're enjoying it!! Next video coming out on Friday 😊

  • @rameshpaskarathas6512
    @rameshpaskarathas6512 Před 2 měsíci

    Thanks! Keep up the great work.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 2 měsíci

      very generous, thanks a lot Ramesh 🙌🏽 i will do, just getting started ☺️

  • @fabricioferreiraz
    @fabricioferreiraz Před 2 měsíci

    Simple amazing! I believe this is the best video that I’ve seen to a end to end solution for fabric.

  • @csenaraths
    @csenaraths Před 3 měsíci

    Good Explanation !

  • @samirvaghasiya9918
    @samirvaghasiya9918 Před 2 měsíci

    Thank you very much Will.
    Really useful :)

  • @datboi6066
    @datboi6066 Před 2 měsíci +1

    Great great video, thank you

  • @TheRamiroB
    @TheRamiroB Před 3 měsíci

    Good video, so really clear...

  • @FrankyLeeuwerck
    @FrankyLeeuwerck Před 2 měsíci

    Top presentation!

  • @VigneshKiswanth
    @VigneshKiswanth Před 5 dny

    Thank you for the excellent video. I have a question: Which is preferable in terms of Audit Framework and Watermarking? We encountered update conflicts in the warehouse table when attempting to update audit tables while ingest multiple tables simultaneously, which updates watermark and audit values. To circumvent this issue, we've adopted an insert-only approach for both Audit and Watermark frameworks, which has proven effective. There's also discussion about using a KQL database, but we're uncertain about its complexity.

  • @rameshpaskarathas6512
    @rameshpaskarathas6512 Před 2 měsíci

    Great video Will. Thoughts on having a lakehouse for gold layer as well but perhaps then loading this to warehouse for SQL specific tasks like masking and object or column security. I just like the flexibility of lakehouse languages offer.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 2 měsíci

      yeh I’m pretty sure you can do that stuff on the Lakehouse SQL endpoint (no need to move over to DWH). just be careful that your audience has no more than Viewer role in the workspace otherwise anything you set on the SQL endpoint can be sidestepped (using Spark). hope that makes sense?

  • @AbelGarcia-ki5nd
    @AbelGarcia-ki5nd Před měsícem

    Amazing thanks for the breakdown! I'd love to throw pricing in the comparison, maybe for another video?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před měsícem

      The cost for each will be similar. In Fabric, most of ‘cost’ comes from Compute (I.e. how you operate on the data), rather than which data store you use 👍

  • @danielkrumov5095
    @danielkrumov5095 Před 3 měsíci +1

    Hey, Will, I have a question. In the previous video comparing Data pipeline vs Dataflow.., you mentioned that it is not possible to make a shortcut from a Lakehous to a Data Warehouse. However, in the end-to-end architecture example in this video, the third layer is a Data Warehouse that is getting data from the previous layer - a Lakehouse. How can we set up this workflow, then?
    Otherwise, I find this video and your content extremely helpful and insightful! Keep up the amazing work!

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci +1

      Hey Daniel, great question, I probably should have clarified that point in one of the videos. Although you can't create a shortcut from a lakehouse into a DWH, you can actually query a lakehouse from a DWH using something like select * from [LakehouseName].[dbo].[TableName] - this makes your 'Silver' layer accessible in your 'Gold' DWH. You can build views from the DWH (from Silver lakehouse data) or if you want to 'materialize' some gold analytical models, you could use stored procedures with an INSERT INTO you gold table. Hope that makes sense? Another option is to use a data pipeline or dataflow to write gold tables into the data warehouse (unfortunately we can write to a data warehouse using a Notebook).

    • @danielkrumov5095
      @danielkrumov5095 Před 3 měsíci

      ​@@LearnMicrosoftFabric Thank you so much for your quick and extensive answer! It definitely answered my question and I have more clarity now. Keep crushing it with the amazing content!

  • @TheXSpud
    @TheXSpud Před 2 měsíci

    Very clear and informative video of the different options in Fabric. A related question, admittedly not in the scope of this video, is using semantic models vs data warehouses to share data with customers (effectively the Power BI devs). At my org, we're generally planning to create semantic models to give users access to gold data, how do you stack that up against giving access to a gold data warehouse? Are there use cases for both for this task?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 2 měsíci

      Yeh depends on your requirements, and how much access/flexibility you want to give them.

  • @andynovak9838
    @andynovak9838 Před měsícem

    Greetings Will -- fantastic job! You happen to have your hand drawn diagram available in electronic form that you could share?

  • @sabastian61
    @sabastian61 Před měsícem

    For me. It is always the Lakehouse because everything else is derived from there.

  • @FatherLizard
    @FatherLizard Před 3 měsíci

    Another amazing video - thank you. Question: how does one decide on how many lakehouses/warehouses to have? Do you divide by the many functional areas of your business or can you dump all your organization’s data into just a few lakehouses?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci +1

      Thanks for watching! Really depends on your requirements and preference as a business! Normally the biggest factors are whether you want git integration and separation of data for security reasons

    • @TheRamiroB
      @TheRamiroB Před 3 měsíci +1

      Hi, my two cents. Depends of the method that you use for organization data, if you came from mesh you focus on owner data, if you came form fabric probably y use funtional aproachh.. if you start from scratch yo divede by responsability grousps and security requeriments. but remember you can change in the future.

  • @shunlai4218
    @shunlai4218 Před 3 měsíci

    Hi Will,
    Thanks for all the insightful video.
    got a question about Power BI DirectLake mode,
    can a DirectLake dataset be created based on Data Warehouse as well?
    Many thanks!

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci

      Yes can be built from both Lakehouse or data warehouse 👍 next video will focus on Direct Lake ☺️

  • @chinmaykajalwa
    @chinmaykajalwa Před 3 měsíci

    Great video Will. I have few questions.
    1. "Materialized views" are supported in warehouse, but not in lakehouse. Please confirm.
    2. In my ongoing project of Power BI to Fabric migration, i am thinking of the following pattern. Bronze and Silver layer in lakehouse and Gold layer spread across LH and WH. To be precise, star schema in LH and materialized views in WH. Your opinion about this ?
    3. There is one more entity "Data Mart". It has good RLS and OLS features. Any idea when to use this and how Data Mart is different from LH and WH?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci

      1. No, materialised views only in KQL database, not data warehouse. In Lakehouse you can imitate the effect by writing to a different table (which is effectively what a materialised view is)
      2. It’s difficult to give opinion on architecture without context, but in general LAKE-Lake-warehouse is a common pattern.
      3. Don’t have much experience with data mart so can’t really comment. Look at Radacad’s article on the subject I believe he mentions data marts in his comparison 👍

  • @04nimmot
    @04nimmot Před 18 dny

    Preparing for DP-600, my first Microsoft exam, but KQL wasn't covered in the learning material, thanks!

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 18 dny

      KQL is not assessed in the DP-600 exam

    • @04nimmot
      @04nimmot Před 18 dny

      @@LearnMicrosoftFabric strange as its in the practise exam

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 18 dny +1

      @@04nimmot oh really? that's surprising to me... I'm 98% sure it's not in the exam. I didn't get any KQL questions in my exam. And it's not mentioned in the Study Guide: learn.microsoft.com/en-us/credentials/certifications/resources/study-guides/dp-600

    • @04nimmot
      @04nimmot Před 18 dny

      @@LearnMicrosoftFabric you're definitely correct its not in the learning guide, had a question on ALM Toolkit too. Bit of looking up XMLA and should be okay for the exam in two weeks, I think..

  • @user-eq9tr9lp8r
    @user-eq9tr9lp8r Před měsícem

    Hi, this is Gabee from Brazil. Thank you for this amazing lesson! ⌛ I have a question about RLS in the Lakehouse. Does it apply to Power BI as well?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před měsícem

      Currently, no, you will have to define in the semantic model too. In the future, yes (when they roll-out a feature called OneSecurity, possibly by the end of the year).

  • @michaeldelaney102
    @michaeldelaney102 Před 3 měsíci

    @LearnMicrosoftFabric do you have the side-by-side comparison as a downloadable file? Thank you

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci +1

      Hi Michael, I've added it to the resources section here: www.skool.com/microsoft-fabric/classroom/c75b239c?md=b89a483c2f8146d0adfc7366f714d3d3

  • @PaulBailey3
    @PaulBailey3 Před 3 měsíci

    Hey Will, I was reading today that Lakehouse default function is replace and not append any longer. Will these deltas still apply to lakehouses after that announcement?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 3 měsíci

      Hi Paul, not sure I entirely understand your question, but the underlying format for all tables in Lakehouse is delta parquet, so parquet files with a delta log that tracks changes (whether they be through a replace or an append)

  • @patdia2906
    @patdia2906 Před 3 měsíci

    I love u

  • @darins
    @darins Před měsícem

    MARK
    ✅ Dynamic FilePath
    ✅ Add Date from Within File (this is good)
    ✅ Flatten Header Rows with fx (ooo la la, add-in's ftw)
    ✅ Remove null Columns (Wow!!)
    ✅ Change Header of Column (I forget to use reference tricks!)
    ✅ Filter by a List with Merge (hyperspeed!)
    ✅ Clean data in 1 step (I really really love fewer steps!)
    CHANDEEP
    ✅ Eliminate Promoted Header Step (love fewer steps!)
    ✅ Create Columns from a Record (I really love fewer steps!!)
    ✅ Add Total Row to Table (wierd, but cool)
    ✅ is Number or Structure (weirder, but cool)
    ✅ Projections (weirdest, but cooler)
    ✅ Apply a function to all columns of a table
    ✅ Group Kind. Local to count stints (🤯)
    It's a tie! Which mean I win! 🙏Thanks guys!!!

  • @relaxplease2472
    @relaxplease2472 Před 7 dny

    I have one doubt, in this video you are telling we need warehouse for data modelling. can we use warehouse that is automatically created for each lakehouse for data modelling? Instead of creating a new one? Can anyone explain this?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  Před 7 dny

      I'm not telling people what I think they need, just talking through the characteristics of each to allow people to make their own decisions 👍 every company is different, with different requirements.
      You can use the T-SQL endpoint of a Lakehouse to perform some data modelling, but you will be limited to read-only operations.

    • @relaxplease2472
      @relaxplease2472 Před 7 dny

      @@LearnMicrosoftFabric thank you for your response man. I got it

  • @carlosnavia1361
    @carlosnavia1361 Před měsícem