Data Quality Testing in the Medallion Architecture with Pytest and PySpark

Sdílet
Vložit
  • čas přidán 20. 11. 2020
  • [ Lightning talk from Data + AI Summit 2020. Speaker: Carter Kilgour]
    Why data quality is especially important in the medallion architecture, and how to ensure it with scheduled testing and reporting. Databricks is proud to announce that Gartner has named us a Leader in both the 2021 Magic Quadrant for Cloud Database Management Systems and the 2021 Magic Quadrant for Data Science and Machine Learning Platforms. Download the reports here. databricks.com/databricks-nam...
  • Věda a technologie

Komentáře • 9

  • @puneetsharma4391
    @puneetsharma4391 Před rokem +4

    @Databricks Thanks for the video, I would like to setup same kind of data quality framework in my project , could you please share a step by step guide to setup the framework.

  • @user-hn2os3mu7x
    @user-hn2os3mu7x Před rokem +1

    Great, thanks a lot for share your knowledge

  • @mrkstein1
    @mrkstein1 Před rokem

    I would also appreciate a repo link, this is very helpful.

  • @Ruined_Pirate
    @Ruined_Pirate Před rokem

    Hi Team, it's a fantastic demo.. could you please share the bit bucket repo or the code reference. Thank You

  • @arturp.6780
    @arturp.6780 Před 3 lety +2

    Hi, thanks for the video. I cant find any link to the source code. Can you share it somewhere?

  • @chiragbansod4003
    @chiragbansod4003 Před 9 měsíci

    Hi team, can you provide us with the repo link for the code