[Coinbase Lakehouse Architecture] Achieving Data Warehouse Performance on a Data Lakehouse

  • Added on July 25, 2024
  • Join Sida Shen from CelerData and Eric Sun from Coinbase in this video as they dive into the latest advancements in data lakehouse querying and share tips to make the most out of your data lakehouse. They'll cover:
    🌟 Why you shouldn't rely on proprietary data warehouses just to speed up queries
    🌟 The latest cool stuff in query engines boosting lakehouse performance
    🌟 A close look at how Coinbase is using StarRocks, Delta Lake, and Unity Catalog
    ----------------------------------------------------------------------------------------------------------------------
    Timestamps
    00:00 Intro
    00:30 Data Lakehouse - Data Warehouse Features on Data Lake
    04:46 Challenges of Fast Data Lake Queries on Data Lake
    06:45 How to Accelerate Data Lake Query Performance
    08:14 What Is StarRocks
    08:54 How Fast Is a Purposely Built Lakehouse Engine
    09:43 SSB Benchmark Test - StarRocks vs. ClickHouse vs. Apache Druid, Out-Of-Box
    10:51 Benchmark: StarRocks as a Data Warehouse vs. StarRocks as a Lakehouse Query Engine - TPC-DS 1TB Benchmark
    11:13 Comparing to Other Query Engines: StarRocks vs. Trino
    13:11 Coinbase - Data Lake with Open Format, Unity Catalog, and Multiple Query Engines
    14:29 Coinbase's Data Stack
    15:12 How Coinbase Uses StarRocks + Unity Catalog + Delta Lake
    18:45 PuppyGraph
    19:45 DuckDB
    20:34 The True Benefit of an Open Lakehouse
    21:40 Conclusion
    ----------------------------------------------------------------------------------------------------------------------
    Learn more at celerdata.com/
    Connect with us:
    LinkedIn: / celerdata
    Twitter: / celerdata
    StarRocks GitHub: github.com/StarRocks/StarRocks
    StarRocks Website: www.starrocks.io/
    Slack: try.starrocks.com/join-starro...
    #DataAnalytics #DataEngineering #DataLakeAnalytics #OLAP #DataAnalyst #DataEngineer #DataInfrastructure #Database #AnalyticalDatabase #DataLake #DataLakeHouse #Trino #Presto #DataWarehouse #DataScience #ApacheIceberg
  • Science and Technology

Comments • 1

  • @celerdata  1 month ago

    🌟 Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack