Getting Started Tutorial: Building a Data Lakehouse With StarRocks, Apache Hudi, and MinIO

Sdílet
Vložit
  • čas přidán 24. 07. 2024
  • 🌟 Discover how to use StarRocks and Apache Hudi to build a robust open data lakehouse architecture capable of handling the most demanding data warehouse workloads on open and standardized storage.
    Learn how Apache Hudi enhances data freshness and provides a solid foundation for fast queries, while StarRocks offers incredible querying speeds and data warehouse-like performance across various open data lakes.
    This demo will guide you through setting up StarRocks and Apache Hudi with MinIO for storage using Docker Compose.
    Discover the benefits of this integration, including simplified data architecture, improved data governance, and the elimination of the need for data copying for query acceleration.
    ----------------------------------------------------------------------------------------------------------------------
    Timestamps
    00:00 Intro
    00:33 Apache Hudi Overview
    01:47 StarRocks as a Data Lakehouse Query Engine
    03:58 Performance Comparison: Trino vs. StarRocks
    04:27 Demo Walkthrough
    05:05 Deployed StarRocks and a Hudi/Spark/MinIO environment in Docker
    07:44 Configure MinIO
    08:34 Ingest Data
    10:15 Connect StarRocks with Apache Hudi and Query the Data
    11:35 Verify That Data Is Stored in MinIO
    12:12 QuickStart Doc: docs.starrocks.io/docs/quick_...
    Download StarRocks: www.starrocks.io/download/com...
    -----------------------------------------------------------------------------------------------------------------------
    Learn more at starrocks.com/
    Connect with us:
    LinkedIn: / celerdata
    Twitter: / celerdata
    CelerData Website: celerdata.com/
    StarRocks GitHub: github.com/StarRocks/StarRocks
    StarRocks Website: www.starrocks.io/
    Slack: try.starrocks.com/join-starro...
    #DataAnalytics #DataEngineering #DataLakeAnalytics #OLAP #DataAnalyst #DataEngineer #DataInfrastructure #UserFacingAnalytics #Database #AnalyticalDatabase #DataLake #DataLakeHouse #DataWarehouse #DataScience #ObjectStore #Docker #MinIO #Hudi #ApacheHudi
  • Věda a technologie

Komentáře • 1

  • @celerdata
    @celerdata  Před 4 měsíci +2

    🌟Download StarRocks: www.starrocks.io/download/community
    🌟Tutorial - Apache Hudi Lakehouse: docs.starrocks.io/docs/quick_start/hudi/
    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack
    🌟StarRocks GitHub: github.com/StarRocks/StarRocks
    🌟StarRocks Website: www.starrocks.io/