CelerData
CelerData
  • 93
  • 46 343
Data Lake Query Engines: Trino vs StarRocks
Explore the differences between Trino and StarRocks as data lake query engines, their architectures, performance benchmarks, and suitable use cases for modern data analytics.
---------------------------------------------------------------------------------------------------------------------
Timestamps
00:00Intro
00:10 What is Trino?
00:56 What is StarRocks?
01:59 Performance Comparison: Trino vs StarRocks
02:33 Trino vs. StarRocks: Which to Use for Specific Use Cases
03:28 Conclusion
Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack
Try CelerData Cloud for free: celerdata.com/celerdata-cloud-free-trial
For a thorough exploration and more detailed learning, check out this webinar recording: czcams.com/video/1Ehnmtl60dQ/video.htmlsi=VhRGfz0NBA1JJ_7w
----------------------------------------------------------------------------------------------------------------------
Learn more at starrocks.com/
Connect with us:
LinkedIn: www.linkedin.com/company/celerdata/
Twitter: celerdata
CelerData Website: celerdata.com/
StarRocks GitHub: github.com/StarRocks/StarRocks
StarRocks Website: www.starrocks.io/
Slack: try.starrocks.com/join-starrocks-on-slack
#DataAnalytics #DataEngineering #DataLakeAnalytics #OLAP #DataAnalyst #DataEngineer #DataInfrastructure #UserFacingAnalytics #Database #AnalyticalDatabase #DataLake #DataLakeHouse #DataWarehouse #datasciencebasics
zhlédnutí: 177

Video

What Is StarRocks and What Are Its Use Cases?
zhlédnutí 181Před 21 dnem
StarRocks is an Apache-licensed open-source project under the Linux Foundation. As of June 2024, it has garnered 8.3 thousand stars on GitHub and has 354 contributors globally. Its adoption by major industry players attests to its reliability and efficiency in handling various data analytics needs. Get to know StarRocks in this video and discover how it excels as a data warehouse for real-time ...
How to Solve Data Upserts Challenges in OLAP Databases
zhlédnutí 164Před 21 dnem
Discover the challenges of data upserts in OLAP systems and explore effective strategies, including the delete and insert method, with real-world examples like Airbnb’s fraud detection. Timestamps 00:00 Intro 00:05 The Challenges of Data Upserts Challenges in OLAP 01:25 Delete and Insert Strategy 03:24 Data Upsert Example: Airbnb Fraud Detection - Challenges 04:42 Data Upsert Example: Airbnb Fr...
Materialized Views: Tips, Tricks, and Use Cases
zhlédnutí 307Před 21 dnem
Materialized views are one of StarRocks’ most popular and powerful features, but are you getting the most out of them? Murphy Wang, the technical mind behind the project’s materialized views, is ready to share all the latest tips and tricks to help you get the best query performance for your data pipeline. Session Highlights: 🌟Best practices for rolling out materialized views: Learn what causes...
[Coinbase Lakehouse Architecture] Achieving Data Warehouse Performance on a Data Lakehouse
zhlédnutí 458Před měsícem
Join Sida Shen from CelerData and Eric Sun from Coinbase in this video as they dive into the latest advancements in data lakehouse querying and share tips to make the most out of your data lakehouse. They'll cover: 🌟Why you shouldn't rely on proprietary data warehouses just to speed up queries 🌟The latest cool stuff in query engines boosting lakehouse performance 🌟A close look at how Coinbase i...
StarRocks 3.3 is Here: Key Features and Improvements
zhlédnutí 590Před měsícem
StarRocks 3.3 is here, and it's more powerful than ever! In this video, we'll walk you through everything you need to know to get the most out of this release. Let's dive in and explore the new features and enhancements together! 00:00 Intro & Agenda 01:21 StarRocks Use Cases - Lakehouse Query Engine 03:07 StarRocks Use Cases - Real-Time Analytics Workloads 05:24 StarRocks 3.3: Shared-Data 05:3...
Getting Started with CelerData Cloud Serverless: Intro and Live Demo
zhlédnutí 88Před 2 měsíci
🌟Our long-awaited CelerData Cloud Serverless solution is now available for public preview. This service provides a fully-managed StarRocks experience for enterprise workloads, with no infrastructure or VMs to manage. Explore what CelerData Cloud Serverless offers and discover the benefits of using this innovative solution. Dive into the details by watching the live demo, where we'll showcase it...
The Register & CelerData: Ditch Your Data Warehouse with Superior Lakehouse Performance
zhlédnutí 123Před 2 měsíci
Sida Shen, CelerData's Product Manager, recently joined The Register's Tim Phillips to explore how to ditch your data warehouse with superior lakehouse performance. While your data lakehouse serves as a single (open) source of truth for your data, it can also become a single source of frustration if you aim for sub-second queries at scale. Query engines not optimized for data-warehouse-like wor...
Going Serverless for Warehouse-Free Lakehouse Analytics
zhlédnutí 233Před 2 měsíci
In this video, CelerData PM Sida Shen pulls back the curtain on CelerData Cloud Serverless, a fully managed lakehouse query engine built on StarRocks. Highlights: 🌟Why you should keep your data in the lakehouse instead of proprietary warehouses. 🌟The StarRocks tech powering Serverless that enables data warehouse performance on the lakehouse 🌟Serverless features that make handling enterprise lak...
Challenges and Accelerations in Running Data Warehouse Workloads on Open Data Lake
zhlédnutí 71Před 2 měsíci
Running low-latency, high-concurrency queries on a data lake poses challenges due to various factors. Fetching data and metadata, often bottlenecked by slow network IO, is a primary concern. Data shuffling during query execution further compounds the issue, requiring significant network IO. Moreover, data lake storage devices like HDFS or cloud object storage can exhibit slow performance, with ...
How to Accelerate Data Lake Queries
zhlédnutí 74Před 2 měsíci
🌊 Delves into techniques to supercharge your data lake queries, from leveraging caching and in-memory data processing to utilizing advanced query engines like StarRocks. Learn how to achieve near-data warehouse speeds with lakehouse flexibility and discover the transformative potential of materialized views for on-demand query acceleration. Explore the details and see how fast your data engine ...
What Is Single Instruction Multiple Data and the Role of SIMD in Boosting OLAP Database Efficiency
zhlédnutí 147Před 3 měsíci
🌟 Uncover the advantages of using vectorized query engines with SIMD technology in OLAP databases. Vectorized engines, which store data in columns, are particularly beneficial for performing large-scale aggregations like summations-essential for tasks like weekly sales reports or regional employee counts. Unlike traditional databases that process data row by row, SIMD allows for multiple data p...
StarRocks Connect: Sling - Extract & Load Data From Your CLI With Ease and Speed
zhlédnutí 264Před 3 měsíci
Meet Fritz Larco, the brain behind Sling. Discover what sets Sling apart and the value it brings to StarRocks users. Sling (slingdata.io/) is a powerful data integration CLI tool that offers an easy solution to create and maintain high-volume data pipelines using the Extract & Load (EL) approach. Timestamps 00:00 Intro 02:17 The Origin Story of Sling 06:47 What Is Sling, What Does It Do, and Wh...
Unlock User Behavior with 87M Events Using Hudi, StarRocks & MinIO - Apache Hudi Community Call
zhlédnutí 167Před 3 měsíci
💡 During a recent @apachehudi community call, Albert Wong, who leads the StarRocks community, offered a detailed introduction to StarRocks and illustrated the process of constructing a modern data lakehouse with the help of #ApacheHudi and #MinIO. Working with a dataset containing 87 million events and 4 million unique products across 10,000 categories, Albert showed participants how to extract...
Tencent's A/B Testing SaaS Platform Unifies All SQL Workloads on the Data Lakehouse with StarRocks
zhlédnutí 49Před 3 měsíci
🎮 Tencent, a leading gaming company globally, developed an AB testing framework internally and sought to transform it into a Software as a Service (SaaS) platform called "ABetterChoice" to assist other gaming companies. However, this transition posed several challenges. Firstly, the internal system relied heavily on data warehousing and had limited query support, requiring extensive denormaliza...
How to Accelerate Apache Iceberg Queries
zhlédnutí 106Před 3 měsíci
How to Accelerate Apache Iceberg Queries
Apache Iceberg + StarRocks: Your Recipe for Superior Lakehouse Performance
zhlédnutí 1,5KPřed 3 měsíci
Apache Iceberg StarRocks: Your Recipe for Superior Lakehouse Performance
StarRocks Architecture: StarRocks as a Data Warehouse & StarRocks as a Lakehouse Query Engine
zhlédnutí 251Před 3 měsíci
StarRocks Architecture: StarRocks as a Data Warehouse & StarRocks as a Lakehouse Query Engine
StarRocks Connect: RisingWave
zhlédnutí 198Před 4 měsíci
StarRocks Connect: RisingWave
Getting Started Tutorial: Building a Data Lakehouse With StarRocks, Apache Hudi, and MinIO
zhlédnutí 480Před 4 měsíci
Getting Started Tutorial: Building a Data Lakehouse With StarRocks, Apache Hudi, and MinIO
StarRocks on Open Data Lakehouse Tutorial: StarRocks + Apache Iceberg + MinIO
zhlédnutí 581Před 4 měsíci
StarRocks on Open Data Lakehouse Tutorial: StarRocks Apache Iceberg MinIO
Tutorial: Getting Started with StarRocks - Storage-Compute Separation
zhlédnutí 590Před 4 měsíci
Tutorial: Getting Started with StarRocks - Storage-Compute Separation
What Is StarRocks: Features and Use Cases
zhlédnutí 1,5KPřed 4 měsíci
What Is StarRocks: Features and Use Cases
Challenges With Accelerating Data Lake Queries
zhlédnutí 61Před 5 měsíci
Challenges With Accelerating Data Lake Queries
How WeChat’s Lakehouse Design Efficiently Handles Trillions of Records
zhlédnutí 236Před 5 měsíci
How WeChat’s Lakehouse Design Efficiently Handles Trillions of Records
5 Brilliant Lakehouse Architectures from Tencent, WeChat, and More
zhlédnutí 1,1KPřed 5 měsíci
5 Brilliant Lakehouse Architectures from Tencent, WeChat, and More
StarRocks Connect: StarRocks + Helical Insight
zhlédnutí 123Před 5 měsíci
StarRocks Connect: StarRocks Helical Insight
TFiR X CelerData: CelerData Enables Data Engineers To Build New Analytics Projects Faster
zhlédnutí 52Před 5 měsíci
TFiR X CelerData: CelerData Enables Data Engineers To Build New Analytics Projects Faster
New Features and Updates in StarRocks 3.2
zhlédnutí 115Před 6 měsíci
New Features and Updates in StarRocks 3.2
StarRocks Community Call: What’s New in 3.2
zhlédnutí 506Před 6 měsíci
StarRocks Community Call: What’s New in 3.2

Komentáře

  • @sizhezhang7929
    @sizhezhang7929 Před 6 dny

    Thank you for promoting our platform to the industry, and letting more people know about it.

  • @BergHageman-ry1xr
    @BergHageman-ry1xr Před 22 dny

    What about Dremio vs StarRocks?

    • @SidaShen
      @SidaShen Před 22 dny

      While Dremio explicitly forbids publishing performance tests or analysis against them, where you'll see the biggest fundamental difference is in the codebase. StarRock's is developed in C++ where Dremio is based on Java. Being C++ gives StarRocks the advantage of utilizing lower-level optimizations such as SIMD, which is crucial for fast OLAP queries. Additionally, StarRocks is fully open-source and actively maintained by a global community of contributors. In contrast, although Dremio is open-source, there has been no update for the past three months and currently, it does not support the submission of new issues.

  • @celerdata
    @celerdata Před 23 dny

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 23 dny

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @andriifadieiev9757
    @andriifadieiev9757 Před 24 dny

    For querying datalake, can StarRocks just connect to it or should do a full/incremental copy?

    • @celerdata
      @celerdata Před 24 dny

      StarRocks directly query data lake with its external catalog framework, no ingestion/data copying needed. Link to the docs: docs.starrocks.io/docs/sql-reference/sql-statements/data-definition/CREATE_EXTERNAL_CATALOG/

  • @dirlt
    @dirlt Před 24 dny

    Great stuff. Thanks for sharing.

  • @celerdata
    @celerdata Před 25 dny

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @andriifadieiev9757
    @andriifadieiev9757 Před 27 dny

    Great episode, keep going!

  • @sakesun
    @sakesun Před 28 dny

    Meh. All talk, no demo at all ?

  • @tratkotratkov126
    @tratkotratkov126 Před měsícem

    how StarRocks compares with Databricks with Photon engine ?

  • @abdullahmajed7554
    @abdullahmajed7554 Před měsícem

    Where i can find the full video

  • @abdullahmajed7554
    @abdullahmajed7554 Před měsícem

    Great content! Am really excited to test starrocks, but i have a question please what the difference between impala and starrocks ?

    • @SidaShen
      @SidaShen Před měsícem

      There is a huge performance difference. StarRocks' operators are fully vectorized so it is way faster for OLAP queries. Also query planning, StarRocks' cost-based optimizer can generate much more efficient query plans

    • @abdullahmajed7554
      @abdullahmajed7554 Před měsícem

      @@SidaShen this is really promising, the reason why i asked is because we are using on premise cloudera solution, and impala is already configured, is there any official docs on setting up starrocks with cloudera

    • @abdullahmajed7554
      @abdullahmajed7554 Před měsícem

      @@SidaShen great!, the reason i asked because we are running on premise cloudera solution, which impala is already configured, is there an official guide on setting up starrocks with cloudera?

    • @celerdata
      @celerdata Před měsícem

      @@abdullahmajed7554 Are you using Apache lceberg with Cloudera? If it is it should work. You can join the StarRocks channel and we can chat there about specific integrations bit.ly/starrocks-slack :D

  • @user-vx1rr2yg7l
    @user-vx1rr2yg7l Před měsícem

    Got error ERRcOR 1064 (HY000): Unexpected exception: Failed to create shards. error: INVALID_ARGUMENT:shard info can not be empty. when try create table, also starrocks-cn is not started.

    • @celerdata
      @celerdata Před 24 dny

      Would you please join the StarRocks slack channel? We can help you out there. bit.ly/Join-StarRocks

  • @celerdata
    @celerdata Před měsícem

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @dirlt
    @dirlt Před měsícem

    About internal shuffling, yes, it's via gRPC to shuffle chunks. About data lake, yes, we can read avro files. Sida, you really did a great job!

  • @celerdata
    @celerdata Před měsícem

    Useful Links: 🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack 🌟StarRocks 3.3 RC02 release note: docs.starrocks.io/releasenotes/release-3.3/ 🌟StarRocks Best Practices: www.starrocks.io/blog/starrocks-best-practices-data-modeling

  • @celerdata
    @celerdata Před 2 měsíci

    Useful Links: 🌟Try CelerData Cloud Serverless for Free: serverless.celerdata.com/login 🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack StarRocks GitHub: github.com/StarRocks/StarRocks StarRocks Website: www.starrocks.io/

  • @zamw4276
    @zamw4276 Před 2 měsíci

    Hi! Which catalog for Iceberg would you recommend for Financial company with high reqs for regulatory reporting?

    • @celerdata
      @celerdata Před 2 měsíci

      @zamw4276 Good question, to be completely honest it'll really depend on your inifrastructure. Generally speaking, a good place to start would be this video by Tabular: czcams.com/video/G2YMCPdQfgM/video.htmlsi=BXMT0KRViUWuXzc7 but if you want a more personalized answer I'd suggest asking this question in the StarRocks Slack channel here: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 2 měsíci

    Useful Links: Try CelerData Cloud Serverless for Free: serverless.celerdata.com/login Join StarRocks on Slack: join.slack.com/t/starrocks/shared_invite/zt-2fou0ynxe-mZpcW54KewxWocehcBNoKQ StarRocks GitHub: github.com/StarRocks/StarRocks StarRocks Website: www.starrocks.io/

  • @celerdata
    @celerdata Před 2 měsíci

    Useful Links: 🌟Try CelerData Cloud Serverless for Free: serverless.celerdata.com/login 🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack StarRocks GitHub: github.com/StarRocks/StarRocks StarRocks Website: www.starrocks.io/

  • @SebastianSativaLivemore
    @SebastianSativaLivemore Před 2 měsíci

    Love starrocks! Do you guys have any plans to support Redpanda / Kafka as the storage layer? Especially for enterprises with the tiered aws storage and shadow indexing in Redpanda, I think Starrocks as the lakehouse engine would be absolutely amazing!

    • @celerdata
      @celerdata Před 2 měsíci

      Thank you for your support and interest in StarRocks! We currently don’t have plans to integrate Redpanda/Kafka as the storage layer in our H1 or Q3 roadmap. If you haven't already, we invite you to join our StarRocks community on Slack - join.slack.com/t/starrocks/shared_invite/zt-2fou0ynxe-mZpcW54KewxWocehcBNoKQ to discuss this further and stay updated on future developments!

  • @celerdata
    @celerdata Před 3 měsíci

    Useful Links: 🌟 Load Data into StarRocks from Any Database vis Sling: blog.slingdata.io/load-data-into-starrocks-from-any-database 🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    Useful Links: 🌟Tutorial: github.com/StarRocks/demo/tree/master/documentation-samples/datalakehouse 🌟Join StarRocks Slack Channel: try.starrocks.com/join-starrocks-on-slack 🌟 Apache Hudi Quick Start: docs.starrocks.io/docs/quick_start/hudi/ 🌟Slides: www.slideshare.net/slideshow/unlock-user-behavior-with-87-million-events-using-hudi-starrocks-minio/266628987

  • @celerdata
    @celerdata Před 3 měsíci

    Useful Links: 🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack 🌟[doc] Quick start: Apache Iceberg Lakehouse docs.starrocks.io/docs/quick_start/iceberg/ 🌟Case Studies and More: www.starrocks.io/blog 🌟StarRocks GitHub: github.com/StarRocks/StarRocks 🌟StarRocks Website: www.starrocks.io/

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack 🌟 Explore the Case Study: hubs.la/Q02s0yLC0

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack 🌟 Read the Case Studies: hubs.la/Q02s0yKN0

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks Community on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack

  • @celerdata
    @celerdata Před 3 měsíci

    Useful Links: 🌟Join StarRocks on Slack: try.starrocks.com/join-starrocks-on-slack 🌟[doc] Quick start: Apache Iceberg Lakehouse docs.starrocks.io/docs/quick_start/iceberg/ 🌟Case Studies and More: www.starrocks.io/blog 🌟StarRocks GitHub: github.com/StarRocks/StarRocks 🌟StarRocks Website: www.starrocks.io/