S2024 #01 - Modern OLAP Database Systems (CMU Advanced Database Systems)

  • Added 27 Aug 2024
  • Andy Pavlo (www.cs.cmu.edu...)
    Slides: 15721.courses....
    Notes: 15721.courses....
    15-721 Advanced Database Systems (Spring 2024)
    Carnegie Mellon University
    15721.courses....

Comments • 15

  • @DharanAditya
    @DharanAditya 19 hours ago

    This is a very nice talk. It improved my understanding of modern OLAP systems. Thank you.

  • @kushagrajain2407
    @kushagrajain2407 12 days ago

    Best introduction to OLAP databases. It took me 2 days to fully understand the paper, but I am so excited to read the other papers and finish all the lectures bit by bit. Thanks for the videos, Professor :)

  • @mephistotel87
    @mephistotel87 7 months ago +4

    It's sad that we can't see anything on the screen. It isn't required, but it would sometimes be helpful when Andy points something out on the slides. Not complaining, just mentioning it. Andy is the best lecturer I know on CZcams.

    • @andypavlo
      @andypavlo 7 months ago +6

      This is my fault. I messed up the white balance on the camera. I will fix it for the next class.

  • @AaronClark-ACM
    @AaronClark-ACM 4 months ago

    Thank you, Dr. Pavlo, for this great lecture!

  • @billyean
    @billyean 7 months ago +2

    Awesome, Andy's coming back! Looking forward to this semester.

  • @BLASTFROMTHEFUTURE
    @BLASTFROMTHEFUTURE 7 months ago

    A note on 38:30:
    "The query plan of Postgres is a tree; you only have one parent, whereas with a DAG you can do some computation here for a nested query and send it to two different parts of the tree."
    While the query plan is always shown as a tree, Postgres does support multiple references to materialized queries when CTEs are involved, effectively making the query plan of one SQL query a DAG. You'll see this as, e.g., a "CTE Scan" node:
    EXPLAIN
    WITH a AS (SELECT DISTINCT relname FROM pg_class)
    SELECT min(b.relname), max(c.relname)
    FROM a b, a c;
    It's not super optimized, as there is no indexing on the intermediate table, but it does allow some deduplication of work and technically makes the plan a DAG (even if the CTE scan isn't done in parallel with the construction of the materialized CTE).

    • @meamzcs
      @meamzcs 7 months ago

      Yes, but with a DAG you don't need to materialize it; you can even pipeline. What you described is essentially just like two base table scans from the same table. Sure, you can draw those as a DAG too, but that's not what you're actually doing.
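
The exchange above can be illustrated with a minimal Python sketch of a plan node that has two parents. This is a toy model, not Postgres's executor: `PlanNode`, its `runs` counter, and the sample `relnames` list are all hypothetical names made up for illustration. It shows the materialize-once behaviour described for the "CTE Scan" case: the shared node is computed one time and both consumers read the stored result.

```python
class PlanNode:
    """Toy plan operator (hypothetical) that materializes its output on first read."""

    def __init__(self, name, fn, children=()):
        self.name = name
        self.fn = fn                    # computes this node's rows from its children's rows
        self.children = list(children)
        self.runs = 0                   # how many times fn was actually executed
        self._rows = None               # materialized result, filled on first call

    def rows(self):
        # Compute once, then serve every later parent from the stored result.
        if self._rows is None:
            self.runs += 1
            self._rows = self.fn(*(child.rows() for child in self.children))
        return self._rows


# Made-up stand-in for pg_class.relname; not real catalog data.
relnames = ["pg_class", "pg_type", "pg_class", "pg_attribute"]

# WITH a AS (SELECT DISTINCT relname ...): one node, referenced by two scans.
cte_a = PlanNode("CTE a", lambda: sorted(set(relnames)))
scan_b = PlanNode("CTE Scan b", lambda rows: rows, [cte_a])
scan_c = PlanNode("CTE Scan c", lambda rows: rows, [cte_a])

# SELECT min(b.relname), max(c.relname) FROM a b, a c
aggregate = PlanNode("Aggregate", lambda b, c: (min(b), max(c)), [scan_b, scan_c])

result = aggregate.rows()
print(result, "CTE executions:", cte_a.runs)
```

Here `cte_a` has two parents, so the graph is a DAG, yet each parent simply rescans a stored result, which is the point of the reply. A pipelined DAG would instead stream rows from the shared operator to both consumers as they are produced; this sketch does not model that.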

  • @sashibhushanarajput1194
    @sashibhushanarajput1194 7 months ago +1

    Excited!!!

  • @OldRunningMonk
    @OldRunningMonk 6 months ago +2

    Great lectures, Andy, appreciate you uploading all the videos! I’ve been meaning to ask you this: what software do you use to prepare the slides? It doesn’t seem like PowerPoint. But if it is, mad skills.

  • @ibrahimrabbani94
    @ibrahimrabbani94 22 days ago

    Is there a Discord channel for CMU 15-721?

  • @commandmaster7101
    @commandmaster7101 7 months ago +5

    Why did they move you from that nice classroom in the previous semester to this basement that looks like a CIA black site? Blink twice if you need help.

  • @liketheduck
    @liketheduck 5 months ago

    @andypavlo Is this new semester the best place to start as of 2024? I see that there are differences compared to 15-721 Advanced Database Systems (Spring 2023)'s playlist.

    • @andypavlo
      @andypavlo 5 months ago +2

      The course is always evolving as research/industry changes. The latest semester is the most up-to-date.
      -- Andy

  • @Milhouse77BS
    @Milhouse77BS 7 months ago

    Leaving out the Microsoft OLAP stack behind Power BI seems like a big omission. The Tabular model in Excel Power Pivot, Analysis Services, and Power BI has been using columnar storage since 2010.