S2024 #17 - Google BigQuery / Dremel (CMU Advanced Database Systems)

Sdílet
Vložit
  • čas přidán 26. 08. 2024
  • Andy Pavlo (www.cs.cmu.edu...)
    Slides: 15721.courses....
    Notes: 15721.courses....
    15-721 Advanced Database Systems (Spring 2024)
    Carnegie Mellon University
    15721.courses....

Komentáře • 6

  • @oz5219
    @oz5219 Před 4 měsíci +6

    woud 16th lecture be available?

  • @SteveLoughran
    @SteveLoughran Před 4 měsíci

    Another little detail: spark can delegate saving of shuffle data to the Hadoop Yarn NodeManager process -which can serve data even after the spark worker process terminates. This allows for more agile spark clusters within a Hadoop cluster. However, with the move to kubernetes container hosting spark serves the data itself and assumes that it won’t terminate. This is potentially a problem with deployment on spot-priced cloud VMs as the “your server is fairly reliable” no longer holds.…

  • @cerealpeer
    @cerealpeer Před 4 měsíci +1

    😂😂😂😂😂😂w
    big google
    big stinq