14: Distributed Logging & Metrics Framework | Systems Design Interview Questions With Ex-Google SWE

Sdílet
Vložit
  • čas přidán 25. 07. 2024
  • 200 videos and we're still talking about logs - thanks guys, you're the best
  • Věda a technologie

Komentáře • 51

  • @martinwindsor4424
    @martinwindsor4424 Před 4 měsíci +25

    Jordan might not be a pregnant, but he never fails to deliver.

  • @chadcn
    @chadcn Před 4 měsíci +6

    Congrats on 200 videos mate! Keep up the great work 🚀🚀

  • @siddharth-gandhi
    @siddharth-gandhi Před 4 měsíci +2

    Bro's single handedly making me question studying ml over systems. Bravo on these videos!

  • @doobie91
    @doobie91 Před 4 měsíci +2

    Thanks a lot for your videos. Currently looking for a new job, brushing up/learning a lot about system design, watched lots of your videos recently. Appreciate your work. Keep it up!

  • @jordiesteve8693
    @jordiesteve8693 Před 16 dny +1

    thanks for your work!

  • @helperclass8710
    @helperclass8710 Před 3 měsíci +1

    Great video man. Thanks.

  • @hoyinli7462
    @hoyinli7462 Před 4 měsíci +1

    great job!

  • @prasenjitsutradhar3368
    @prasenjitsutradhar3368 Před 4 měsíci +2

    Great content!....pls make a video on code deployment!

  • @DavidWoodMusic
    @DavidWoodMusic Před 4 měsíci +3

    My interview is in 9 hours
    I hear your voice in my sleep
    I have filled a notebook with diagrams and concepts
    And I am taking a poopy at this very second
    We just prevail

    • @jordanhasnolife5163
      @jordanhasnolife5163  Před 4 měsíci +2

      Just imagine me doing ASMR as I tell you about my day in a life

    • @bezimienny5
      @bezimienny5 Před 2 měsíci +2

      Yo how did it go? Are you in your dream team? I sure hope so

    • @DavidWoodMusic
      @DavidWoodMusic Před 2 měsíci

      @@bezimienny5 thanks friend. Offer was made but I turned it down. Turned out to be a really poor fit.

    • @bezimienny5
      @bezimienny5 Před 2 měsíci +1

      @@DavidWoodMusic oh damn. That's a Shame. I'm kinda struggling with a similar decision right now. I passed all the interview stages but even at the offer stage I'm still learning new key pieces of info about the position that no one told me about before....
      But hey, you beat the systems design interview! That's an amazing win and now you know you can do it 😉

  • @sohansingh2022
    @sohansingh2022 Před 4 měsíci +2

    Thanks

  • @SunilKumar-qh2ln
    @SunilKumar-qh2ln Před 4 měsíci +1

    Very informative video as always.
    Was just thinking how the metric is pulled by prometheus (which will eventually store in the DB).
    How the different clients responsibility is assigned to the aggregator pods so that metric is pulled exactly once from each client pods.

    • @jordanhasnolife5163
      @jordanhasnolife5163  Před 4 měsíci

      I'm not too familiar with prometheus personally, feel free to expand on what you're mentioning here!

  • @deepitapai2269
    @deepitapai2269 Před 24 dny +1

    Great video as always! Why do you store the files on S3 as well as a data warehouse? Why not just store on the data warehouse directly from Parquet files? Is it that we need a Spark consumer to transform the S3 files before putting the data into the data warehouse?

    • @jordanhasnolife5163
      @jordanhasnolife5163  Před 23 dny

      Depends on the format of the S3 data. If it's unstructured, then we'd likely need some additional ETL job to format it and load it into a data warehouse.

  • @mukundkrishnatrey3308
    @mukundkrishnatrey3308 Před měsícem +1

    Hi Jordan,
    Regarding the post processing of unstructured data, can we do the batch processing in Flink itself, as it does support that, or it's not suitable for large scale of data?
    What could be the size of data which can dealt by flink itself, after which we might need to use HDFS/ Spark?
    PS :- Thanks for the amazing content, you're the best resource I've found till date for system design content :)

    • @jordanhasnolife5163
      @jordanhasnolife5163  Před měsícem +1

      Flink isn't bounded in the amount of data it can handle, you can always add more nodes. The difference is that flink is for stream processing. Feel free to watch the flink concepts video, it may give you a better sense of what I mean here.

    • @mukundkrishnatrey3308
      @mukundkrishnatrey3308 Před měsícem +1

      @@jordanhasnolife5163 Okay, got it now, thanks a lot again!

  • @shibhamalik1274
    @shibhamalik1274 Před 3 měsíci +1

    Hey jordan what is the data source in the last diagram here ? Is it the VM pushing logs / serialised Java objects etc to kafka ?? U mean the application when it logs a statement that statement makes a push to kafka ?
    Then what should be the partition key of this kafka cluster ? Should it be server id or a combination of server id + app name or how should be we structure this partition key ?

    • @jordanhasnolife5163
      @jordanhasnolife5163  Před 3 měsíci

      Yes the application is pushing to Kafka. I think that you should probably use the app/microservice name as the Kafka topic, and then within that partition by server ID in kafka

  • @nithinsastrytellapuri291
    @nithinsastrytellapuri291 Před 3 měsíci +1

    Hi Jordan, I am trying to cover infrastructure-based system design questions like this one first. Can you please clarify if I need to watch video 11, 12, 13 to understand this? Any prerequistes ?( I have covered concepts 2.0). Is it same for 17, 18, 19 videos as well?

  • @shibhamalik1274
    @shibhamalik1274 Před 3 měsíci +1

    Hey Jordan do u have a video on pull vs push based models of consumption ? I blv kafka is pull based but I want to understand who uses push based

    • @jordanhasnolife5163
      @jordanhasnolife5163  Před 3 měsíci

      Nothing regarding which message brokers do push based messages, feel free to Google it and report back

    • @shibhamalik1274
      @shibhamalik1274 Před 3 měsíci +1

      Ok
      @jordan. I think kafka is push based not pull based .
      Pull based could be custom implemented I think …

  • @31737
    @31737 Před 2 měsíci +1

    hey Jordan great video, does this require any sort of API design? given that we need to read through the data metrics does it makes sense to also describe the API structure, let me know your thoughts, thanks.

    • @jordanhasnolife5163
      @jordanhasnolife5163  Před měsícem

      Sure. You need an endpoint to read your metrics by time range, and it probably returns paginated results. (Perhaps taking in a list of servers)
      Anything else you're looking for?

    • @31737
      @31737 Před měsícem +1

      @@jordanhasnolife5163 right also for elastic search result you gonna need an API unless you wanna combine it with metrics which I don't think it's a good idea

    • @31737
      @31737 Před měsícem +1

      Also a request for making a video for tracking autonomous cars + collecting other metrics sensors/etc, thanks man your work is gold and I love the depth them

  • @timavilov8712
    @timavilov8712 Před 4 měsíci +3

    U forgot to mention the tradeoff between polling and pushing for event producers

    • @timavilov8712
      @timavilov8712 Před 4 měsíci +2

      Great video tho !

    • @jordanhasnolife5163
      @jordanhasnolife5163  Před 4 měsíci

      I'm assuming you mean event consumers not producers. Yeah this is one of those things where it's kinda built into the stream processing consumer that you use, so under the hood I assume we'll be using long polling.
      I don't know that I see the case made here for web sockets since we don't need bidirectional communication. Server sent events may also be not great because we'll try to re-establish connections automatically, which may not be what we want if we rebalance our kafka partitions.

  • @shibhamalik1274
    @shibhamalik1274 Před 3 měsíci +1

    Hey jordan nice video do u have any video on which databases support cdc and how ?

    • @jordanhasnolife5163
      @jordanhasnolife5163  Před 3 měsíci

      I think you can figure out a way to make it work on basically any of the major ones, don't have a video on it though

  • @bimalgupta3648
    @bimalgupta3648 Před 3 měsíci +1

    Watching this while taking a dump

  • @aryanpandey7835
    @aryanpandey7835 Před 4 měsíci +1

    sir please share slides with us

  • @zuowang5185
    @zuowang5185 Před 2 měsíci +1

    do these new videos replace the old? czcams.com/video/_KoiMoZZ3C8/video.html

  • @user-se9zv8hq9r
    @user-se9zv8hq9r Před 4 měsíci +2

    can we design onlyfans or fansly